Reddit has reportedly signed over its content to train AI models

February 21, 2024

Reddit is poised to become a significant player in fueling AI innovation, with reports indicating that the popular social media platform has struck a content licensing deal to allow its data to be utilized for training AI models.

In anticipation of its potential $5 billion IPO debut in March, Bloomberg revealed that Reddit has secured a $60 million agreement with a major AI company, positioning itself as a lucrative avenue for investors interested in the platform’s AI potential.

The agreement entails granting access to Reddit posts, spanning from the most popular subreddits to the contributions of both active users and passive lurkers, thereby enabling the augmentation of existing Language Model Models (LLMs) or laying the groundwork for future generative AI endeavors. However, this move is not without controversy, as it raises concerns among Reddit’s user base regarding the platform’s direction and the ethical implications of leveraging public data for AI development.

Reddit’s decision comes amid ongoing tensions between the platform and its users, exacerbated by previous business decisions. Last year, Reddit’s announcement of plans to monetize access to its APIs prompted widespread backlash, leading to the shutdown of numerous Reddit forums in protest. The platform’s subsequent crash and threats from a group of hackers to release stolen site data further fueled discontent among users.

In response, Reddit implemented various changes, including the removal of years’ worth of private chat logs and messages, ostensibly to prepare for a new chat infrastructure. Additionally, measures such as the introduction of an “official” badge to distinguish genuine accounts from impersonators and the implementation of automatic moderation features were rolled out. However, these efforts did little to quell user dissatisfaction, particularly after the removal of the option to disable ad personalization in September.

The recent AI deal has reignited debates surrounding the ethical use of public data and human-created content to train AI. While the potential for AI innovation is significant, concerns about privacy, data ownership, and the commodification of user-generated content persist. Reddit’s decision to monetize its data for AI development may further alienate its user base and exacerbate existing tensions between the platform and its community.

As Reddit continues to navigate the intersection of technology, business, and ethics, the outcome of this AI deal remains uncertain. However, it underscores the complex challenges inherent in leveraging user-generated content for commercial purposes, highlighting the need for transparency, accountability, and ethical considerations in AI development.

Reddit has reportedly signed over its content to train AI models

Your Personalized Playlist for Migrating Your POS to the Cloud

The EU officially opens up an investigation into TikTok

How to try Sora, OpenAI’s AI video generator

Elon Musk announced support for OpenAI firing Sam Altman. Here’s why.

X / Twitter launches video and audio calls — here’s how to use ‘em

Microsoft offers legal protection for users with AI copyright infringements

Get in Touch

IT Solutions

Design

Development

© 2023 9tek. All Rights Reserved.