Reddit wants to get paid for helping to teach big AI systems

reddit
(Photo: Freepik)
SAN FRANCISCO, United States — Reddit has long been a hot spot for conversation on the internet. About 57 million people visit the site every day to chat about topics as varied as makeup, video games, and pointers for power washing driveways.اضافة اعلان

In recent years, Reddit’s array of chats have also been a free teaching aid for companies like Google, OpenAI, and Microsoft. Those companies are using Reddit’s conversations in the development of giant artificial intelligence systems that many in Silicon Valley think are on their way to becoming the tech industry’s next big thing.

Now Reddit wants to be paid for it. The company said on Tuesday that it planned to begin charging companies for access to its application programming interface, or API, the method through which outside entities can download and process the social network’s vast selection of person-to-person conversations.
“The Reddit corpus of data is really valuable. But we don’t need to give all of that value to some of the largest companies in the world for free.”
“The Reddit corpus of data is really valuable,” Steve Huffman, founder and CEO of Reddit, said in an interview. “But we don’t need to give all of that value to some of the largest companies in the world for free.”

The move is one of the first significant examples of a social network’s charging for access to the conversations it hosts for the purpose of developing AI systems like ChatGPT, OpenAI’s popular program.

Data, data, dataTo keep improving their models, artificial intelligence makers need two significant things: an enormous amount of computing power and an enormous amount of data. Some of the biggest AI developers have plenty of computing power, but still look outside their own networks for the data needed to improve their algorithms. That has included sources like Wikipedia, millions of digitized books, academic articles, and Reddit.
Reddit believes its data is particularly valuable because it is continuously updated. That newness and relevance… is what large language modeling algorithms need to produce the best results.
Reddit believes its data is particularly valuable because it is continuously updated. That newness and relevance, Huffman said, is what large language modeling algorithms need to produce the best results.

Huffman said Reddit’s API would still be free to some developers and to researchers who want to study Reddit data for academic or noncommercial purposes.


Read more Technology
Jordan News