Home Tech OpenAI strikes deal to train AI on Reddit data

OpenAI strikes deal to train AI on Reddit data

by Editorial Staff
0 comment

OpenAI has reached a take care of Reddit to make use of the social information web site’s information to coach synthetic intelligence fashions.

In a weblog publish on OpenAI’s press relations web site, the corporate mentioned the Reddit partnership would give it entry to “structured and distinctive real-time content material” — resembling posts and replies — from Reddit, permitting its instruments and fashions to “higher perceive and showcase’ this content material. Reddit content material shall be built-in into ChatGPT, OpenAI’s common conversational synthetic intelligence, and the businesses will work collectively to supply unspecified new “AI-powered options” to Reddit customers and moderators.

OpenAI can even change into an promoting associate of Reddit.

“Reddit will depend on OpenAI’s synthetic intelligence modeling platform to appreciate its highly effective imaginative and prescient,” OpenAI wrote in a press release. “Utilizing LLM, ML and AI permits Reddit to enhance the person expertise for everybody.”

OpenAI has a number of comparable licensing agreements with content material suppliers, from inventory media libraries to information publishers. However the uncommon angle is that Sam Altman, CEO of OpenAI, owns 8.7% of Reddit, making him the third-largest shareholder, and was as soon as a member of the corporate’s board of administrators.

In an try to deflect scrutiny, OpenAI says in its press launch that whereas Altman stays a Reddit shareholder, the partnership was “managed by OpenAI’s COO [Brad Lightcap]” and “accredited [OpenAI’s] impartial board of administrators’. (I word that Altman is a board member of OpenAI; nevertheless, an OpenAI spokesperson instructed TechCrunch that he recused himself from the choice.)

Reddit is making information licensing agreements an more and more central a part of its progress technique because it navigates the market as a public firm.

In its IPO prospectus, Reddit revealed that it has contractual agreements to license its information to clients, together with Google, totaling greater than $200 million. And in its first earnings report as a public firm, Reddit reported a 450% year-over-year enhance in non-advertising income, largely resulting from these offers.

Reddit shares rose 11% in prolonged buying and selling after the OpenAI deal was introduced.

“The paradox I see is that as extra content material on the internet is written by machines, the premium for content material that comes from actual folks is rising,” Reddit CEO Steve Huffman mentioned through the firm’s earnings name in March. “And we have had virtually twenty years of actual dialog.”

With greater than 1 billion posts and greater than 16 billion feedback, numbers that develop day-after-day because of tons of of tens of millions of energetic customers, Reddit is a gold mine for AI firms whose fashions study from examples of content material , resembling textual content and pictures, to create new comparable content material.

However the firm may face pushback from customers involved about the way it monetizes their information.

It is instructive to take a look at Stack Overflow, a question-and-answer discussion board for software program builders, which lately signed an settlement with OpenAI to supply information to coach the latter’s mannequin. In protest, some customers eliminated their top-rated solutions to questions locally. However Stack Overflow reinstated the deleted posts and banned these customers, claiming they didn’t adjust to its phrases of service.

Reddit has already expressed its displeasure with one try to present Reddit customers extra management over their information.

Vana, a startup constructed on blockchain, is making an attempt to launch an information “DAO” (digital autonomous group) to permit Reddit customers to pool their information and allow them to collectively determine how that pooled information is used (or bought). Reddit banned Vana, a subreddit devoted to the DAO dialogue, in a press release to TechCrunch and accused the corporate of “exploiting” controls over information exports.

We’re Launching AI Mail! Register right here to begin getting it in your mailboxes on June fifth.

Source link

You may also like

Leave a Comment

Our Company

DanredNews is here to give you the latest and trending news online

Newsletter

Subscribe my Newsletter for new blog posts, tips & new photos. Let's stay updated!

Laest News

© 2024 – All Right Reserved. DanredNews