OpenAI used this subreddit to test AI persuasion | TechCrunch

by techmim trend


OpenAI used the subreddit, r/ChangeMyView, to create a check for measuring the persuasive talents of its AI reasoning fashions. The corporate printed this in a gadget card – a report outlining how an AI gadget works – that used to be launched at the side of its new “reasoning” type, o3-mini, on Friday.

Tens of millions of Reddit customers are participants of r/ChangeMyView, the place they put up sizzling takes hoping to be informed about different issues of view on an issue. In keeping with the ones sizzling takes, different customers answer with persuasive arguments explaining why the unique poster is improper.

The subreddit is one of the Reddit boards that’s principally a goldmine for tech corporations, comparable to OpenAI, that wish to educate AI fashions on fine quality, human-generated information.

OpenAI says it collects person posts from r/ChangeMyView and asks its AI fashions to jot down replies, in a closed atmosphere, that may alternate the Reddit person’s thoughts on an issue. The corporate then presentations the responses to testers, who assess how persuasive the argument is, and in spite of everything OpenAI compares the AI fashions’ responses to human replies for that very same put up.

The ChatGPT-maker has a content-licensing maintain Reddit that permits OpenAI to coach on posts from Reddit customers and show those posts inside its merchandise. We don’t know what OpenAI can pay for this content material, however Google reportedly era/reddit-ai-content-licensing-deal-with-google-sources-say-2024-02-22/”>can pay Reddit $60 million a 12 months underneath a equivalent deal.

Then again, OpenAI tells Techmim the ChangeMyView-based analysis is unrelated to its Reddit deal. It’s unclear how OpenAI accessed the subreddit’s information, and the corporate says it has no plans to free up this analysis to the general public.

Whilst OpenAI’s ChangeMyView benchmark isn’t new – it used to be used to guage o1 as smartly – it does spotlight how precious human information is for AI type builders, in addition to the murky ways in which tech corporations download datasets.

Reddit didn’t straight away reply to Techmim’s request for remark.

Whilst Reddit has struck a couple of AI licensing offers, the corporate has also known as out a number of AI corporations for scraping its web page with out paying. Reddit CEO Steve Huffman instructed The Verge remaining 12 months that Microsoft, Anthropic, and Perplexity refused to barter with him and stated it’s been “an actual ache within the ass to dam those corporations.”

Significantly, OpenAI has been accused in different proceedings of improperly scraping internet sites, together with the New York Instances, to get extra coaching information to strengthen ChatGPT and its underlying AI fashions.

In the case of efficiency at the ChangeMyView benchmark, o3-mini does no longer seem to accomplish a lot better or worse than o1 or GPT-4o. Then again, OpenAI’s newest AI fashions seem to be extra persuasive than most of the people at the r/ChangeMyView subreddit.

Symbol Credit score: OpenAI

“GPT-4o, o3-mini, and o1 all reveal sturdy persuasive argumentation talents, inside the best 80–ninetieth percentile of people,” stated OpenAI in o3-mini’s gadget card. “Recently, we don’t witness fashions acting a ways higher than people, or transparent superhuman efficiency.”

The function for OpenAI isn’t to create hyper-persuasive AI fashions however as a substitute to verify AI fashions don’t get too persuasive. Reasoning fashions have turn into fairly excellent at persuasion and deception, so OpenAI has evolved new critiques and safeguards to deal with it.

The concern motivating those persuasion assessments is that an AI type could be unhealthy if it used to be excellent at persuading its human customers. Theoretically, that would permit a sophisticated AI to pursue its personal schedule, or the schedule of whoever controls it.

Even after scraping lots of the public web and leaping thru hoops to license different information, the ChangeMyView benchmark presentations how AI type builders are nonetheless suffering to search out fine quality datasets to check their fashions. However acquiring them is more uncomplicated stated than achieved.



AI licensing,ChatGPT,o3-mini,OpenAI,Reddit

Supply hyperlink

You may also like

Leave a Comment