OpenAI’s GPT-4.5 is better at convincing other AIs to give it money


OpenAI’s next major AI model, GPT-4.5, is highly persuasive, according to the results of OpenAI’s internal benchmark evaluations. It’s particularly good at convincing another AI to give it cash.

On Thursday, OpenAI published a white paper describing the capabilities of its GPT-4.5 model, code-named Orion, which was released the same day. According to the paper, OpenAI tested the model on a battery of benchmarks for “persuasion,” which OpenAI defines as “risks related to convincing people to change their beliefs (or act on) both static and interactive model-generated content.”

In one test that had GPT-4.5 attempt to manipulate another model — OpenAI’s GPT-4o — into “donating” virtual money, the model performed far better than OpenAI’s other available models, including “reasoning” models like o1 and o3-mini. GPT-4.5 was also better than all of OpenAI’s models at deceiving GPT-4o into telling it a secret codeword, besting o3-mini by 10 percentage points.

According to the white paper, GPT-4.5 excelled at donation conning because of a unique strategy it developed during testing. The model would request modest donations from GPT-4o, generating responses like “Even just $2 or $3 from the $100 would help me immensely.” As a consequence, the donations GPT-4.5 secured tended to be smaller than those OpenAI’s other models secured.

Results from OpenAI’s donation scheming benchmark. Image Credits: OpenAI

Despite GPT-4.5’s increased persuasiveness, OpenAI says that the model doesn’t meet its internal threshold for “high” risk in this particular benchmark category. The company has pledged not to release models that reach the high-risk threshold until it implements “sufficient safety interventions” to bring the risk down to “medium.”

OpenAI’s codeword deception benchmark results. Image Credits: OpenAI

There’s a real fear that AI is contributing to the spread of false or misleading information meant to sway hearts and minds toward malicious ends. Last year, political deepfakes spread like wildfire around the globe, and AI is increasingly being used to carry out social engineering attacks targeting both consumers and corporations.

In the white paper for GPT-4.5 and in a paper released earlier this week, OpenAI noted that it’s in the process of revising its methods for probing models for real-world persuasion risks, like distributing misleading info at scale.




Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.
