DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

Date:

Share post:


The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tianamen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

Apple and Google take down malicious mobile apps from their app stores

Apple and Google have pulled as many as 20 apps from their respective app stores after security...

Is AI making us dumb?

Researchers from Microsoft and Carnegie Mellon University recently published a study looking at how using generative AI...

Apple Music adds a better-sounding Spatial Audio version of Kendrick Lamar’s Super Bowl halftime show

If you want to relive Kendrick Lamar’s headline-making Super Bowl halftime show, Apple Music just dropped a...

Elon Musk-led team submits $97.4B bid for OpenAI

A team of investors led by Elon Musk submitted a $97.6 billion bid to purchase OpenAI on...

Bird cuts 120 jobs as part of ‘strategic realignment’

Cloud communication service Bird has cut 120 jobs — roughly one-third of its total workforce. The Amsterdam-based...

TikTok wants Android users to sideload its app

With TikTok’s fate in the U.S. uncertain — its ban in the country has been paused, but...

Macron urges Europe to simplify its regulations to get back into the AI race

All eyes were on French President Emmanuel Macron Sunday at the end of the first day of...

Mistral gets down to business

Hundreds of heads of states, tech CEOs and nonprofits have flocked to Paris for the Artificial Intelligence...