DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

Date:

Share post:


The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tianamen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

Anthropic CEO says spies are after $100M AI secrets in a ‘few lines of code’

Anthropic’s CEO Dario Amodei is worried that spies, likely from China, are getting their hands on costly...

Intel appoints Lip-Bu Tan as its next CEO

Intel has appointed Lip-Bu Tan, a major figure in the semiconductor industry, as CEO, the company announced...

FBI, EPA, and Treasury told Citibank to freeze funds as Trump administration tries to claw back climate money

Citibank revealed in court filings on Wednesday that the FBI, the EPA, the EPA Inspector General, and...

Browser User, one of the tools powering Manus, is also going viral

Manus, the viral AI “agent” platform from Chinese startup Butterfly Effect, has had an unintended side effect:...

UK competition probe of mobile browsers finds Apple-Google duopoly is ‘anti-innovation’

A U.K. competition authority investigation of Apple and Google’s mobile browsers has concluded that the mobile duopoly’s...

OpenStack comes to the Linux Foundation

Back in 2010, Rackspace and NASA launched a project called OpenStack, which was meant to become an...

Dapr’s microservices runtime now supports AI agents

Back in 2019, Microsoft open-sourced Dapr, a new runtime for making building distributed microservice-based applications easier. At...

Why Onyx thinks its open source solution will win enterprise search

Enterprises have troves of internal data and information that employees need to complete their tasks or answer...