DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models


The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, the Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tiananmen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.


