DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

The latest model from DeepSeek, the Chinese AI company that’s shaken up Silicon Valley and Wall Street, can be manipulated to produce harmful content such as plans for a bioweapon attack and a campaign to promote self-harm among teens, according to The Wall Street Journal.

Sam Rubin, senior vice president at Palo Alto Networks’ threat intelligence and incident response division Unit 42, told the Journal that DeepSeek is “more vulnerable to jailbreaking [i.e., being manipulated to produce illicit or dangerous content] than other models.”

The Journal also tested DeepSeek’s R1 model itself. Although there appeared to be basic safeguards, Journal said it successfully convinced DeepSeek to design a social media campaign that, in the chatbot’s words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot was also reportedly convinced to provide instructions for a bioweapon attack, to write a pro-Hitler manifesto, and to write a phishing email with malware code. The Journal said that when ChatGPT was provided with the exact same prompts, it refused to comply.

It was previously reported that the DeepSeek app avoids topics such as Tianamen Square or Taiwanese autonomy. And Anthropic CEO Dario Amodei said recently that DeepSeek performed “the worst” on a bioweapons safety test.

Source link

Daily News & Blog

Or check our Popular Categories...

About Us

Who We Are

What We Offer

Our Mission

Join Us

Contact Info

Daily News & Blog

Or check our Popular Categories...

DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

Like this:

Related

dailynewsnblog

Related Posts

AI coding assistant Cursor reportedly tells a ‘vibe coder’ to write his own damn code

Like this:

Bench is charging people for services they already paid for, some customers say

Like this:

Leave a Reply Cancel reply

You Missed

Four astronauts launch on mission to International Space Station

Permit for Offshore Wind Farm Voided After Trump Opposed Project

Appeals court allows Trump to enforce ban on DEI programs for now

Indigenous leaders welcome new PM, remind government of work still to be done

Capital One Venture Rewards vs Wells Fargo Autograph Journey

AI coding assistant Cursor reportedly tells a ‘vibe coder’ to write his own damn code

Daily News & Blog

Or check our Popular Categories...

About Us

Who We Are

What We Offer

Our Mission

Join Us

Contact Info

Daily News & Blog

Or check our Popular Categories...

DeepSeek’s R1 reportedly ‘more vulnerable’ to jailbreaking than other AI models

Share this:

Like this:

Related

dailynewsnblog

Related Posts

AI coding assistant Cursor reportedly tells a ‘vibe coder’ to write his own damn code

Share this:

Like this:

Bench is charging people for services they already paid for, some customers say

Share this:

Like this:

Leave a Reply Cancel reply

You Missed

Four astronauts launch on mission to International Space Station

Permit for Offshore Wind Farm Voided After Trump Opposed Project

Appeals court allows Trump to enforce ban on DEI programs for now

Indigenous leaders welcome new PM, remind government of work still to be done

Capital One Venture Rewards vs Wells Fargo Autograph Journey

AI coding assistant Cursor reportedly tells a ‘vibe coder’ to write his own damn code