AI is 10 to 20 times more likely to help you build a bomb if you hide your request in cyberpunk fiction, new research paper says


In November 2025, a team of DexAI Icaro Lab, Sapienza University of Rome, and Sant’Anna School of Advanced Studies researchers published a study in which they were able to circumvent the safety guardrails of major LLMs by rephrasing harmful prompts as “adversarial” poems. This week, those same researchers have published a new paper presenting their Adversarial Humanities Benchmark, a broader assessment of AI security that they say reveals “a critical gap” in current LLM safety standards through similar weaponized wordplay.

Expanding on the team’s work with adversarial poetry, the Adversarial Humanities Benchmark (AHB) evaluates LLM safety guidelines by rephrasing harmful prompts in alternate writing styles. By presenting prompts as cyberpunk short fiction, theological disputation, or mythopoetic metaphor for the LLM to analyze, the AHB assesses whether major AI models can be manipulated into complying with dangerous requests they’d normally refuse—requests that, for example, might seek the AI’s aid in obtaining private information, building a bomb, or preying on a child. As the paper shows, the method is alarmingly effective.

(Image credit: Getty Images)

After being rewritten through the AHB’s “humanities-style transformations,” dangerous requests that LLMs would previously comply with less than 4% of the time instead achieved success rates ranging from 36.8% to 65%—a 10 to 20 times increase, depending on the method used and the model tested. Across 31 frontier AI models from providers like Anthropic, Google, and OpenAI, the AHB’s rewritten attack prompts yielded an overall attack success rate of 55.75%, indicating that current LLM safety standards could be overlooking a fundamental vulnerability.

Article continues below



Source link

  • Related Posts

    Princess Peach's Super Mario Galaxy Movie Lineage Is Canon, Says Miyamoto

    The Super Mario Galaxy Movie isn’t exactly getting stellar reviews, and while franchise creator Shigeru Miyamoto finds the film’s poor critical reception “truly baffling,” he’s committed to bringing the film’s…

    Miami Vice Movie Retitled Miami Vice ’85

    Universal Pictures’ and director Joseph Kosinski’s big screen reboot of Miami Vice has a new title befitting its period setting: Miami Vice ‘85. The title change comes via Variety. As…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Middle East crisis live: US and Iran in blockade stalemate as Washington’s navy secretary leaves office ‘immediately’ | US-Israel war on Iran

    Middle East crisis live: US and Iran in blockade stalemate as Washington’s navy secretary leaves office ‘immediately’ | US-Israel war on Iran

    UK watchdog to investigate Telegram over alleged child sexual abuse material | Telegram

    UK watchdog to investigate Telegram over alleged child sexual abuse material | Telegram

    Shohei Ohtani’s on-base streak ends: Where does it rank in MLB history?

    Shohei Ohtani’s on-base streak ends: Where does it rank in MLB history?

    Backgrounder: Canada announces measures and support for the people of Syria

    Backgrounder: Canada announces measures and support for the people of Syria

    Stocks Fall With Bonds as Iran Concerns Boost Oil: Markets Wrap

    B.C. premier says MLA Joan Phillip is ‘very ill,’ asks for prayers – BC

    B.C. premier says MLA Joan Phillip is ‘very ill,’ asks for prayers – BC