In the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategy


OpenAI on Tuesday announced the next phase of its cybersecurity strategy and a new model specifically designed for use by digital defenders, GPT-5.4-Cyber.

The news comes in the wake of an announcement last week by competitor Anthropic that its new Claude Mythos Preview model is only being privately released for now—because, the company says, it could be exploited by hackers and bad actors. Anthropic also announced an industry coalition, including competitors like Google, focused on how advances in generative AI across the field will impact cybersecurity.

OpenAI appeared to be differentiating its message on Tuesday by striking a less catastrophic tone, touting its existing guardrails and defenses while hinting at the need for more advanced protections in the long term.

“We believe the class of safeguards in use today sufficiently reduce cyber risk enough to support broad deployment of current models,” the company wrote in a blog post. “We expect versions of these safeguards to be sufficient for upcoming more powerful models, while models explicitly trained and made more permissive for cybersecurity work require more restrictive deployments and appropriate controls. Over the long term, to ensure the ongoing sufficiency of AI safety in cybersecurity, we also expect the need for more expansive defenses for future models, whose capabilities will rapidly exceed even the best purpose-built models of today.”

The company says that it has homed in on three pillars for its cybersecurity approach. The first involves so-called “know your customer” validation systems to allow controlled access to new models that is as broad and “democratized” as possible. “We design mechanisms which avoid arbitrarily deciding who gets access for legitimate use and who doesn’t,” the company wrote on Tuesday. OpenAI is pairing an approach in which it partners with certain organizations on limited releases with an automated system, introduced in February, known as Trusted Access for Cyber, or TAC.

The second component of the strategy involves “iterative deployment,” or a process of “carefully” releasing and then refining new capabilities so the company can get real-world insight and feedback. The blog post particularly highlights “resilience to jailbreaks and other adversarial attacks, and improving defensive capabilities.” Finally, the third focus is on investments that the company says support software security and other digital defense as generative AI proliferates.

OpenAI says that the initiative fits into its broader security efforts, including an application security AI agent launched last month known as Codex Security, a cybersecurity grants program that began in 2023, a recent donation to the Linux Foundation to support open source security, and the “Preparedness Framework” that is meant to assess and defend against “severe harm from frontier AI capabilities.”

Anthropic’s claims last week that more capable AI models necessitate a cybersecurity reckoning have been controversial among security experts. Some say the concern is overstated and could feed a new wave of anti-hacker sentiment—consolidating power even more with tech giants. Others, though, emphasize that vulnerabilities and shortcomings in current security defenses are well known and really could be exploited with new speed and intensity by an even broader range of bad actors in the age of agentic AI.


