AI Risks – Marginal REVOLUTION


Two new papers/initiatives indicate severe risks from AI, interestingly in opposite directions. The first is that the most advanced frontier models are now capable of finding and exploiting software in ways that could be used to crash or control pretty much all the world’s major systems.

Anthropic: We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos2 Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe. Project Glasswing is an urgent attempt to put these capabilities to work for defensive purposes.

That’s from Anthropic. The irony is that the company that has developed a frontier model capable of infiltrating and undermining more or less any computer system in the world is the one that has been forbidden from working with the US government. It’s as if a private firm developed nuclear weapons and the American government refused to work with them because they were too woke. Okey dokey.

The second paper on AI risks is AI Agent Traps from Google DeepMind. They point out that AI agents on the web are vulnerable to all kinds of attacks from things like text in html never read by humans, hidden commands in pdfs, commands encoded in the pixels of images using steganography and so forth.

Putting this together we have the worrying combination that very powerful AI’s are very vulnerable. Will AI solve the problems of AI? Eventually the software will be made secure but weird things happen in arms races and its going to be a bump ride.




Source link

  • Related Posts

    AX Coin, Backed by AXG, Granted First Stablecoin Issuer License by the Central Bank of Bahrain

    MANAMA, Bahrain, June 03, 2026 (GLOBE NEWSWIRE) — SOLOWIN HOLDINGS (Nasdaq: AXG) (“AXG” or the “Company”), a leading financial technology firm bridging traditional and digital assets, today announced that AX…

    Florida pastor and son’s $8 million Covid fraud case comes to bizarre end

    It was one of the strangest Covid fraud cases brought by the Justice Department, with the kind of wild details that seemed ripped from a Hollywood script. Subscribe to read…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    What are the plans for Liverpool Women's Hospital?

    What are the plans for Liverpool Women's Hospital?

    Statement on release of Foundations for Peace: Canada’s National Action Plan on Women, Peace and Security – 2023 to 2029

    Statement on release of Foundations for Peace: Canada’s National Action Plan on Women, Peace and Security – 2023 to 2029

    AX Coin, Backed by AXG, Granted First Stablecoin Issuer License by the Central Bank of Bahrain

    Most Canadians unaware of 2027 MAID expansion for mental illness

    Break the Rules in Escape from Violet Hold, Hearthstone’s Next Expansion

    Break the Rules in Escape from Violet Hold, Hearthstone’s Next Expansion

    easyJet’s Massive Expansion: 15 New Routes Launching This Month [Map]

    easyJet’s Massive Expansion: 15 New Routes Launching This Month [Map]