AI Risks – Marginal REVOLUTION


Two new papers/initiatives indicate severe risks from AI, interestingly in opposite directions. The first is that the most advanced frontier models are now capable of finding and exploiting software in ways that could be used to crash or control pretty much all the world’s major systems.

Anthropic: We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos2 Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe. Project Glasswing is an urgent attempt to put these capabilities to work for defensive purposes.

That’s from Anthropic. The irony is that the company that has developed a frontier model capable of infiltrating and undermining more or less any computer system in the world is the one that has been forbidden from working with the US government. It’s as if a private firm developed nuclear weapons and the American government refused to work with them because they were too woke. Okey dokey.

The second paper on AI risks is AI Agent Traps from Google DeepMind. They point out that AI agents on the web are vulnerable to all kinds of attacks from things like text in html never read by humans, hidden commands in pdfs, commands encoded in the pixels of images using steganography and so forth.

Putting this together we have the worrying combination that very powerful AI’s are very vulnerable. Will AI solve the problems of AI? Eventually the software will be made secure but weird things happen in arms races and its going to be a bump ride.




Source link

  • Related Posts

    Hegseth says Trump chose ‘mercy’ in dealing with Iran

    IE 11 is not supported. For an optimal experience visit our site on another browser. Skip to Content news Alerts There are no new alerts at this time 00:31 Hegseth…

    How pricing in B.C. is becoming both art and algorithm

    Businesses in the province blend intuition and data as governments eye rules on personalized pricing Source link

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Hegseth says Trump chose ‘mercy’ in dealing with Iran

    Hegseth says Trump chose ‘mercy’ in dealing with Iran

    US carmakers accuse EU of blocking supersized pick-up trucks from roads

    No, Israel’s not targeting health workers in Lebanon

    The Best Automatic Litter Box of 2026: Petkit and Litter-Robot

    The Best Automatic Litter Box of 2026: Petkit and Litter-Robot

    How pricing in B.C. is becoming both art and algorithm

    How pricing in B.C. is becoming both art and algorithm

    Doha Diamond League postponed until June amid Middle East conflict

    Doha Diamond League postponed until June amid Middle East conflict