AI Risks – Marginal REVOLUTION


Two new papers/initiatives indicate severe risks from AI, interestingly in opposite directions. The first is that the most advanced frontier models are now capable of finding and exploiting software in ways that could be used to crash or control pretty much all the world’s major systems.

Anthropic: We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos2 Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe. Project Glasswing is an urgent attempt to put these capabilities to work for defensive purposes.

That’s from Anthropic. The irony is that the company that has developed a frontier model capable of infiltrating and undermining more or less any computer system in the world is the one that has been forbidden from working with the US government. It’s as if a private firm developed nuclear weapons and the American government refused to work with them because they were too woke. Okey dokey.

The second paper on AI risks is AI Agent Traps from Google DeepMind. They point out that AI agents on the web are vulnerable to all kinds of attacks from things like text in html never read by humans, hidden commands in pdfs, commands encoded in the pixels of images using steganography and so forth.

Putting this together we have the worrying combination that very powerful AI’s are very vulnerable. Will AI solve the problems of AI? Eventually the software will be made secure but weird things happen in arms races and its going to be a bump ride.




Source link

  • Related Posts

    Ukraine Hits St. Petersburg as ‘Putin’s Davos’ Gets Underway

    IE 11 is not supported. For an optimal experience visit our site on another browser. UP NEXT Race for Mayor of Los Angeles Heads to Runoff in November 00:41 2026…

    Meta, Microsoft, Amazon, and Alphabet are about to spend a shocking amount of money to dominate the AI era

    The artificial intelligence spending for Big Tech has only just begun. The news: Goldman Sachs strategist Amanda Lynam has put some fresh numbers on hyperscaler capex spending on AI, and…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Carolina Caroline Review

    Carolina Caroline Review

    Ukraine Hits St. Petersburg as ‘Putin’s Davos’ Gets Underway

    Ukraine Hits St. Petersburg as ‘Putin’s Davos’ Gets Underway

    Middle East crisis live: Trump claims Iranian supreme leader is involved in US negotiations | US-Israel war on Iran

    Middle East crisis live: Trump claims Iranian supreme leader is involved in US negotiations | US-Israel war on Iran

    Macy’s raises annual outlook after the fourth straight quarter of sales gains

    Macy’s raises annual outlook after the fourth straight quarter of sales gains

    Why Richard Nixon torpedoed the global monetary system

    Carney set to be back in the House of Commons as MPs prepare to vote on budget bill

    Carney set to be back in the House of Commons as MPs prepare to vote on budget bill