AI Risks – Marginal REVOLUTION


Two new papers/initiatives indicate severe risks from AI, interestingly in opposite directions. The first is that the most advanced frontier models are now capable of finding and exploiting software in ways that could be used to crash or control pretty much all the world’s major systems.

Anthropic: We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos2 Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe. Project Glasswing is an urgent attempt to put these capabilities to work for defensive purposes.

That’s from Anthropic. The irony is that the company that has developed a frontier model capable of infiltrating and undermining more or less any computer system in the world is the one that has been forbidden from working with the US government. It’s as if a private firm developed nuclear weapons and the American government refused to work with them because they were too woke. Okey dokey.

The second paper on AI risks is AI Agent Traps from Google DeepMind. They point out that AI agents on the web are vulnerable to all kinds of attacks from things like text in html never read by humans, hidden commands in pdfs, commands encoded in the pixels of images using steganography and so forth.

Putting this together we have the worrying combination that very powerful AI’s are very vulnerable. Will AI solve the problems of AI? Eventually the software will be made secure but weird things happen in arms races and its going to be a bump ride.




Source link

  • Related Posts

    Mortgage applications fall, thanks to higher rates

    Mortgage applications are falling, and weeks of rising rates are likely to blame. Combined refinancing and purchase mortgage applications were down 0.8% through Friday, according to Mortgage Bankers Association data.…

    Harvard scientist’s visa was unlawfully canceled, judge finds

    A federal judge in Vermont ruled that Harvard scientist Kseniia Petrova’s visa was unlawfully canceled after she was detained at an airport over biological samples she was carrying, handing her…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Poilievre’s communications director Katy Merrifield resigns

    Poilievre’s communications director Katy Merrifield resigns

    Mortgage applications fall, thanks to higher rates

    Mortgage applications fall, thanks to higher rates

    More than 50 young asylum seekers have died in UK since 2015, data shows | Immigration and asylum

    More than 50 young asylum seekers have died in UK since 2015, data shows | Immigration and asylum

    Henry Brookes: Gloucestershire sign Middlesex seamer on loan

    Henry Brookes: Gloucestershire sign Middlesex seamer on loan

    7 Airlines With The World’s Most Comfortable Long-Haul Economy Seats

    7 Airlines With The World’s Most Comfortable Long-Haul Economy Seats

    The best carry-on luggage in the UK, tested on an assault course | Travel

    The best carry-on luggage in the UK, tested on an assault course | Travel