The Meta hack shows there’s more to AI security than Mythos


Gong and other scholars have been issuing warnings about the security vulnerabilities of AI agents for a while. They publish papers and blog posts detailing exploits such as indirect prompt injection, which involves hijacking agents using commands hidden in websites, emails, or other seemingly anodyne data sources. Compared with these techniques, the Meta hack was practically mindless. The only complication that hackers had to overcome was using a VPN that matched the true account owner’s location; then they directly asked the support agent to change the account’s email address, and it complied.

Meta has not commented publicly on how this vulnerability slipped through the cracks. But given the simplicity of the exploit, Gong says, it should have been uncovered easily, before the agent was deployed. “It’s really surprising,” he says. “I don’t understand why they didn’t find this simple problem.”

Jessica Ji, a senior research analyst at Georgetown’s Center for Security and Emerging Technology, agrees. “It raises questions like: Were there even guardrails in place?” she says. “Did anyone think to test for this kind of scenario?” She notes that the oversight is particularly striking coming from a company like Meta, which has extensive expertise in both AI and cybersecurity. Meta did not respond to a request for comment for this article, but on Monday a Meta spokesperson said on X that the vulnerability had been resolved.

As embarrassing a moment as this might be for Meta in particular, it also highlights some core vulnerabilities shared by all AI agents. Unlike traditional software, agents can respond in flexible—and unexpected—ways to new circumstances, which is why they might be able to substitute for human customer support agents. But AI agents can also be tricked in ways that humans wouldn’t be, and because they can take real-world actions, those mistakes have consequences. “A human would say, ‘Okay, why do you want to change the email address?’ and maybe respond with a security question,” says Somesh Jha, a professor of computer science at the University of Wisconsin–Madison. “What is going on with these agents is they’re very eager to finish the task. It’s almost like some elementary school student who just wants to please the teacher.”

There are ways to mitigate the risks. Companies can use traditional software to build guardrails that make sure agents follow strict rules, such as always asking for answers to security questions before sending sensitive account information to a new email address. And the experts consulted for this article all agree that agents should undergo rigorous red-teaming, a process in which developers try their best to attack a system in order to discover its vulnerabilities before it is deployed.



Source link

  • Related Posts

    Google Wants Android 17 to Excite the Rich. What About the Rest of Us?

    Google appears to think I’m a much wealthier, sexier man than I really am. Thanks, I guess? That’s the impression I got from the company during its 2026 Android Show.…

    My SSN was exposed in a breach at Columbia—a school I have no connection with

    I asked the College Board if this theory could be true. A spokesperson disputed that any student’s SSN would have been shared with Columbia via an opt-in program called “Student…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Karen Read reveals decision behind new lawsuit against city and police

    Karen Read reveals decision behind new lawsuit against city and police

    Are Billy Bishop airport lobbyists breaking city rules?

    Are Billy Bishop airport lobbyists breaking city rules?

    Employers added 172,000 jobs in May as labor market continues to blow past expectations

    Employers added 172,000 jobs in May as labor market continues to blow past expectations

    Google Wants Android 17 to Excite the Rich. What About the Rest of Us?

    Google Wants Android 17 to Excite the Rich. What About the Rest of Us?

    Joint statement on situation between Hezbollah and Israel

    Joint statement on situation between Hezbollah and Israel

    Canada’s unemployment rate drops to 6.6% as economy gains 88,000 jobs