The missing step between hype and profit


Produced by Pause AI, an international activist group that co-organized the protest, it ended with this plea to the reader: “Pause AI until we know what the hell Step 2 is.” 

In the South Park episode “Gnomes,” which first aired in 1998, Kenny, Kyle, Cartman, and Stan discover a community of gnomes that sneak out at night to steal underpants from dressers. Why? The gnomes present their pitch deck. “Phase 1: Collect underpants. Phase 2: ? Phase 3: Profit.”

The gnomes’ business plan has since become one of the greats among internet memes, used to satirize everything from startup strategies to policy proposals. Memelord in chief Elon Musk once invoked it in a talk about how he planned to fund a mission to Mars. Right now, it captures the state of AI. Companies have built the tech (Step 1) and promised transformation (Step 3). How they get there is still a big question mark.

As far as Pause AI is concerned, Step 2 must involve some kind of regulation. But exactly what it will call for and who will enforce it are up for debate.

AI boosters, on the other hand, are convinced that Step 3 is salvation and tend to glaze over the middle bit. They see us racing toward sunny uplands on the back of an “economically transformative technology,” as OpenAI’s chief scientist, Jakub Pachocki, put it to me a few weeks ago. They know where they want to go—more or less: It’s hazy up there and still some way off. But everyone’s taking a different route. Will they all make it? Will anyone?

For every big claim about the future, there is a more sober assessment of how the rubber meets the road—one that quells the hype. Consider two recent studies. One, from Anthropic, predicted what types of jobs are going to be most affected by LLMs. (A takeaway: Managers, architects, and people in the media should prepare for change; groundskeepers, construction workers, and those in hospitality, not so much.) But their predictions are really just guesses, based on what kinds of tasks LLMs seem to be good at rather than how they really perform in the workplace.   

Another study, put out in February by researchers at Mercor, an AI hiring startup, tested several AI agents powered by top-tier models from OpenAI, Anthropic, and Google DeepMind on 480 workplace tasks frequently carried out by human bankers, consultants, and lawyers. Every agent they tested failed to complete most of its duties.   

Why is there such wide disagreement? There are a number of factors. For a start, it’s crucial to consider who is making the claims (and why). Anthropic has skin in the game. What’s more, most of the people telling us that something big is about to happen have reached that conclusion largely on the basis of how fast AI coding tools are getting. But not all tasks can be hacked with coding. Other studies have found that LLMs are bad at making strategic judgment calls, for example.



Source link

  • Related Posts

    GitHub rushed to fix a critical vulnerability in less than six hours

    GitHub employees fixed a critical remote code execution vulnerability in less than six hours last month. Wiz Research used AI models to uncover a vulnerability in GitHub’s internal git infrastructure…

    The Justice Department Has Destroyed Its Voting Rights Section

    When a new administration moves to Washington, DC, there are always changes in policy priorities and personnel. Alex, a lawyer in the Department of Justice’s Voting Section, had survived Donald…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    GitHub rushed to fix a critical vulnerability in less than six hours

    GitHub rushed to fix a critical vulnerability in less than six hours

    IPL 2026 – PBKS vs RR – Shubham Dubey comes good in ‘very difficult role’ for Rajasthan Royals

    IPL 2026 – PBKS vs RR – Shubham Dubey comes good in ‘very difficult role’ for Rajasthan Royals

    WATCH: King Charles, Trump share quips, tout US-UK relationship at state dinner

    WATCH:  King Charles, Trump share quips, tout US-UK relationship at state dinner

    Supreme Court weighs Trump attempt to remove protections for thousands of Haitian and Syrian immigrants

    Supreme Court weighs Trump attempt to remove protections for thousands of Haitian and Syrian immigrants

    MIT study finds children more vulnerable to cancer-causing chemical in water

    MIT study finds children more vulnerable to cancer-causing chemical in water

    Carney, Champagne and other front benchers fan out to tout latest fiscal update

    Carney, Champagne and other front benchers fan out to tout latest fiscal update