AI companies want you to stop chatting with bots and start managing them



Despite the hype about these agents being co-workers, in our experience they work best when treated as tools that amplify existing skills, not as the autonomous colleagues the marketing language implies. They can produce impressive drafts fast but still require constant human course-correction.

The Frontier launch came just three days after OpenAI released a new macOS desktop app for Codex, its AI coding tool, which OpenAI executives described as a “command center for agents.” The Codex app lets developers run multiple agent threads in parallel, each working on an isolated copy of a codebase via Git worktrees.
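For readers unfamiliar with Git worktrees, the mechanism can be sketched in a few lines of Python. This is an illustration of the general technique, not OpenAI's implementation: the function names, the `agent/<task>` branch naming, and the sibling-directory layout are all our own assumptions. The only real interface used is Git's `git worktree add -b <branch> <path> <commit>` command.

```python
import subprocess
from pathlib import Path


def worktree_commands(repo, tasks, base="HEAD"):
    """Build one `git worktree add` command per task (nothing is executed here).

    Each task gets its own branch and its own checkout directory, so
    parallel agents never edit the same working copy.
    """
    repo_path = Path(repo)
    commands = []
    for task in tasks:
        branch = f"agent/{task}"  # hypothetical branch naming scheme
        # Hypothetical layout: a sibling directory named <repo>-<task>
        path = repo_path.parent / f"{repo_path.name}-{task}"
        commands.append(
            ["git", "-C", str(repo_path), "worktree", "add",
             "-b", branch, str(path), base]
        )
    return commands


def create_worktrees(repo, tasks, base="HEAD"):
    """Run the commands; requires Git and an existing repository at `repo`."""
    for cmd in worktree_commands(repo, tasks, base):
        subprocess.run(cmd, check=True)
```

Calling `create_worktrees("/src/app", ["fix-login", "add-tests"])` would create two separate checkouts that share one repository history, which is what lets several agents work at once without overwriting each other's files.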

OpenAI also released GPT-5.3-Codex on Thursday, a new AI model that powers the Codex app. OpenAI claims that the Codex team used early versions of GPT-5.3-Codex to debug the model’s own training run, manage its deployment, and diagnose test results, similar to what OpenAI told Ars Technica in a December interview.

“Our team was blown away by how much Codex was able to accelerate its own development,” the company wrote. On Terminal-Bench 2.0, the agentic coding benchmark, GPT-5.3-Codex scored 77.3%, which exceeds Anthropic’s just-released Opus 4.6 by about 12 percentage points.

The common thread across all of these products is a shift in the user’s role. Rather than merely typing a prompt and waiting for a single response, the developer or knowledge worker becomes more like a supervisor, dispatching tasks, monitoring progress, and stepping in when an agent needs direction.

In this vision, developers and knowledge workers effectively become middle managers of AI. That is, they no longer write the code or do the analysis themselves but instead delegate tasks, review output, and hope the agents underneath them don’t quietly break things. Whether that will come to pass (or whether it’s actually a good idea) is still widely debated.


