OpenAI releases GPT-5.2 to take on Google and Anthropic


OpenAI’s “code red” response to Google’s Gemini 3 Pro has arrived. On the same day the company announced a Sora licensing pact with Disney, it took the wraps off GPT-5.2. OpenAI is touting the new model as its best yet for real-world, professional use. “It’s better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects,” said OpenAI.

In a series of 10 benchmarks highlighted by OpenAI, GPT-5.2 Thinking, the most advanced version of the model, outperformed its GPT-5.1 counterpart, sometimes by a significant margin. For example, in AIME 2025, a test that involves 30 challenging mathematics problems, the model earned a perfect 100 percent score, beating out GPT-5.1’s already state-of-the-art score of 94 perfect. It also achieved that feat without turning to tools like web search. Meanwhile, in ARC-AGI-1, a benchmark that tests an AI system’s ability to reason abstractly like a human being would, the new system beat GPT-5.1’s score by more than 10 percentage points.

OpenAI says GPT-5.2 Thinking is better at answering questions factually, with the company finding it produces errors 30 percent less frequently. “For professionals, this means fewer mistakes when using the model for research, writing, analysis, and decision support — making the model more dependable for everyday knowledge work,” the company said.

The new model should be better in conversation too. Of the version of the system most users are likely to encounter, OpenAI says “GPT‑5.2 Instant is a fast, capable workhorse for everyday work and learning, with clear improvements in info-seeking questions, how-tos and walk-throughs, technical writing, and translation, building on the warmer conversational tone introduced in GPT‑5.1 Instant.“

While it’s probably overstating things to suggest this is a make or break release for OpenAI, it is fair to say the company does have a lot riding on GPT 5.2. Its big release of 2025, GPT-5, didn’t meet expectations. Users complained of a system that generated surprisingly dumb answers and had a boring personality. The disappointment with GPT-5 was such that people began demanding OpenAI bring back GPT-4o.

Then came Gemini 3 Pro — which jumped to the top of LMArena, a website where humans rate outputs from AI systems to vote on the best one. Following Google’s announcement, Sam Altman reportedly called for a “code red” effort to improve ChatGPT. Before today, the company’s previous model, GPT-5.1, was ranked sixth on LMArena, with systems from Anthropic and Elon Musk’s xAI occupying the spots between OpenAI between Google.

For a company that recently signed more than $1.4 trillion worth of infrastructure deals in a bid to outscale the competition, that was not a good position for OpenAI to be in. In his memo to staff, Altman said GPT-5.2 would be the equal of Gemini 3 Pro. With the new system rolling out now, we’ll see whether that’s true, and what it might mean for the company if it can’t at least match Google’s best.   

OpenAI is offering three different versions of GPT-5.2: Instant, Thinking and Pro. All three models will be first available to users on the company’s paid plans. Notably, the company plans to keep GPT-5.1 around, at least for a little while. Paid users can continue to use the older model for the next three months by selecting it from the legacy models section.



Source link

  • Related Posts

    ScaleOps raises $130M to improve computing efficiency amid AI demand

    AI may be booming, but behind the scenes, companies are wasting vast amounts of expensive compute. GPUs sit idle, workloads are over-provisioned, and cloud costs continue to climb. ScaleOps believes…

    Apple subsidiary fined by UK government over Moscow sanctions breach | Apple

    The UK government has fined a subsidiary of Apple £390,000 for breaching sanctions against Moscow over payments it made to a Russian streaming platform. Apple Distribution International (ADI), based in…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    How soon will airport security lines return to normal after TSA officers get paid?

    How soon will airport security lines return to normal after TSA officers get paid?

    He Led Congo for 18 Years. Now, Joseph Kabila Is a Hunted Man.

    Powerful cholesterol drug cuts heart attack risk by 31%

    Powerful cholesterol drug cuts heart attack risk by 31%

    War in Iran puts the squeeze on B.C. small businesses

    War in Iran puts the squeeze on B.C. small businesses

    “If I release something official, I want it to match my vision” – Undertale and Deltarune creator Toby Fox explains why no translations other than Japanese have happened

    “If I release something official, I want it to match my vision” – Undertale and Deltarune creator Toby Fox explains why no translations other than Japanese have happened

    Tapping, twirling and "T" signs: Sports replays have a language all their own

    Tapping, twirling and "T" signs: Sports replays have a language all their own