Google announces Gemini 3.1 Pro, says it’s better at complex problem-solving


Another day, another Google AI model. Google has really been pumping out new AI tools lately, having just released Gemini 3 in November. Today, it’s bumping the flagship model to version 3.1. The new Gemini 3.1 Pro is rolling out (in preview) for developers and consumers today with the promise of better problem-solving and reasoning capabilities.

Google announced improvements to its Deep Think tool last week, and apparently, the “core intelligence” behind that update was Gemini 3.1 Pro. As usual, Google’s latest model announcement comes with a plethora of benchmarks that show mostly modest improvements. In the popular Humanity’s Last Exam, which tests advanced domain-specific knowledge, Gemini 3.1 Pro scored a record 44.4 percent. Gemini 3 Pro managed 37.5 percent, while OpenAI’s GPT 5.2 got 34.5 percent.

Gemini 3.1 Pro benchmarks

Google also calls out the model’s improvement in ARC-AGI-2, which features novel logic problems that can’t be directly trained into an AI. Gemini 3 was a bit behind on this evaluation, reaching a mere 31.1 percent versus scores in the 50s and 60s for competing models. Gemini 3.1 Pro more than doubles Google’s score, reaching a lofty 77.1 percent.

Google has often gloated when it releases new models that they’ve already hit the top of the Arena leaderboard (formerly LM Arena), but that’s not the case this time. For text, Claude Opus 4.6 edges out the new Gemini by four points at 1504. For code, Opus 4.6, Opus 4.5, and GPT 5.2 High all run ahead of Gemini 3.1 Pro by a bit more. It’s worth noting, however, that the Arena leaderboard is run on vibes. Users vote on the outputs they like best, which can reward outputs that look correct regardless of whether they are.



Source link

  • Related Posts

    Donald Trump Jr.’s Private DC Club Has Mysterious Ties to an Ex-Cop With a Controversial Past

    When the Executive Branch soft-launched in Washington, DC, last spring, the private club’s initial buzz centered on its starry roster of backers and founding members. The president’s eldest son, Donald…

    Cellebrite cut off Serbia citing abuse of its phone unlocking tools. Why not others?

    Last year, the phone hacking tool maker Cellebrite announced it had suspended Serbian police as customers, after human rights researchers alleged local police and intelligence agencies used its tools to…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Investigators hope to catch signals from Nancy Guthrie’s pacemaker

    Donald Trump Jr.’s Private DC Club Has Mysterious Ties to an Ex-Cop With a Controversial Past

    Donald Trump Jr.’s Private DC Club Has Mysterious Ties to an Ex-Cop With a Controversial Past

    9 Down Alternative Comforters Made With Nontoxic Materials

    9 Down Alternative Comforters Made With Nontoxic Materials

    Large Trump banner hung at justice department headquarters | Donald Trump

    Large Trump banner hung at justice department headquarters | Donald Trump

    Virginia judge temporarily blocks Democrats’ redistricting work on bid to flip 4 congressional seats

    Virginia judge temporarily blocks Democrats’ redistricting work on bid to flip 4 congressional seats

    Goodfellow Reports Its Results for the Fourth Quarter and Fiscal Year Ended November 30, 2025 and Declares a Dividend