Google launches Gemini 3 with new coding app and record benchmark scores | TechCrunch
Briefly

Google launches Gemini 3 with new coding app and record benchmark scores | TechCrunch
"On Tuesday, Google released Gemini 3, its latest and most advanced foundation model, which is now immediately available through the Gemini app and AI search interface. Coming just seven months after the Gemini 2.5 release, the new model is Google's most capable LLM yet, and an immediate contender for the most capable AI tool on the market. The release also comes less than a week after OpenAI released GPT 5.1, and a mere two months after Anthropic released Sonnet 4.5 - a reminder of the blistering pace of frontier model development."
""With Gemini 3, we're seeing this massive jump in reasoning," said Tulsee Doshi, Google's head of product for the Gemini model. "It's responding with a level of depth and nuance that we haven't seen before.""
"With a score of 37.4, the model marked the highest score on record on the Humanity's Last Exam benchmark, meant to capture general reasoning and expertise. The previous high score, held by GPT-5 Pro, was 31.64. Gemini 3 also topped the leaderboard on LMArena, a human-led benchmark that measures user satisfaction."
Google released Gemini 3, the most advanced foundation model, now available through the Gemini app and AI search. The model follows Gemini 2.5 by seven months and arrives amid recent competitor releases such as GPT 5.1 and Sonnet 4.5. A research-oriented variant, Gemini 3 Deepthink, will be offered to Google AI Ultra subscribers after further safety testing. Gemini 3 demonstrates notable reasoning improvements, scoring 37.4 on Humanity's Last Exam—surpassing GPT-5 Pro's 31.64—and topping LMArena's user satisfaction leaderboard. Google reports over 650 million monthly app users and 13 million developers using the model. Google also released Antigravity, a Gemini-powered multi-pane coding interface combining a chat prompt, terminal, and browser view to surface agent-made changes.
Read at TechCrunch
Unable to calculate read time
[
|
]