Google introduces the Deep Think reasoning model for Gemini 2.5 Pro and a better 2.5 Flash

Google has started testing a reasoning model called Deep Think for Gemini 2.5 Pro, the company revealed at its I/O developer conference. According to DeepMind CEO Demis Hassabis, Deep Think uses "the latest cutting-edge research" to give the model the ability to consider multiple hypotheses before responding to queries. Google says the model earned an "impressive score" when evaluated on questions from the 2025 United States of America Mathematical Olympiad. However, Google wants to take more time to conduct safety evaluations and get further input from safety experts before releasing it widely. That's why it's initially making Deep Think available to trusted testers via the Gemini API, so it can gather their feedback first.

The company has also introduced a better Gemini 2.5 Flash model, which is optimized for speed and efficiency. It's now more efficient than before, uses fewer tokens and has scored higher in benchmarks for reasoning, multimodality, code and long context than its predecessor. It will be generally available in early June. For now, the improved Gemini 2.5 Flash is available as a preview via Google AI Studio for developers, via Vertex AI for enterprise customers and via the Gemini app for other users. 

While most of the efficiency gains covered on the I/O stage were focused on 2.5 Flash, Google did announce that it's bringing the 2.5 Flash concept of "Thinking Budgets" to its more advanced 2.5 Pro model. This feature will let you balance tokens spent vs. accuracy and speed of output.
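
For developers curious what that trade-off might look like in practice, here is a minimal sketch of a request body that sets a thinking budget. It assumes the Gemini REST API's `generationConfig.thinkingConfig.thinkingBudget` field; the helper name `build_request` and the budget values are hypothetical and do not come from Google's announcement, so check the current API documentation before relying on them.

```python
import json


def build_request(prompt: str, thinking_budget: int) -> dict:
    """Build a JSON-serializable request body with a thinking budget.

    The field names are assumed from the Gemini REST API; verify before use.
    """
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            # Assumed field: the number of tokens the model may spend reasoning.
            "thinkingConfig": {"thinkingBudget": thinking_budget},
        },
    }


# Hypothetical budgets: a small one favors speed, a larger one favors accuracy.
fast = build_request("Summarize this paragraph.", thinking_budget=0)
careful = build_request("Prove that sqrt(2) is irrational.", thinking_budget=4096)

print(json.dumps(careful, indent=2))
```

The two payloads differ only in the budget value, which is the single knob the feature exposes for trading tokens spent against accuracy and speed.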

Separately, Google is bringing Project Mariner to the Gemini API and Vertex AI. Project Mariner consists of Google's Gemini-powered AI agents, which can navigate web pages in a browser to complete tasks for users. The company will roll the agents out more broadly this summer so that developers can experiment with them. In addition, the company is releasing new previews of text-to-speech for both the 2.5 Pro and 2.5 Flash models via the Gemini API, with support for two voices in 24 languages.

This article originally appeared on Engadget at https://www.engadget.com/ai/google-introduces-the-deep-think-reasoning-model-for-gemini-25-pro-and-a-better-25-flash-174531020.html?src=rss
Created May 20, 2025, 18:50:17

