Android's screen reader can now answer questions about images

Today is Global Accessibility Awareness Day (GAAD), and, as in years past, many tech companies are marking the occasion with the announcement of new assistive features for their ecosystems. Apple got things rolling on Tuesday, and now Google is joining in on the parade. To start, the company has made TalkBack, Android's built-in screen reader, more useful. With the help of one of Google's Gemini models, TalkBack can now answer questions about images displayed on your phone, even they don't have any alt text describing them.

"That means the next time a friend texts you a photo of their new guitar, you can get a description and ask follow-up questions about the make and color, or even what else is in the image," explains Google. The fact Gemini can see and understand the image is thanks to the multi-modal capabilities Google built into the model. Additionally, the Q&A functionality works across the entire screen. So, for example, say you're doing some online shopping, you can first ask your phone to describe the color of the piece of clothing you're interested in and then ask if it's on sale.

Separately, Google is rolling out a new version of its Expressive Captions. First announced at the end of last year, the feature generates subtitles that attempt to capture the emotion of what’s being said. For instance, if you're video chatting with some friends and one of them groans after you make a lame joke, your phone will not only subtitle what they said but it will also include "[groaning]" in the transcription. With the new version of Expressive Captions, the resulting subtitles will reflect when someone drags out the sound of their words. That means the next time you're watching a live soccer match and the announcer yells "goallllllll," their excitement will be properly transcribed. Plus, there will be more labels now for sounds like when someone is clearing their throat.

The new version of Expressive Captions is rolling out to English-speaking users in the US, UK, Canada and Australia running Android 15 and above on their phones.

This article originally appeared on Engadget at https://www.engadget.com/mobile/smartphones/androids-screen-reader-can-now-answer-questions-about-images-160032185.html?src=rss https://www.engadget.com/mobile/smartphones/androids-screen-reader-can-now-answer-questions-about-images-160032185.html?src=rss
Created 2mo | May 15, 2025, 6:10:27 PM


Login to add comment

Other posts in this group

Judge rules Apple must face antitrust lawsuit brought by the US DOJ

The US Department of Justice's antitrust

Jun 30, 2025, 10:10:23 PM | Engadget
How to buy the Switch 2: Nintendo's restock updates from Walmart, Best Buy and more

The Nintendo Switch 2 has been available in the US for more than three weeks — and we finally saw a second wave of a

Jun 30, 2025, 10:10:22 PM | Engadget
Apple may power Siri with Anthropic or OpenAI models amid AI struggles

Apple is considering using AI models from OpenAI or Anthropic to deliver the

Jun 30, 2025, 10:10:21 PM | Engadget
Video Games Weekly: Summer Game Fest ends when I say so

Welcome to Video Games Weekly on Engadget. Expect a new story every Monday or Tuesday, broken into two parts. The first is a space for short essays and ramblings about video game trends and rel

Jun 30, 2025, 10:10:20 PM | Engadget
11 Bit Studios clarifies its AI use in The Alters after player outcry

11 Bit Studios has drawn the ire of players for the undisclosed use of artificial intelligence in its recent release, The Alters. The new project from the team behind Frostpunk an

Jun 30, 2025, 10:10:18 PM | Engadget