Android's screen reader can now answer questions about images

Today is Global Accessibility Awareness Day (GAAD), and, as in years past, many tech companies are marking the occasion with the announcement of new assistive features for their ecosystems. Apple got things rolling on Tuesday, and now Google is joining in on the parade. To start, the company has made TalkBack, Android's built-in screen reader, more useful. With the help of one of Google's Gemini models, TalkBack can now answer questions about images displayed on your phone, even they don't have any alt text describing them.

"That means the next time a friend texts you a photo of their new guitar, you can get a description and ask follow-up questions about the make and color, or even what else is in the image," explains Google. The fact Gemini can see and understand the image is thanks to the multi-modal capabilities Google built into the model. Additionally, the Q&A functionality works across the entire screen. So, for example, say you're doing some online shopping, you can first ask your phone to describe the color of the piece of clothing you're interested in and then ask if it's on sale.

Separately, Google is rolling out a new version of its Expressive Captions. First announced at the end of last year, the feature generates subtitles that attempt to capture the emotion of what’s being said. For instance, if you're video chatting with some friends and one of them groans after you make a lame joke, your phone will not only subtitle what they said but it will also include "[groaning]" in the transcription. With the new version of Expressive Captions, the resulting subtitles will reflect when someone drags out the sound of their words. That means the next time you're watching a live soccer match and the announcer yells "goallllllll," their excitement will be properly transcribed. Plus, there will be more labels now for sounds like when someone is clearing their throat.

The new version of Expressive Captions is rolling out to English-speaking users in the US, UK, Canada and Australia running Android 15 and above on their phones.

This article originally appeared on Engadget at https://www.engadget.com/mobile/smartphones/androids-screen-reader-can-now-answer-questions-about-images-160032185.html?src=rss https://www.engadget.com/mobile/smartphones/androids-screen-reader-can-now-answer-questions-about-images-160032185.html?src=rss
Létrehozva 2mo | 2025. máj. 15. 18:10:27


Jelentkezéshez jelentkezzen be

EGYÉB POSTS Ebben a csoportban

Canada caves to Trump and rescinds its digital service tax on big tech

Canada has folded in its battle with US President Donald Trump over tariffs by cancelling its proposed digital services tax (DST) on big tech companies, the government

2025. jún. 30. 12:50:13 | Engadget
Our favorite mesh Wi-Fi router drops to a record-low price for Prime Day

Prime Day is just one week away, and the early deals are already arriving on Amazon. There's everything from the fu

2025. jún. 30. 12:50:12 | Engadget
Apple's F1 laps its competition with a $144 million opening weekend

Apple's film studio finally has a successful summer blockbuster to its name with its latest sports drama flick starring Brad Pitt.

2025. jún. 29. 20:40:12 | Engadget
Dave the Diver's In the Jungle DLC may not arrive until 2026, but Godzilla is back

Dave the Diver just marked its two-year anniversary, and the team behind it has a bunch of updates to share about its future. While it's mostly good news, there is one little hiccup: the u

2025. jún. 29. 20:40:11 | Engadget
Playdate Season 2 review: Tiny Turnip and Chance's Lucky Escape

It's hard to believe that Playdate Season Two is almost over already, but here we are in week five with just one more drop of new games left to go after this. In the latest batch, we got the climbi

2025. jún. 29. 18:20:17 | Engadget