Android's screen reader can now answer questions about images

Today is Global Accessibility Awareness Day (GAAD), and, as in years past, many tech companies are marking the occasion with the announcement of new assistive features for their ecosystems. Apple got things rolling on Tuesday, and now Google is joining the parade. To start, the company has made TalkBack, Android's built-in screen reader, more useful. With the help of one of Google's Gemini models, TalkBack can now answer questions about images displayed on your phone, even if they don't have any alt text describing them.

"That means the next time a friend texts you a photo of their new guitar, you can get a description and ask follow-up questions about the make and color, or even what else is in the image," explains Google. Gemini can see and understand the image thanks to the multimodal capabilities Google built into the model. Additionally, the Q&A functionality works across the entire screen. For example, if you're doing some online shopping, you can first ask your phone to describe the color of the piece of clothing you're interested in and then ask if it's on sale.

Separately, Google is rolling out a new version of its Expressive Captions. First announced at the end of last year, the feature generates subtitles that attempt to capture the emotion of what’s being said. For instance, if you're video chatting with some friends and one of them groans after you make a lame joke, your phone will not only subtitle what they said but also include "[groaning]" in the transcription. With the new version of Expressive Captions, the resulting subtitles will reflect when someone drags out the sound of their words. That means the next time you're watching a live soccer match and the announcer yells "goallllllll," their excitement will be properly transcribed. Plus, there are now more labels for sounds, such as someone clearing their throat.

The new version of Expressive Captions is rolling out to English-speaking users in the US, UK, Canada and Australia running Android 15 and above on their phones.

This article originally appeared on Engadget at https://www.engadget.com/mobile/smartphones/androids-screen-reader-can-now-answer-questions-about-images-160032185.html?src=rss
Created 1mo | 15.05.2025, 18:10:27


