The latest version of xAI's Grok can process images

xAI, the OpenAI competitor founded by Elon Musk, has introduced the first version of Grok that can process visual information. Grok-1.5V is the company's first-generation multimodal AI model, which can process not only text, but also "documents, diagrams, charts, screenshots and photographs." In its announcement, xAI gave a few examples of how those capabilities can be used in the real world. You can, for instance, show it a photo of a flow chart and ask Grok to translate it into Python code, get it to write a story based on a drawing and even have it explain a meme you can't understand. Hey, not everyone can keep up with everything the internet spits out.

The new version comes just a couple of weeks after the company unveiled Grok-1.5. That model was designed to be better at coding and math than its predecessor, and to handle longer contexts so that it can check data from more sources to better understand certain inquiries. xAI said its early testers and existing users will soon be able to enjoy Grok-1.5V's capabilities, though it didn't give an exact timeline for its rollout.

In addition to introducing Grok-1.5V, the company has also released a benchmark dataset it's calling RealWorldQA. You can use any of RealWorldQA's 700 images to evaluate AI models: Each item comes with questions and answers you can easily verify, but which may stump multimodal models like Grok. xAI claimed its technology received the highest score when the company tested it with RealWorldQA against competitors, such as OpenAI's GPT-4V and Google Gemini Pro 1.5.
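To make the idea of "questions and answers you can easily verify" concrete, here is a minimal sketch of how a benchmark like this could be scored. It is not xAI's evaluation code: the `Item` layout, the field names, the sample file names and the exact-match `normalize` rule are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """One hypothetical benchmark entry (layout assumed, not xAI's format)."""
    image: str     # path or identifier of the image
    question: str  # question asked about the image
    answer: str    # short, verifiable reference answer


def normalize(text: str) -> str:
    # Assumed matching rule: ignore case, surrounding whitespace
    # and a trailing period.
    return text.strip().lower().rstrip(".")


def accuracy(items: list[Item], predictions: list[str]) -> float:
    """Return the fraction of predictions that match the reference answers."""
    if len(items) != len(predictions):
        raise ValueError("need exactly one prediction per item")
    if not items:
        return 0.0
    correct = sum(
        normalize(pred) == normalize(item.answer)
        for item, pred in zip(items, predictions)
    )
    return correct / len(items)


# Made-up sample data, purely for demonstration.
sample_items = [
    Item("street.jpg", "Which way is the arrow pointing?", "Left"),
    Item("desk.jpg", "How many mugs are on the desk?", "2"),
    Item("sign.jpg", "Can I park here on Sunday?", "Yes"),
]
sample_predictions = ["left.", "3", " yes "]
score = accuracy(sample_items, sample_predictions)
```

In practice, the predictions would come from whichever multimodal model is being tested, and a real evaluation may use a more forgiving matching rule than exact string comparison.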

This article originally appeared on Engadget at https://www.engadget.com/the-latest-version-of-xais-grok-can-process-images-120025782.html?src=rss
Created 13.04.2024, 12:40:10

