OpenAI’s GPT-4o brings us closer to the ‘Her’ experience

OpenAI held a webcast Monday to roll out a new version of its free ChatGPT app, which sounds and acts a lot like the AI in the 2013 Spike Jonze film, Her.

The experience is powered by a new version of its GPT-4 large language model—available on desktop and mobile—called GPT-4o (“GPT-four-oh”). The new model, OpenAI says, returns answers much faster than GPT-4, and improves on its text, vision, and audio capabilities.

The model is a showcase for OpenAI’s development of multimodal AI. GPT-4o can receive and reason about text, audio, and visual inputs, then deliver outputs in natural language and a natural-sounding voice.

OpenAI researcher Mark Chen demonstrated the new model’s impressive conversational capabilities during a live demo. He told the chatbot that he was nervous about the demo, and asked her for advice to help him calm down. Chen then mock-hyperventilated into the phone, to which the app responded, “Mark! You’re not a vacuum cleaner.” The AI was spontaneous and funny, much like the voice assistant (voiced by Scarlett Johansson) in Her, which has become a North Star for people developing consumer AI.

The app was asked to tell a story with various levels of “drama” in its voice, which it did, convincingly. The AI then told the same story in a stereotypical robot’s voice, and then again in sing-song fashion.

Chen also demonstrated how he could interrupt the AI voice, and she would quickly stop talking. ChatGPT, in other words, is getting more “emotionally” intelligent. This is very similar to what Inflection.ai was developing with its Pi AI app. But Inflection.ai was essentially bought out by Microsoft, the same tech giant that owns almost half of OpenAI.

The ChatGPT app also has the ability to “see” things and reason about them. Through the phone camera, the app was shown a math problem written on a white board and asked for help in working it out. It was then asked to explain some computer code. The app also did a live translation from Italian to English and back.

The new features in the ChatGPT app will roll out to users of the free version of ChatGPT over the next few weeks. OpenAI says it’s also making GPT-4o available to developers through its API. OpenAI’s livestreamed announcement on Monday seemed timed to steal some thunder from Google, which is expected to make a series of AI-related announcements at its I/O developer conference on Tuesday.

https://www.fastcompany.com/91123206/openai-gpt-4o-announcement?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created May 13, 2024, 18:40:08

