Here are the companies OpenAI has made deals with to train ChatGPT

OpenAI’s chatbots scored a big new data source following the company’s deal with News Corp. on Wednesday. With the stroke of a pen, ChatGPT and the company’s other services added the Wall Street Journal, New York Post, MarketWatch, Barron’s, and other publications to its database.

The deal, which did not include Fox News content, was the latest in a growing series of big data sharing agreements OpenAI has signed in an effort to educate its systems and expand the technology’s expertise. Just last week, the company signed a similar deal with Reddit to incorporate its content into ChatGPT and new products.

The deals come after some media outlets, including The New York Times Company, have sued OpenAI and Microsoft for using their publications’ copyrighted stories without permission in training chatbots. Filed in Federal District Court in Manhattan, the suit alleges millions of NYTimes articles were used to train chatbots, which have begun to compete with the news outlet as information sources. A collective of well-known authors has also sued the company, alleging “systematic theft on a mass scale.” 

Inputting data is only half the battle, of course. OpenAI will have to figure out how to deal with biases in the information it incorporates and how to weed out information that’s sarcastic or pure parody. (Earlier this week, Google showed it still has a long way to go on this front, with the company’s AI Search sharing a farcical Reddit post as fact when it suggested “mix[ing] about 1/8 cup of non-toxic glue into the sauce” to keep the cheese from sliding off of your pizza slice.)

So, who all has partnered with OpenAI, giving the company access to their content libraries? Here’s a comprehensive look.

The Associated Press

Last July, the AP and OpenAI announced a deal letting the AI giant license AP’s archive of news stories going back through 1985. AP, in the meantime, was given the opportunity to leverage OpenAI’s tech.

Axel Springer

The German publisher was the first major media outlet to partner with OpenAI and open its archives to the chatbot. Axel Springer controls a huge assortment of outlets, including Politico, Business Insider, and German outlets Bild and Welt.

Dotdash Meredith

Dotdash Meredith is one of the largest digital publishers in the U.S., so its licensing deal, signed in May, gave OpenAI access to more than 40 brands, including People, Travel & Leisure, Entertainment Weekly, Allrecipes, Real Simple, Food & Wine, Parents, Investopedia, Better Homes & Garden, and InStyle.

The deal came after the company’s parent firm IAC had pushed to create a coalition uniting big publishers as they strove to protect copyrighted materials from AI firms. That effort ultimately fell apart.

The Financial Times

The FT partnered with OpenAI in April. The licensing deal gave the ChatGPT maker the ability to use FT materials to create text, images, and code. The deal also let ChatGPT respond to questions with short summaries from FT articles, with links back to FT.com.

Le Monde

In March, the French media organization struck a multiyear licensing agreement with OpenAI for its content library. Photos were not part of the deal and OpenAI agreed that references to Le Monde articles would be highlighted and accompanied by a logo, hyperlink, and the titles of the articles used as references.

News Corp.

News Corp.’s multiyear deal will give OpenAI access to a catalog of some of the most respected financial reporting around, with stories from the Wall Street Journal, MarketWatch, Barron’s, and more. It will also grant access to the New York Post as well as the U.K. publications The Times and The Sun plus multiple Australian publications including The Herald Sun and The Courier Mail.

The agreement does not include content from Fox News or News Corp.’s other businesses, such as its digital real estate services or HarperCollins, however. 

Prisa Media

At the same time it struck a deal with Le Monde, OpenAI also partnered with Spanish news outlet Prisa Media, which has brands in Spain, Latin America, and the U.S., including El Pais and El Huffpost, the Spanish version of the Huffington Post.

Reddit

With more than 1 million posts per day, Reddit is an ongoing source of content for ChatGPT to devour. It also will give the chatbot data for a wide range of topics, from “Ask Me Anything” sessions with celebrities and people in unusual jobs to sports discussion. (The NSFW forums could provide some data as well, but we’re not going to speculate what those could be used for.)

Reddit also struck a $60 million content licensing deal with Google in February. 

Shutterstock

OpenAI’s partnership with the stock photography website goes back to 2021. In 2023, OpenAI announced it was extending its partnership for another six years, with Shutterstock giving the company a large swath of training data for its AI, including Shutterstock’s image, video, and music libraries, and associated metadata.

These deals could be just the tip of the iceberg. As OpenAI continues to grow ChatGPT, it will need more data for its large language models. Several major publishers, from book houses to news outlets, are still on the sidelines but could be swayed to sign a partnership in the months to come as their revenues fall and OpenAI offers lucrative contracts. 

https://www.fastcompany.com/91130785/companies-reddit-news-corp-deals-openai-train-chatgpt-partnerships?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Établi 22d | 26 mai 2024 à 12:50:06


Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

How GM’s brilliant little car kicked off an EV boom 30 years ago

Few cars have captured the popular imagination quite like the EV1, the pioneering electric vehicle that General Motors launched a generation ago. Approved for production in 1994 and released two y

17 juin 2024 à 11:30:05 | Fast company - tech
The short, happy reign of CD-ROM

Thirty years ago, a breakthrough technology was poised to transform how people stayed informed, entertained themselves, and maybe even shopped. I’m not talking about the World Wide Web. True, it w

17 juin 2024 à 11:30:04 | Fast company - tech
In Japan, an AI app is detecting pain in cats

Mayumi Kitakata frets about the health and well-being of Chi, her stoic housemate who enjoys treats, indulges a bit too much in the catnip, and about 14 is getting on in years for a feline.

15 juin 2024 à 09:30:02 | Fast company - tech
Encrypt private messages in QR codes with this simple free site

Most of the tools we talk about tend to be things that make our own lives a little bit easier—often in some small but significant way.

Today’s tool takes a twist. It’s a free, o

15 juin 2024 à 04:50:04 | Fast company - tech
What to know about Weverse, HYBE’s superfan platform joined by Ariana Grande

Pop star Ariana Grande is joining Weverse, a superfan platform owned by

14 juin 2024 à 19:40:07 | Fast company - tech
Is X trying to compete with OnlyFans?

Last October, X began experimenting with various tiers of

14 juin 2024 à 19:40:07 | Fast company - tech