A group of authors is suing artificial intelligence startup Anthropic, alleging it committed “large-scale theft” in training its popular chatbot Claude on pirated copies of copyrighted books.
While similar lawsuits have piled up for more than a year against competitor OpenAI, maker of ChatGPT, this is the first from writers to target Anthropic and its Claude chatbot.
The smaller San Francisco-based company — founded by ex-OpenAI leaders — has marketed itself as the more responsible and safety-focused developer of generative AI models that can compose emails, summarize documents and interact with people in a natural way.
But the lawsuit filed Monday in a federal court in San Francisco alleges that Anthropic’s actions “have made a mockery of its lofty goals” by tapping into repositories of pirated writings to build its AI product.
“It is no exaggeration to say that Anthropic’s model seeks to profit from strip-mining the human expression and ingenuity behind each one of those works,” the lawsuit says.
Anthropic didn’t immediately respond to a request for comment Monday.
The lawsuit was brought by a trio of writers — Andrea Bartz, Charles Graeber and Kirk Wallace Johnson — who are seeking to represent a class of similarly situated authors of fiction and nonfiction.
While it’s the first case against Anthropic from book authors, the company is also fighting a lawsuit by major music publishers alleging that Claude regurgitates the lyrics of copyrighted songs.
The authors’ case joins a growing number of lawsuits filed against developers of AI large language models in San Francisco and New York.
OpenAI and its business partner Microsoft are already battling a group of copyright infringement cases led by household names like John Grisham, Jodi Picoult and “Game of Thrones” novelist George R. R. Martin; and another set of lawsuits from media outlets such as The New York Times, Chicago Tribune and Mother Jones.
What links all the cases is the claim that tech companies ingested huge troves of human writings to train AI chatbots to produce human-like passages of text, without getting permission or compensating the people who wrote the original works. The legal challenges are coming not just from writers but visual artists, music labels and other creators who allege that generative AI profits have been built on misappropriation.
Anthropic and other tech companies have argued that training of AI models fits into the “fair use” doctrine of U.S. laws that allows for limited uses of copyrighted materials such as for teaching, research or transforming the copyrighted work into something different.
But the lawsuit against Anthropic accuses it of using a dataset called The Pile that included a trove of pirated books. It also disputes the idea that AI systems are learning the way humans do.
“Humans who learn from books buy lawful copies of them, or borrow them from libraries that buy them, providing at least some measure of compensation to authors and creators,” the lawsuit says.
———
The Associated Press and OpenAI have a licensing and technology agreement that allows OpenAI access to part of AP’s text archives.
—Matt O’Brien, AP Technology Writer
Zaloguj się, aby dodać komentarz
Inne posty w tej grupie

A “click-to-cancel” rule, which would have made it easier f


Inbox fatigue is real. According to one analysis, the average person receives more than 120 emails a day, with some o

Tech nostalgia runs strong among Gen Z. The retro movement has made long-outdated devices desirable

For much of the AI era, intelligence has been on-demand: a user issues

Every so often, Microsoft design director Diego Baca boots up an old computer so he can play around with Windows 95 again.
Baca has made a hobby of assembling old PCs with new-in-box vin

Twitter cofounder Jack Dorsey is back with a new app that tracks sun exposure and vitamin D levels.
Sun Day uses location-based data to show the current UV index, the day’s high, and add