AI dataset licensing companies form group to promote ‘ethical data sourcing’

Seven content-licensing sellers of music, image, video, and other datasets for use in training artificial intelligence systems have formed the sector’s first trade group, they said on Wednesday.

The Dataset Providers Alliance (DPA) will advocate for “ethical data sourcing” in the training of AI systems, including rights for people depicted in datasets, and the protection of content owners’ intellectual property rights, the companies said in a statement.

Founding members include U.S. music dataset company Rightsify, image licensing service vAIsual, Japanese stock photo provider Pixta, and Germany-based data marketplace Datarade.

The emergence of generative AI technologies that can mimic human creativity in recent years has triggered an outcry from content creators and a string of copyright lawsuits against tech companies like Google, Meta, and ChatGPT maker OpenAI, which is backed by Microsoft.

Developers have been training models by feeding them vast quantities of content, much of it scraped from the internet for free without the consent of those who created the works or own rights to them.

Tech companies, which claim the usage is legal, are also quietly paying for access to private collections of content both to fulfill needs for particular types of data and to hedge against legal and regulatory risks.

The prospect that demand for licensed data will grow if copyright owners prevail in their legal fights has prompted the emergence of a nascent industry of companies that package content and sell access to it for use by AI systems.

As a result, groups have been formed to establish ethical standards for that trade, like Fairly Trained, a nonprofit founded this year which certifies models that have not used copyrighted materials without a license.

The DPA targets the content of those transactions, requiring, for example, that its members agree not to sell text data obtained by crawling the web or audio that features people’s voices without their explicit consent.

A heavy focus will be to push for legislation like the NO FAKES Act, a U.S. bill introduced last year to create penalties for generating unauthorized digital replicas of people’s voices or likenesses, said Alex Bestall, CEO of Rightsify and its licensing subsidiary GCX, who led the founding of the group.

“Advocacy will be a big part of it because everyone’s taken their positions on AI and copyright, but a lot of these battles are yet to be solved and it’s going to take a while for them to be,” said Bestall.

The DPA also will press for more training-data transparency requirements like those in the European Union’s AI Act and a similar U.S. bill introduced in April, the Generative AI Copyright Disclosure Act, he added.

The group plans to publish a white paper outlining its positions in July, he said.

(This story has been refiled to remove extra characters in paragraph 1)

—Katie Paul, Reuters

https://www.fastcompany.com/91146889/ai-dataset-licensing-companies-form-group-to-promote-ethical-data-sourcing?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

созданный 12mo | 27 июн. 2024 г., 01:10:07


Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

How to prepare for your digital legacy after death

From family photos in the cloud to email archives and social media accounts, the digital lives of Americans are extensive and growing.

According to recent studies by the password managem

12 июн. 2025 г., 22:40:02 | Fast company - tech
Chime’s cofounder on the company’s IPO: ‘We’re just getting started’

A dozen years after its launch, fintech company Chime rang the bell this morning at the Nasdaq MarketSite in Times Square to ce

12 июн. 2025 г., 20:20:06 | Fast company - tech
What is a fridge cigarette? The viral Diet Coke trend explained

It hits at a certain time in the afternoon, when a familiar craving strikes. You walk to the kitchen. The satisfying sound of a can cracking, the hiss of bubbles. It’s time for a “fridge cigarette

12 июн. 2025 г., 20:20:06 | Fast company - tech
This startup wants AI to help manage software infrastructure, not just write code

Many developers find that AI programming assistants have made writing code easier than ever. But maintaining the infrastructure that actually runs that code remains a challenge, requiring engineer

12 июн. 2025 г., 18:10:21 | Fast company - tech
Apple fumbled its personal AI debut, but the alternative was far worse

Welcome to AI DecodedFast Company’s weekly newsletter that breaks down the most important news in the world of AI. You can sign up to receive this newsletter every week 

12 июн. 2025 г., 18:10:18 | Fast company - tech
Greenhouse and Clear team up to fight fake job applications flooding tech hiring

Fraudulent job applications have become a serious issue in the era of

12 июн. 2025 г., 13:30:02 | Fast company - tech
‘We’re on the cusp of more widespread adoption’: Laura Shin on Trump, stablecoins, and the global rise of cryptocurrency

With the first family actively engaged in memecoin ventures, speculation about the future of cryptocurrency has never been hotter. Laura Shin, crypto expert and host of the podcast Unchained

12 июн. 2025 г., 11:10:06 | Fast company - tech