Publishers Target Common Crawl In Fight Over AI Training Data

June 13, 2024

Danish media outlets have demanded that the nonprofit web archive Common Crawl remove copies of their articles from past data sets and stop crawling their websites immediately. This request was issued amid growing outrage over how artificial intelligence companies like OpenAI are using copyrighted materials.

Common Crawl plans to comply with the request, first issued on Monday. Executive director Rich Skrenta says the organization is “not equipped” to fight media companies and publishers in court.

The Danish Rights Alliance (DRA), an association representing copyright holders in Denmark, spearheaded the campaign. It made the request on behalf of four media outlets, including Berlingske Media and the daily newspaper Jyllands-Posten. The New York

→ Continue reading at WIRED
