Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
This Chrome extension blocks social media until you scream (literally) in agony

This Chrome extension blocks social media until you scream (literally) in agony

8 February 2026
You Asked: Is Apple TV 4K Still a Good Buy? Bravia 9 or OLED for Bright Rooms?

You Asked: Is Apple TV 4K Still a Good Buy? Bravia 9 or OLED for Bright Rooms?

8 February 2026
I Have Fallen in Love With Open Earbuds (and You Should Too)

I Have Fallen in Love With Open Earbuds (and You Should Too)

8 February 2026
Facebook X (Twitter) Instagram
Just In
  • This Chrome extension blocks social media until you scream (literally) in agony
  • You Asked: Is Apple TV 4K Still a Good Buy? Bravia 9 or OLED for Bright Rooms?
  • I Have Fallen in Love With Open Earbuds (and You Should Too)
  • The Shoes and Brooms Transforming Curling at the 2026 Winter Olympics
  • The Best AI Notetakers to Record Your Meetings, Interviews, or Classes
  • For $4,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones?
  • Meta thinks you’ll want a whole app just for AI videos
  • 7 Steps to Better Financial Health You Can Take Right Now
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » The Fight Against AI Comes to a Foundational Data Set
News

The Fight Against AI Comes to a Foundational Data Set

News RoomBy News Room13 June 20243 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
The Fight Against AI Comes to a Foundational Data Set
Share
Facebook Twitter LinkedIn Pinterest Email

Danish media outlets have demanded that the nonprofit web archive Common Crawl remove copies of their articles from past datasets and stop crawling their websites immediately. This request was issued amid growing outrage over how artificial intelligence companies like OpenAI are using copyrighted materials.

Common Crawl plans to comply with the request, first issued on Monday. Executive director Rich Skrenta says the organization is “not equipped” to fight media companies and publishers in court.

The Danish Rights Alliance (DRA), an association representing copyright holders in Denmark, spearheaded the campaign. It made the request on behalf of four media outlets, including Berlingske Media and the daily newspaper Jyllands-Posten. The New York Times made a similar request of Common Crawl last year, prior to filing a lawsuit against OpenAI for using its work without permission. In its complaint, the New York Times highlighted how Common Crawl’s data was the most “highly weighted dataset” in GPT-3.

Thomas Heldrup, the DRA’s head of content protection and enforcement, says that this new effort was inspired by the Times. “Common Crawl is unique in the sense that we’re seeing so many big AI companies using their data,” Heldrup says. He sees its corpus as a threat to media companies attempting to negotiate with AI titans.

Although Common Crawl has been essential to the development of many text-based generative AI tools, it was not designed with AI in mind. Founded in 2007, the San Francisco-based organization was best known prior to the AI boom for its value as a research tool. “Common Crawl is caught up in this conflict about copyright and generative AI,” says Stefan Baack, a data analyst at the Mozilla Foundation who recently published a report on Common Crawl’s role in AI training. “For many years it was a small niche project that almost nobody knew about.”

Prior to 2023, Common Crawl did not receive a single request to redact data. Now, in addition to the requests from the New York Times and this group of Danish publishers, it’s also fielding an uptick of requests that have not been made public.

In addition to this sharp rise in demands to redact data, Common Crawl’s web crawler, CCBot, is also increasingly thwarted from accumulating new data from publishers. According to the AI detection startup Originality AI, which often tracks the use of web crawlers, over 44 percent of the top global news and media sites block CCBot. Apart from Buzzfeed, which began blocking it in 2018, most of the prominent outlets it analyzed—including Reuters, The Washington Post, and the CBC—only spurned the crawler in the last year. “They’re being blocked more and more,” Baack says.

Common Crawl’s quick compliance with this kind of request is driven by the realities of keeping a small nonprofit afloat. Compliance does not equate to ideological agreement, though. Skrenta sees this push to remove archival materials from data repositories like Common Crawl as nothing short of an affront to the internet as we know it. “It’s an existential threat,” he says. “They’ll kill the open web.”

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleSamsung Galaxy F15 5G Airtel Edition Reportedly Listed Online; Price, Specifications Leak
Next Article Save up to $1,170 on the Samsung Galaxy Tab S9 Ultra

Related Articles

This Chrome extension blocks social media until you scream (literally) in agony
News

This Chrome extension blocks social media until you scream (literally) in agony

8 February 2026
You Asked: Is Apple TV 4K Still a Good Buy? Bravia 9 or OLED for Bright Rooms?
News

You Asked: Is Apple TV 4K Still a Good Buy? Bravia 9 or OLED for Bright Rooms?

8 February 2026
I Have Fallen in Love With Open Earbuds (and You Should Too)
News

I Have Fallen in Love With Open Earbuds (and You Should Too)

8 February 2026
The Shoes and Brooms Transforming Curling at the 2026 Winter Olympics
News

The Shoes and Brooms Transforming Curling at the 2026 Winter Olympics

8 February 2026
The Best AI Notetakers to Record Your Meetings, Interviews, or Classes
News

The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

8 February 2026
For ,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones?
News

For $4,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones?

8 February 2026
Demo
Top Articles
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024108 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024101 Views
Costco partners with Electric Era to bring back EV charging in the U.S.

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202498 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
For ,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones? News

For $4,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones?

News Room8 February 2026
Meta thinks you’ll want a whole app just for AI videos News

Meta thinks you’ll want a whole app just for AI videos

News Room7 February 2026
7 Steps to Better Financial Health You Can Take Right Now News

7 Steps to Better Financial Health You Can Take Right Now

News Room7 February 2026
Most Popular
The Spectacular Burnout of a Solar Panel Salesman

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025137 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024108 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024101 Views
Our Picks
The Shoes and Brooms Transforming Curling at the 2026 Winter Olympics

The Shoes and Brooms Transforming Curling at the 2026 Winter Olympics

8 February 2026
The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

8 February 2026
For ,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones?

For $4,550, Would You Buy a Single Premium Watch or a Swarm of Affordable Ones?

8 February 2026

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2026 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.