Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

The Person in Charge of Testing Tech for US Spies Has Resigned

4 July 2025

How Gearbox Designed The Open World Of Borderlands 4

4 July 2025

Trump’s Defiance of TikTok Ban Prompted Immunity Promises to 10 Tech Companies

4 July 2025
Facebook X (Twitter) Instagram
Just In
  • The Person in Charge of Testing Tech for US Spies Has Resigned
  • How Gearbox Designed The Open World Of Borderlands 4
  • Trump’s Defiance of TikTok Ban Prompted Immunity Promises to 10 Tech Companies
  • The 61 Best Early Amazon Prime Day Deals
  • A Game Called Date Everything Literally Lets You Date Everything—Except People
  • Anthem’s Servers Are Shutting Down In January, Rendering It Completely Unplayable
  • Trump’s ‘Big Beautiful Bill’ Would Leave Millions Without Health Insurance
  • Trump Officials Want to Prosecute Over the ICEBlock App. Lawyers Say That’s Unconstitutional
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal
News

Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal

News RoomBy News Room10 January 20253 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

“Meta has treated the so-called ‘public availability’ of shadow datasets as a get-out-of-jail-free card, notwithstanding that internal Meta records show every relevant decision-maker at Meta, up to and including its CEO, Mark Zuckerberg, knew LibGen was ‘a dataset we know to be pirated,’” the plaintiffs allege in this motion. (Originally filed in late 2024, the motion is a request to file a third amended complaint.)

In addition to the plaintiffs’ briefs, another filing was unredacted in response to Chhabria’s order—Meta’s opposition to the motion to file an amended complaint. It argues that the authors’ attempts to add additional claims to the case are an “eleventh-hour gambit based on a false and inflammatory premise” and denies that Meta waited to reveal crucial information in discovery. Instead, Meta argues it first revealed to the plaintiffs that it used a LibGen dataset in July 2024. (Because much of the discovery materials remain confidential, it is difficult for WIRED to confirm that claim.)

Meta’s argument hinges on its claim that the plaintiffs already knew about the LibGen use and shouldn’t be granted additional time to file a third amended claim when they had ample time to do so before discovery ended in December 2024. “Plaintiffs knew of Meta’s downloading and use of LibGen and other alleged ‘shadow libraries’ since at least mid-July 2024,” the tech giant’s lawyers argue.

In November 2023, Chhabria granted Meta’s motion to dismiss some of the lawsuit’s claims, including its claim Meta’s alleged use of the authors’ work to train AI violated the Digital Millennium Copyright Act, a US law introduced in 1998 to stop people from selling or duplicating copyrighted works on the internet. At the time, the judge agreed with Meta’s stance that the plaintiffs had not provided sufficient evidence to prove that the company had removed what’s known as “copyright management information,” like the author’s name and title of the work.

The unredacted documents argue that the plaintiffs should be allowed to amend their complaint, alleging that the information Meta revealed is evidence that the DMCA claim was warranted. They also say the discovery process has unearthed reasons to add new allegations. “Meta, through a corporate representative who testified on November 20, 2024, has now admitted under oath to uploading (aka ‘seeding’) pirated files containing Plaintiffs’ works on ‘torrent’ sites,” the motion alleges. (Seeding is when torrented files are then shared with other peers after they have finished downloading.)

“This torrenting activity turned Meta itself into a distributor of the very same pirated copyrighted material that it was also downloading for use in its commercially available AI models,” one of the newly unredacted documents claims, alleging that Meta, in other words, had not just used copyrighted material without permission but also disseminated it.

LibGen, an archive of books uploaded to the internet that originated in Russia around 2008, is one of the largest and most controversial “shadow libraries” in the world. In 2015, a New York judge ordered a preliminary injunction against the site, a measure designed in theory to temporarily shut the archive down, but its anonymous administrators simply switched its domain. In September 2024, a different New York judge ordered LibGen to pay $30 million to the rights holders for infringing on their copyrights, despite not knowing who actually operates the piracy hub.

Meta’s discovery woes for this case aren’t over, either. In the same order, Chhabria warned the tech giant against any overly sweeping redaction requests in the future: “If Meta again submits an unreasonably broad sealing request, all materials will simply be unsealed,” he wrote.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleSnap up this LG 27-inch gaming monitor for $126 at Walmart
Next Article FAA issues stern warning after drone and aircraft collide near L.A. fires

Related Articles

News

The Person in Charge of Testing Tech for US Spies Has Resigned

4 July 2025
News

Trump’s Defiance of TikTok Ban Prompted Immunity Promises to 10 Tech Companies

4 July 2025
News

The 61 Best Early Amazon Prime Day Deals

3 July 2025
News

A Game Called Date Everything Literally Lets You Date Everything—Except People

3 July 2025
News

Trump’s ‘Big Beautiful Bill’ Would Leave Millions Without Health Insurance

3 July 2025
News

Trump Officials Want to Prosecute Over the ICEBlock App. Lawyers Say That’s Unconstitutional

3 July 2025
Demo
Top Articles

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024100 Views

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views

Oppo Reno 14, Reno 14 Pro India Launch Timeline and Colourways Leaked

27 May 202581 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
Gaming

Anthem’s Servers Are Shutting Down In January, Rendering It Completely Unplayable

News Room3 July 2025
News

Trump’s ‘Big Beautiful Bill’ Would Leave Millions Without Health Insurance

News Room3 July 2025
News

Trump Officials Want to Prosecute Over the ICEBlock App. Lawyers Say That’s Unconstitutional

News Room3 July 2025
Most Popular

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025124 Views

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024100 Views

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views
Our Picks

The 61 Best Early Amazon Prime Day Deals

3 July 2025

A Game Called Date Everything Literally Lets You Date Everything—Except People

3 July 2025

Anthem’s Servers Are Shutting Down In January, Rendering It Completely Unplayable

3 July 2025

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2025 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.