Best in Technology
AI Tools Are Secretly Training on Real Images of Children

By News Room | 10 June 2024 | 3 Mins Read

Over 170 images and personal details of children from Brazil have been scraped into an open-source dataset without their knowledge or consent and used to train AI, claims a new report from Human Rights Watch released Monday.

The images were scraped from content posted as recently as 2023 and as far back as the mid-1990s, according to the report, long before any internet user could have anticipated that their content might be used to train AI. Human Rights Watch claims that personal details of these children, alongside links to their photographs, were included in LAION-5B, a dataset that has been a popular source of training data for AI startups.

“Their privacy is violated in the first instance when their photo is scraped and swept into these datasets. And then these AI tools are trained on this data and therefore can create realistic imagery of children,” says Hye Jung Han, children’s rights and technology researcher at Human Rights Watch and the researcher who found these images. “The technology is developed in such a way that any child who has any photo or video of themselves online is now at risk because any malicious actor could take that photo, and then use these tools to manipulate them however they want.”

LAION-5B is based on Common Crawl—a repository of data that was created by scraping the web and made available to researchers—and has been used to train several AI models, including Stability AI’s Stable Diffusion image generation tool. Created by the German nonprofit organization LAION, the dataset is openly accessible and now includes more than 5.85 billion pairs of images and captions, according to its website.

The images of children that researchers found came from mommy blogs and other personal, maternity, or parenting blogs, as well as stills from YouTube videos with small view counts, seemingly uploaded to be shared with family and friends.

“Just looking at the context of where they were posted, they enjoyed an expectation and a measure of privacy,” Hye says. “Most of these images were not possible to find online through a reverse image search.”

YouTube’s terms of service do not allow scraping except under certain circumstances; these instances seem to run afoul of those policies. “We’ve been clear that the unauthorized scraping of YouTube content is a violation of our Terms of Service,” says YouTube spokesperson Jack Malon, “and we continue to take action against this type of abuse.”

In December, researchers at Stanford University found that AI training data collected by LAION-5B contained child sexual abuse material. The problem of explicit deepfakes is on the rise even among students in US schools, where they are being used to bully classmates, especially girls. Hye worries that, beyond children’s photos being used to generate CSAM, the database could reveal potentially sensitive information, such as locations or medical data. In 2022, a US-based artist found her own image in the LAION dataset, and realized it was from her private medical records.
