Best in Technology

Amazon Is Investigating Perplexity Over Claims of Scraping Abuse

By News Room | 28 June 2024 | 3 Mins Read

Amazon’s cloud division has launched an investigation into Perplexity AI. At issue is whether the AI search startup is violating Amazon Web Services rules by scraping websites that attempted to prevent it from doing so, WIRED has learned.

An AWS spokesperson, who spoke to WIRED on the condition that they would not be named, confirmed the company’s investigation of Perplexity. WIRED had previously found that the startup, which is backed by the Jeff Bezos family fund and Nvidia and was recently valued at $3 billion, appears to rely on content scraped from websites that had forbidden access through the Robots Exclusion Protocol, a common web standard. While the Robots Exclusion Protocol is not legally binding, terms of service generally are.

The Robots Exclusion Protocol is a decades-old web standard that involves placing a plaintext file (like wired.com/robots.txt) on a domain to indicate which pages should not be accessed by automated bots and crawlers. While companies that use scrapers can choose to ignore this protocol, most have traditionally respected it. The Amazon spokesperson told WIRED that AWS customers must adhere to the robots.txt standard while crawling websites.
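As a rough illustration of how a well-behaved crawler might honor such a file, the sketch below uses Python's standard `urllib.robotparser` module. The robots.txt rules, the bot names (`ExampleBot`, `OtherBot`), and the `example.com` URLs are all hypothetical and are not taken from any site named in this article.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. The rules and bot names are
# illustrative only and do not reproduce any real site's file.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules allow user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines; a live crawler would more likely
    # call set_url() and read() to download the file first.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # ExampleBot is blocked from the whole site.
    print(can_crawl(ROBOTS_TXT, "ExampleBot", "https://example.com/story"))
    # Other bots fall under the wildcard rule and are only kept out of /private/.
    print(can_crawl(ROBOTS_TXT, "OtherBot", "https://example.com/story"))
    print(can_crawl(ROBOTS_TXT, "OtherBot", "https://example.com/private/notes"))
```

The check is purely advisory: nothing in the protocol stops a crawler that skips this step, which is why compliance depends on the operator choosing to run it.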

“AWS’s terms of service prohibit customers from using our services for any illegal activity, and our customers are responsible for complying with our terms and all applicable laws,” the spokesperson said in a statement.

Scrutiny of Perplexity’s practices follows a June 11 report from Forbes that accused the startup of stealing at least one of its articles. WIRED investigations confirmed the practice and found further evidence of scraping abuse and plagiarism by systems linked to Perplexity’s AI-powered search chatbot. Engineers for Condé Nast, WIRED’s parent company, block Perplexity’s crawler across all its websites using a robots.txt file. But WIRED found the company had access to a server using an unpublished IP address—44.221.181.252—which visited Condé Nast properties at least hundreds of times in the past three months, apparently to scrape Condé Nast websites.

The machine associated with Perplexity appears to be engaged in widespread crawling of news websites that forbid bots from accessing their content. Spokespeople for the Guardian, Forbes, and The New York Times also say they detected the IP address on their servers multiple times.

WIRED traced the IP address to a virtual machine known as an Elastic Compute Cloud (EC2) instance hosted on AWS, which launched its investigation after we asked whether using AWS infrastructure to scrape websites that forbade it violated the company’s terms of service.

Last week, Perplexity CEO Aravind Srinivas responded to WIRED’s investigation first by saying the questions we posed to the company “reflect a deep and fundamental misunderstanding of how Perplexity and the Internet work.” Srinivas then told Fast Company that the secret IP address WIRED observed scraping Condé Nast websites and a test site we created was operated by a third-party company that performs web crawling and indexing services. He refused to name the company, citing a nondisclosure agreement. When asked if he would tell the third party to stop crawling WIRED, Srinivas replied, “it’s complicated.”
