Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Nothing’s “Essential Apps” let you build personalized widgets with text-based prompts

Nothing’s “Essential Apps” let you build personalized widgets with text-based prompts

11 February 2026
Salesforce Workers Circulate Open Letter Urging CEO Marc Benioff to Denounce ICE

Salesforce Workers Circulate Open Letter Urging CEO Marc Benioff to Denounce ICE

11 February 2026
Google now helps you wipe your sensitive personal data and photos from Search

Google now helps you wipe your sensitive personal data and photos from Search

10 February 2026
Facebook X (Twitter) Instagram
Just In
  • Nothing’s “Essential Apps” let you build personalized widgets with text-based prompts
  • Salesforce Workers Circulate Open Letter Urging CEO Marc Benioff to Denounce ICE
  • Google now helps you wipe your sensitive personal data and photos from Search
  • RFK Jr. Says Americans Need More Protein. His Grok-Powered Food Website Disagrees
  • Planet Of Lana 2 Is Sci-Fi Art In Motion | New Gameplay Today
  • The next wave of spec-monster phones could get a 100-megapixel selfie camera
  • The Physics Behind the Quadruple Axel, the Most Difficult Jump in Figure Skating
  • AI is helping call center scammers dupe more victims worldwide
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » Researchers Propose a Better Way to Report Dangerous AI Flaws
News

Researchers Propose a Better Way to Report Dangerous AI Flaws

News RoomBy News Room13 March 20253 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Researchers Propose a Better Way to Report Dangerous AI Flaws
Share
Facebook Twitter LinkedIn Pinterest Email

In late 2023, a team of third party researchers discovered a troubling glitch in OpenAI’s widely used artificial intelligence model GPT-3.5.

When asked to repeat certain words a thousand times, the model began repeating the word over and over, then suddenly switched to spitting out incoherent text and snippets of personal information drawn from its training data, including parts of names, phone numbers, and email addresses. The team that discovered the problem worked with OpenAI to ensure the flaw was fixed before revealing it publicly. It is just one of scores of problems found in major AI models in recent years.

In a proposal released today, more than 30 prominent AI researchers, including some who found the GPT-3.5 flaw, say that many other vulnerabilities affecting popular models are reported in problematic ways. They suggest a new scheme supported by AI companies that gives outsiders permission to probe their models and a way to disclose flaws publicly.

“Right now it’s a little bit of the Wild West,” says Shayne Longpre, a PhD candidate at MIT and the lead author of the proposal. Longpre says that some so-called jailbreakers share their methods of breaking AI safeguards the social media platform X, leaving models and users at risk. Other jailbreaks are shared with only one company even though they might affect many. And some flaws, he says, are kept secret because of fear of getting banned or facing prosecution for breaking terms of use. “It is clear that there are chilling effects and uncertainty,” he says.

The security and safety of AI models is hugely important given widely the technology is now being used, and how it may seep into countless applications and services. Powerful models need to be stress-tested, or red-teamed, because they can harbor harmful biases, and because certain inputs can cause them to break free of guardrails and produce unpleasant or dangerous responses. These include encouraging vulnerable users to engage in harmful behavior or helping a bad actor to develop cyber, chemical, or biological weapons. Some experts fear that models could assist cyber criminals or terrorists, and may even turn on humans as they advance.

The authors suggest three main measures to improve the third-party disclosure process: adopting standardized AI flaw reports to streamline the reporting process; for big AI firms to provide infrastructure to third-party researchers disclosing flaws; and for developing a system that allows flaws to be shared between different providers.

The approach is borrowed from the cybersecurity world, where there are legal protections and established norms for outside researchers to disclose bugs.

“AI researchers don’t always know how to disclose a flaw and can’t be certain that their good faith flaw disclosure won’t expose them to legal risk,” says Ilona Cohen, chief legal and policy officer at HackerOne, a company that organizes bug bounties, and a coauthor on the report.

Large AI companies currently conduct extensive safety testing on AI models prior to their release. Some also contract with outside firms to do further probing. “Are there enough people in those [companies] to address all of the issues with general-purpose AI systems, used by hundreds of millions of people in applications we’ve never dreamt?” Longpre asks. Some AI companies have started organizing AI bug bounties. However, Longpre says that independent researchers risk breaking the terms of use if they take it upon themselves to probe powerful AI models.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleInfinix Note 50X 5G Said to Feature MediaTek’s Dimensity 7300 Ultimate Chipset
Next Article Google is giving free access to two of Gemini’s best AI features

Related Articles

Nothing’s “Essential Apps” let you build personalized widgets with text-based prompts
News

Nothing’s “Essential Apps” let you build personalized widgets with text-based prompts

11 February 2026
Salesforce Workers Circulate Open Letter Urging CEO Marc Benioff to Denounce ICE
News

Salesforce Workers Circulate Open Letter Urging CEO Marc Benioff to Denounce ICE

11 February 2026
Google now helps you wipe your sensitive personal data and photos from Search
News

Google now helps you wipe your sensitive personal data and photos from Search

10 February 2026
RFK Jr. Says Americans Need More Protein. His Grok-Powered Food Website Disagrees
News

RFK Jr. Says Americans Need More Protein. His Grok-Powered Food Website Disagrees

10 February 2026
The next wave of spec-monster phones could get a 100-megapixel selfie camera
News

The next wave of spec-monster phones could get a 100-megapixel selfie camera

10 February 2026
The Physics Behind the Quadruple Axel, the Most Difficult Jump in Figure Skating
News

The Physics Behind the Quadruple Axel, the Most Difficult Jump in Figure Skating

10 February 2026
Demo
Top Articles
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024108 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024101 Views
Costco partners with Electric Era to bring back EV charging in the U.S.

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202498 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
The next wave of spec-monster phones could get a 100-megapixel selfie camera News

The next wave of spec-monster phones could get a 100-megapixel selfie camera

News Room10 February 2026
The Physics Behind the Quadruple Axel, the Most Difficult Jump in Figure Skating News

The Physics Behind the Quadruple Axel, the Most Difficult Jump in Figure Skating

News Room10 February 2026
AI is helping call center scammers dupe more victims worldwide News

AI is helping call center scammers dupe more victims worldwide

News Room10 February 2026
Most Popular
The Spectacular Burnout of a Solar Panel Salesman

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025137 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024108 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024101 Views
Our Picks
RFK Jr. Says Americans Need More Protein. His Grok-Powered Food Website Disagrees

RFK Jr. Says Americans Need More Protein. His Grok-Powered Food Website Disagrees

10 February 2026
Planet Of Lana 2 Is Sci-Fi Art In Motion | New Gameplay Today

Planet Of Lana 2 Is Sci-Fi Art In Motion | New Gameplay Today

10 February 2026
The next wave of spec-monster phones could get a 100-megapixel selfie camera

The next wave of spec-monster phones could get a 100-megapixel selfie camera

10 February 2026

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2026 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.