Best in Technology
AI Agents Are Getting Better at Writing Code—and Hacking It as Well

By News Room | 25 June 2025 | 3 Mins Read

The latest artificial intelligence models are not only remarkably good at software engineering; new research shows they are also getting steadily better at finding bugs in software.

AI researchers at UC Berkeley tested how well the latest AI models and agents could find vulnerabilities in 188 large open source codebases. Using a new benchmark called CyberGym, the AI models identified 17 bugs, including 15 previously unknown, or “zero-day,” ones. “Many of these vulnerabilities are critical,” says Dawn Song, a professor at UC Berkeley who led the work.

Many experts expect AI models to become formidable cybersecurity weapons. An AI tool from startup Xbow has crept up the ranks of HackerOne’s leaderboard for bug hunting and currently sits in top place. The company recently announced $75 million in new funding.

Song says that the coding skills of the latest AI models combined with improving reasoning abilities are starting to change the cybersecurity landscape. “This is a pivotal moment,” she says. “It actually exceeded our general expectations.”

As the models continue to improve, they will automate the process of both discovering and exploiting security flaws. This could help companies keep their software safe but may also aid hackers in breaking into systems. “We didn’t even try that hard,” Song says. “If we ramped up on the budget, allowed the agents to run for longer, they could do even better.”

The UC Berkeley team tested conventional frontier AI models from OpenAI, Google, and Anthropic, as well as open source offerings from Meta, DeepSeek, and Alibaba, combined with several agents for finding bugs, including OpenHands, Cybench, and EnIGMA.

The researchers used descriptions of known software vulnerabilities from the 188 software projects. They then fed the descriptions to the cybersecurity agents powered by frontier AI models to see if they could identify the same flaws for themselves by analyzing new codebases, running tests, and crafting proof-of-concept exploits. The team also asked the agents to hunt for new vulnerabilities in the codebases by themselves.
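The evaluation loop described above can be pictured with a small sketch. This is not the CyberGym harness: the `Task` type, the `evaluate` function, the toy parser, and the toy agent are all hypothetical stand-ins, and the success rule (a proof-of-concept input counts if it makes the target fail) is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Task:
    """One benchmark item: a vulnerability description plus a target to test."""
    name: str
    description: str
    target: Callable[[bytes], None]  # stand-in for a built copy of a project


# A hypothetical agent maps a description to a candidate proof-of-concept input.
Agent = Callable[[str], Optional[bytes]]


def triggers_flaw(target: Callable[[bytes], None], poc: bytes) -> bool:
    """Return True if the target raises on the input (a stand-in for a crash)."""
    try:
        target(poc)
    except Exception:
        return True
    return False


def evaluate(agent: Agent, tasks: list[Task]) -> dict[str, bool]:
    """Ask the agent for one proof of concept per task and record whether it works."""
    results = {}
    for task in tasks:
        poc = agent(task.description)
        results[task.name] = poc is not None and triggers_flaw(task.target, poc)
    return results


def toy_parser(data: bytes) -> None:
    """Toy target: simulates a read past the end of a length-prefixed payload."""
    if not data:
        return
    declared, payload = data[0], data[1:]
    if declared > len(payload):
        raise IndexError("read past end of payload")


def toy_agent(description: str) -> Optional[bytes]:
    """Toy agent: only knows how to attack flaws that mention a length field."""
    if "length" in description:
        return b"\xff"  # declares 255 bytes of payload but supplies none
    return None


TASKS = [
    Task("toy-length", "Parser trusts the declared length field", toy_parser),
    Task("toy-other", "Unrelated flaw in date handling", toy_parser),
]

if __name__ == "__main__":
    print(evaluate(toy_agent, TASKS))
```

In a real study, the target would be an actual build of each project and the agent would be a model-driven tool that reads the code and runs tests before proposing an input; the sketch only shows the shape of the scoring loop.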

Through the process, the AI tools generated hundreds of proof-of-concept exploits. From these, the researchers identified 15 previously unseen vulnerabilities and two vulnerabilities that had previously been disclosed and patched. The work adds to growing evidence that AI can automate the discovery of zero-day vulnerabilities, which are potentially dangerous (and valuable) because they may provide a way to hack live systems.

AI seems destined to become an important part of the cybersecurity industry. Security expert Sean Heelan recently discovered a zero-day flaw in the widely used Linux kernel with help from OpenAI’s reasoning model o3. Last November, Google announced that it had discovered a previously unknown software vulnerability using AI through a program called Project Zero.

Like other parts of the software industry, many cybersecurity firms are enamored with the potential of AI. The new work indeed shows that AI can routinely find new flaws, but it also highlights remaining limitations with the technology. The AI systems were unable to find most flaws and were stumped by especially complex ones.

© 2025 Best in Technology. All Rights Reserved.
