Business Wire

Carnegie Mellon Researchers Demonstrate That LLMs Can Autonomously Plan and Execute Real-World Cyberattacks


New study reveals how AI could both challenge and strengthen future cybersecurity defenses

In a major advance in the fields of cybersecurity and artificial intelligence, researchers from Carnegie Mellon University, in collaboration with Anthropic, have demonstrated that large language models (LLMs) can autonomously plan and execute sophisticated cyberattacks on enterprise-grade network environments without human intervention.

The study, led by Ph.D. candidate Brian Singer from Carnegie Mellon's Department of Electrical and Computer Engineering, reveals that LLMs, when structured with high-level planning capabilities and supported by specialized agent frameworks, can simulate network intrusions that closely mirror real-world breaches. The study’s most striking finding: an LLM was able to successfully replicate the infamous 2017 Equifax data breach in a controlled research environment—autonomously exploiting vulnerabilities, installing malware, and exfiltrating data.

“Our research shows that with the right abstractions and guidance, LLMs can go far beyond basic tasks,” said Singer. “They can coordinate and execute attack strategies that reflect real-world complexity.”

The team developed a hierarchical architecture where the LLM acts as a strategist, planning the attack and issuing high-level instructions, while a mix of LLM and non-LLM agents carry out low-level tasks like scanning networks or deploying exploits. This approach proved far more effective than earlier methods, which relied solely on LLMs executing shell commands.

This work builds on Singer’s prior research into making autonomous attacker and defender tools more accessible and programmable for human developers. Ironically, the same abstractions that simplified development for humans made it easier for LLMs to autonomously perform similar tasks.

While the findings are groundbreaking, Singer emphasized that the research remains a prototype.

“This isn’t something that’s going to take down the internet tomorrow,” he said. “The scenarios are constrained and controlled—but it’s a powerful step forward.”

The implications are twofold: the research highlights serious long-term safety concerns about the potential misuse of increasingly capable LLMs, but it also opens up transformative possibilities for defensive cybersecurity.

“Today, only large organizations can afford red team exercises to proactively test their defenses,” Singer explained. “This research points toward a future where AI systems continuously test networks for vulnerabilities, making these protections accessible to small organizations too.”

The project was conducted in collaboration with Anthropic, which provided model credits and technical consultation. The team included CMU students and faculty affiliated with CyLab, the university’s security and privacy institute. An early version of the research was presented at an OpenAI-hosted security workshop in May.

The resulting paper, “On the Feasibility of Using LLMs to Autonomously Execute Multi-host Network Attacks,” has been cited in multiple industry reports and is already informing safety documentation for cutting-edge AI systems. Lujo Bauer and Vyas Sekar, co-directors of CMU’s Future Enterprise Security Initiative, served as faculty advisors for the project.

Looking ahead, the team is now studying how similar architectures might enable autonomous AI defenses, exploring scenarios where LLM-based agents detect and respond to attacks in real time.

“We're entering an era of AI versus AI in cybersecurity,” Singer said. “And we need to understand both sides to stay ahead.”

About the College of Engineering: The College of Engineering at Carnegie Mellon University is a top-ranked engineering college known for its intentional focus on cross-disciplinary collaboration in research. The College is well known for working on problems of both scientific and practical importance. Our “maker” culture is ingrained in all that we do, leading to novel approaches and transformative results. Our acclaimed faculty have a focus on innovation management and engineering to yield transformative results that will drive the intellectual and economic vitality of our community, nation, and world.

About CyLab: CyLab is the university-wide security and privacy institute at Carnegie Mellon University. We coordinate security and privacy research and education across all university departments. Our mission is to catalyze, support, promote, and strengthen collaborative security and privacy research and education across departments, disciplines, and geographic boundaries to achieve significant impact on research, education, public policy, and practice.

View source version on businesswire.com: https://www.businesswire.com/news/home/20250724351815/en/


