Autonomous AI agent hacked a database in under an hour

An AI agent broke into a system, moved across four internal servers, stole credentials, and dumped an entire database. No human directed it in real time. No script gave it step-by-step instructions. It figured out each move on its own, in less than sixty minutes.

The Sysdig Threat Research Team documented the incident, which took place on May 10. The team identified it as the first known real-world intrusion where a large language model agent operated with fully autonomous, goal-oriented behavior. The attack started with CVE-2026-39987, a vulnerability on a publicly accessible Marimo notebook. Once inside, the agent mapped the environment, pulled credentials from AWS Secrets Manager, then hopped through four systems. It used WebSocket connections and Cloudflare Workers to hide its activity. The final step was exfiltrating a full PostgreSQL database dump.

Six days earlier, a separate prompt-injection attack hit an integrated Grok wallet. The attacker tricked the LLM into transferring about $175,000 in DRB tokens to an external address. The token's price fell roughly 40% after the exploit. The direct vector was the AI integration layer itself: no vulnerability in the token contract, just a way to make the agent do what it was not supposed to do.

The Sysdig researchers said the May 10 intrusion marks a shift from AI-assisted hacking to AI-led hacking. Earlier cases involved LLMs generating malicious code for humans to use, or assisting human operators. This agent made real-time decisions without a predefined playbook.

Security analysts tracking LLM agents note a rise in automated post-exploitation and credential harvesting since early 2026. The DRB wallet incident provides a concrete example of how quickly market sentiment can collapse when an AI-integrated system gets compromised. A 40% price decline from a single autonomous exploit is the kind of volatility that draws attention from financial regulators.

For anyone evaluating projects that rely on large language models, the immediate questions are about prompt injection defenses, sandboxing of agent actions, and isolation of credentials. The Sysdig-find attack showed that once an agent has access to a secrets manager, the rest of the chain can unfold in under an hour.

Autonomous AI agent hacked a database in under an hour

Explore More

More from AlphaScala

Trading Q&A

Related Tools & Research