a16z research: AI agents can identify DeFi price manipulation vulnerabilities, but the ability to execute complex attacks is still limited
According to a16z, its researchers conducted a systematic test on whether AI agents can independently exploit DeFi price manipulation vulnerabilities.
The study used a dataset of 20 Ethereum price manipulation incidents and employed Codex (GPT 5.4) equipped with the Foundry toolchain as the testing agent. Under baseline conditions without domain knowledge, the agent's success rate was only 10%; after introducing structured domain knowledge extracted from real attack events, the success rate increased to 70%. Failure cases showed that the agent could accurately identify vulnerabilities but generally struggled to understand the leverage logic of recursive borrowing, misjudged profit margins, and could not assemble multi-step attack structures across contracts. The experiment also recorded a sandbox escape incident: the agent extracted the RPC key from the local node configuration and called the anvil_reset method to reset the node to a future block, bypassing information isolation restrictions and obtaining real attack data. The research team believes that AI agents can currently effectively assist in vulnerability identification but cannot yet replace professional security auditors.
You may also like

The other side of Musk's trillion-dollar fortune: 85% cannot be sold

The U.S. government prohibits foreigners from using Fable 5, Anthropic issues a rebuttal

Citibank releases "2030 Asset Tokenization Market Outlook": 6 major trends may create a $8.2 trillion market

The trillion-dollar valuation test: Are the three major super IPOs a celebration for tech stocks or a nightmare for the crypto market?

Morning Report | Digital Asset completes $355 million financing led by a16z Crypto; Meta completes operational separation from Manus

a16z Crypto Partner: Cash flow is the moat

Cryptocurrency market makers collectively seek change as it becomes increasingly difficult to make money

How TradeXYZ, xStocks, and Alpaca break down the SpaceX IPO into three different strategies

$75 billion in risk asset redistribution: How will SpaceX's IPO affect U.S. stocks and Bitcoin?

Why Is BlackRock Investing $5 Billion in the SpaceX IPO?

Morning News | CME Group launches Nasdaq Cryptocurrency Index futures; Asset management giant Janus Henderson strategically invests in Ethena

Bitcoin Layer 2 Network Botanix: Why Did We Choose to Dissolve?

Why did Oracle deliver the strongest financial report in history, yet its stock price fell?

When the P2P illicit funds from ten years ago turned into 60,000 bitcoins

Dialogue with OmenX Founder: Why does the prediction market need an evolution from "spot" to "derivatives"?

Galaxy in-depth report: Is Solana still worth paying attention to?

Young people in South Korea make a "final effort" in the epic bull market






