About this episode
AI is getting dangerously good at smart contract security, faster than crypto is ready for. Alpin Yukseloglu joins Bankless to break down EVMBench (built with OpenAI), a benchmark testing whether AI agents can detect, patch, and exploit real fund-draining bugs, and to explain why the jump from ~12–13% exploit-finding to 70%+ could rewrite today’s security assumptions. We unpack what that “70%” really means, why crypto’s verifiability is an ideal training ground, why AI labs haven’t prioritized crypto data yet, and what a 24/7 blackhat vs whitehat AI arms race means for DeFi.
---
SPOTIFY PREMIUM RSS FEED | USE CODE: SPOTIFY24
https://bankless.cc/spotify-premium
---
BANKLESS SPONSOR TOOLS:
POLYMARKET | #1 PREDICTION MARKET
https://bankless.cc/polymarket-podcast
GALAXY | INSTITUTIONAL DIGITAL FINANCE
https://bankless.cc/galaxy-podcast
EUPHORIA | REAL-TIME ONE-TAP TRADING
https://bankless.cc/euphoria
BRIX | EMERGING MARKET YIELD
https://bankless.cc/brix
BITGET TRADFI | TRADE GOLD WITH USDT
https://bankless.cc/bitget
THE DEFI REPORT | ONCHAIN INSIGHTS
https://thedefireport.io/bankless
---
TIMESTAMPS
0:00 AI’s exploit leap: 12% → 70% and the “Superhuman auditors”
7:02 Staring at the singularity without losing your mind
10:31 Agency » doom: the Thiel framing
19:10 What’s most at risk (and what’s safer)
23:37 What EVMBench actually is (benchmark + harness)
27:03 Why exploiting is the key: killing false positives
29:24 AI gets “good at crypto” fast: verifiability
30:56 What “70% exploit rate” really means
33:32 Why AI labs avoided crypto (it’s not technical)
43:38 Blackhat vs whitehat: how the race plays out
47:21 Agents and “payments at the speed of light”
51:02 EVM vs Solana: network effects
56:18 AI formal verification as an endgame
58:06 EVMBench V2: expanding the frontier
59:54 Why Alpin stays in crypto
---
RESOURCES
Alpin Yukseloglu
https://x.com/0xalpo
EVMBench
https://paradigm.xyz/evmb