The Bidding Games: Reinforcement Learning for MEV Extraction on Polygon Blockchain
By: Andrei Seoev , Leonid Gremyachikh , Anastasiia Smirnova and more
Potential Business Impact:
Helps make more money from online transactions.
In blockchain networks, the strategic ordering of transactions within blocks has emerged as a significant source of profit extraction, known as Maximal Extractable Value (MEV). The transition from spam-based Priority Gas Auctions to structured auction mechanisms like Polygon Atlas has transformed MEV extraction from public bidding wars into sealed-bid competitions under extreme time constraints. While this shift reduces network congestion, it introduces complex strategic challenges where searchers must make optimal bidding decisions within a sub-second window without knowledge of competitor behavior or presence. Traditional game-theoretic approaches struggle in this high-frequency, partially observable environment due to their reliance on complete information and static equilibrium assumptions. We present a reinforcement learning framework for MEV extraction on Polygon Atlas and make three contributions: (1) A novel simulation environment that accurately models the stochastic arrival of arbitrage opportunities and probabilistic competition in Atlas auctions; (2) A PPO-based bidding agent optimized for real-time constraints, capable of adaptive strategy formulation in continuous action spaces while maintaining production-ready inference speeds; (3) Empirical validation demonstrating our history-conditioned agent captures 49\% of available profits when deployed alongside existing searchers and 81\% when replacing the market leader, significantly outperforming static bidding strategies. Our work establishes that reinforcement learning provides a critical advantage in high-frequency MEV environments where traditional optimization methods fail, offering immediate value for industrial participants and protocol designers alike.
Similar Papers
Unpacking Maximum Extractable Value on Polygon: A Study on Atomic Arbitrage
Distributed, Parallel, and Cluster Computing
Finds ways to make money from digital money trades.
MEV in Multiple Concurrent Proposer Blockchains
CS and Game Theory
Finds ways to make online money faster and safer.
Certifying optimal MEV strategies with Lean
Cryptography and Security
Proves no one can steal money from online finance.