Lower Complexity Bounds for Nonconvex-Strongly-Convex Bilevel Optimization with First-Order Oracles
By: Kaiyi Ji
Potential Business Impact:
Makes solving tricky math problems much faster.
Although upper bound guarantees for bilevel optimization have been widely studied, progress on lower bounds has been limited due to the complexity of the bilevel structure. In this work, we focus on the smooth nonconvex-strongly-convex setting and develop new hard instances that yield nontrivial lower bounds under deterministic and stochastic first-order oracle models. In the deterministic case, we prove that any first-order zero-respecting algorithm requires at least $Ω(κ^{3/2}ε^{-2})$ oracle calls to find an $ε$-accurate stationary point, improving the optimal lower bounds known for single-level nonconvex optimization and for nonconvex-strongly-convex min-max problems. In the stochastic case, we show that at least $Ω(κ^{5/2}ε^{-4})$ stochastic oracle calls are necessary, again strengthening the best known bounds in related settings. Our results expose substantial gaps between current upper and lower bounds for bilevel optimization and suggest that even simplified regimes, such as those with quadratic lower-level objectives, warrant further investigation toward understanding the optimal complexity of bilevel optimization under standard first-order oracles.
Similar Papers
On the Condition Number Dependency in Bilevel Optimization
Optimization and Control
Finds better ways to solve tricky math problems.
Stochastic Bilevel Optimization with Heavy-Tailed Noise
Machine Learning (CS)
Teaches computers to learn better from messy data.
Faster Gradient Methods for Highly-smooth Stochastic Bilevel Optimization
Optimization and Control
Makes smart computer learning faster and better.