Adaptive control mechanisms in gradient descent algorithms
By: Andrea Iannelli
Potential Business Impact:
Makes computer learning faster and more accurate.
The problem of designing adaptive stepsize sequences for the gradient descent method applied to convex and locally smooth functions is studied. We take an adaptive control perspective and design update rules for the stepsize that make use of both past (measured) and future (predicted) information. We show that Lyapunov analysis can guide in the systematic design of adaptive parameters striking a balance between convergence rates and robustness to computational errors or inexact gradient information. Theoretical and numerical results indicate that closed-loop adaptation guided by system theory is a promising approach for designing new classes of adaptive optimization algorithms with improved convergence properties.
Similar Papers
Adaptive Conditional Gradient Descent
Optimization and Control
Makes computer learning faster and better.
Gradient Descent with Provably Tuned Learning-rate Schedules
Machine Learning (CS)
Teaches computers to learn better, even when tricky.
Gradient Methods with Online Scaling Part II. Practical Aspects
Optimization and Control
Makes computer learning faster and use less memory.