Cycling Race Time Prediction: A Personalized Machine Learning Approach Using Route Topology and Training Load
By: Francisco Aguilera Moreno
Potential Business Impact:
Predicts bike ride time using your fitness.
Predicting cycling duration for a given route is essential for training planning and event preparation. Existing solutions rely on physics-based models that require extensive parameterization, including aerodynamic drag coefficients and real-time wind forecasts, parameters impractical for most amateur cyclists. This work presents a machine learning approach that predicts ride duration using route topology features combined with the athlete's current fitness state derived from training load metrics. The model learns athlete-specific performance patterns from historical data, substituting complex physical measurements with historical performance proxies. We evaluate the approach using a single-athlete dataset (N=96 rides) in an N-of-1 study design. After rigorous feature engineering to eliminate data leakage, we find that Lasso regression with Topology + Fitness features achieves MAE=6.60 minutes and R2=0.922. Notably, integrating fitness metrics (CTL, ATL) reduces error by 14% compared to topology alone (MAE=7.66 min), demonstrating that physiological state meaningfully constrains performance even in self-paced efforts. Progressive checkpoint predictions enable dynamic race planning as route difficulty becomes apparent.
Similar Papers
Cyclists route choice modeling from trip duration data in urban areas
Applications
Finds bike routes without GPS tracking.
How does the Performance of the Data-driven Traffic Flow Forecasting Models deteriorate with Increasing Forecasting Horizon? An Extensive Approach Considering Statistical, Machine Learning and Deep Learning Models
Machine Learning (CS)
Predicts traffic jams before they happen.
A zone-based training approach for last-mile routing using Graph Neural Networks and Pointer Networks
Machine Learning (CS)
Makes delivery trucks take shorter, faster routes.