IndiaWeatherBench: A Dataset and Benchmark for Data-Driven Regional Weather Forecasting over India
By: Tung Nguyen , Harkanwar Singh , Nilay Naharas and more
Potential Business Impact:
Improves weather forecasts for specific regions.
Regional weather forecasting is a critical problem for localized climate adaptation, disaster mitigation, and sustainable development. While machine learning has shown impressive progress in global weather forecasting, regional forecasting remains comparatively underexplored. Existing efforts often use different datasets and experimental setups, limiting fair comparison and reproducibility. We introduce IndiaWeatherBench, a comprehensive benchmark for data-driven regional weather forecasting focused on the Indian subcontinent. IndiaWeatherBench provides a curated dataset built from high-resolution regional reanalysis products, along with a suite of deterministic and probabilistic metrics to facilitate consistent training and evaluation. To establish strong baselines, we implement and evaluate a range of models across diverse architectures, including UNets, Transformers, and Graph-based networks, as well as different boundary conditioning strategies and training objectives. While focused on India, IndiaWeatherBench is easily extensible to other geographic regions. We open-source all raw and preprocessed datasets, model implementations, and evaluation pipelines to promote accessibility and future development. We hope IndiaWeatherBench will serve as a foundation for advancing regional weather forecasting research. Code is available at https://github.com/tung-nd/IndiaWeatherBench.
Similar Papers
OceanForecastBench: A Benchmark Dataset for Data-Driven Global Ocean Forecasting
Machine Learning (CS)
Helps predict ocean changes better and faster.
WeatherBench: A Real-World Benchmark Dataset for All-in-One Adverse Weather Image Restoration
CV and Pattern Recognition
Cleans up blurry, bad-weather photos from real life.
ClimateBench-M: A Multi-Modal Climate Data Benchmark with a Simple Generative Method
Machine Learning (CS)
Helps predict weather and floods using different data.