
Forests of Uncertaint(r)ees: Using tree-based ensembles to estimate probability distributions of future conflict

Published: December 5, 2025 | arXiv ID: 2512.06210v1

By: Daniel Mittermaier, Tobias Bohne, Martin Hofer and more

Potential Business Impact:

Predicts war danger more accurately.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Predictions of fatalities from violent conflict on the PRIO-GRID-month (pgm) level are characterized by high levels of uncertainty, limiting their usefulness in practical applications. We discuss the two main sources of uncertainty for this prediction task, the nature of violent conflict and data limitations, embedding this in the wider literature on uncertainty quantification in machine learning. We develop a strategy to quantify uncertainty in conflict forecasting, shifting from traditional point predictions to full predictive distributions. Our approach compares and combines multiple tree-based classifiers and distributional regressors in a custom auto-ML setup, estimating distributions for each pgm individually. We also test the integration of regional models in spatial ensembles as a potential avenue to reduce uncertainty. The models consistently outperform a suite of benchmarks derived from conflict history in predictions up to one year in advance, with performance driven by regions where conflict was observed. With our evaluation, we emphasize the need to understand how a metric behaves for a given prediction problem, in our case one characterized by extremely high zero inflation. While it does not result in better predictions, the integration of smaller models does not decrease performance for this prediction task, opening avenues to integrate data sources with less spatial coverage in the future.
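The abstract's point about metric behavior under heavy zero inflation can be illustrated with a toy calculation. The sketch below is not taken from the paper: the abstract does not name its evaluation metric, so the continuous ranked probability score (CRPS), estimated from forecast samples, is used here purely as an assumption, and all data and names (`crps_from_samples`, `observations`, `all_zero`, `mixed`) are hypothetical.

```python
from itertools import product


def crps_from_samples(samples, observed):
    """Sample-based CRPS estimate: E|X - y| - 0.5 * E|X - X'|."""
    m = len(samples)
    # Average distance between forecast samples and the observed value.
    term_obs = sum(abs(x - observed) for x in samples) / m
    # Average distance between all ordered pairs of forecast samples.
    term_spread = sum(abs(a - b) for a, b in product(samples, repeat=2)) / (m * m)
    return term_obs - 0.5 * term_spread


# Hypothetical toy data: 99 grid cells with zero fatalities, one cell with 50.
observations = [0] * 99 + [50]

# Forecast A (assumed): a point mass at zero for every cell.
all_zero = [0] * 10
# Forecast B (assumed): mostly zeros, with some mass on positive counts.
mixed = [0] * 8 + [10, 60]


def mean_crps(samples):
    """Average CRPS of one forecast distribution applied to every cell."""
    return sum(crps_from_samples(samples, y) for y in observations) / len(observations)


mean_crps_zero = mean_crps(all_zero)
mean_crps_mixed = mean_crps(mixed)

# Scores restricted to the single cell where conflict was observed.
crps_conflict_zero = crps_from_samples(all_zero, 50)
crps_conflict_mixed = crps_from_samples(mixed, 50)

print(f"mean CRPS, all-zero forecast: {mean_crps_zero:.2f}")
print(f"mean CRPS, mixed forecast:    {mean_crps_mixed:.2f}")
print(f"conflict cell, all-zero:      {crps_conflict_zero:.2f}")
print(f"conflict cell, mixed:         {crps_conflict_mixed:.2f}")
```

In this toy setup the all-zero forecast gets the better (lower) average score across all cells, while the mixed forecast scores better on the one cell with observed conflict. An aggregate score dominated by zero cells can therefore hide the differences that matter where conflict occurs, which is one reason to inspect a metric's behavior before relying on it.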

Page Count
18 pages

Category
Statistics: Applications