Unveiling Location-Specific Price Drivers: A Two-Stage Cluster Analysis for Interpretable House Price Predictions
By: Paul Gümmer, Julian Rosenberger, Mathias Kraus and more
Potential Business Impact:
Finds house prices more accurately by grouping similar homes.
House price valuation remains challenging due to localized market variations. Existing approaches often rely on black-box machine learning models, which lack interpretability, or on simplistic methods such as linear regression (LR), which fail to capture market heterogeneity. To address this, we propose a machine learning approach that applies two-stage clustering: properties are first grouped on minimal location-based features, and additional features are incorporated in a second stage. Each cluster is then modeled using either LR or a generalized additive model (GAM), balancing predictive performance with interpretability. Constructing and evaluating our models on 43,309 German house property listings from 2023, we reduce mean absolute error by 36% for the GAM and by 58% for LR compared to models without clustering. Additionally, graphical analyses reveal pattern shifts between clusters. These findings emphasize the importance of cluster-specific insights, enhancing interpretability and offering practical value for buyers, sellers, and real estate analysts seeking more reliable property valuations.
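The two-stage idea can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the abstract does not name the clustering algorithm, so plain k-means is used as a stand-in; stage two is assumed to subdivide each location cluster on one extra feature (living area); the data are synthetic; and the helper names `kmeans`, `fit_line`, and `mae` are hypothetical. Only the LR variant is shown, with a single predictor and in-sample error.

```python
import random
from statistics import mean

def kmeans(points, k, iters=25):
    """Lloyd's k-means on tuples, with deterministic farthest-point initialisation."""
    def dist2(p, q):
        return sum((a - b) ** 2 for a, b in zip(p, q))
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist2(p, c) for c in centers)))
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist2(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centre if a cluster runs empty
                centers[c] = tuple(mean(col) for col in zip(*members))
    return labels

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx if sxx else 0.0
    return my - b * mx, b

def mae(xs, ys, model):
    """Mean absolute error of a fitted line on the given points."""
    a, b = model
    return mean(abs(y - (a + b * x)) for x, y in zip(xs, ys))

# Synthetic listings (assumption): two regions with different price levels per m2.
rng = random.Random(0)
homes = []  # (lat, lon, area_m2, price_eur)
for lat0, lon0, eur_per_m2 in [(48.1, 11.6, 8000), (53.6, 10.0, 3000)]:
    for _ in range(200):
        area = rng.uniform(60, 250)
        price = eur_per_m2 * area + rng.gauss(0, 20000)
        homes.append((rng.gauss(lat0, 0.3), rng.gauss(lon0, 0.3), area, price))

# Stage 1: cluster on location only.
region = kmeans([(h[0], h[1]) for h in homes], k=2)

# Stage 2 (assumed form): split each location cluster on an additional feature.
final_labels = [None] * len(homes)
for r in set(region):
    idx = [i for i, lab in enumerate(region) if lab == r]
    sub = kmeans([(homes[i][2],) for i in idx], k=2)
    for i, s in zip(idx, sub):
        final_labels[i] = (r, s)

# Baseline: one global linear regression of price on area.
areas = [h[2] for h in homes]
prices = [h[3] for h in homes]
global_mae = mae(areas, prices, fit_line(areas, prices))

# Cluster-wise linear regressions, errors pooled over all listings.
abs_errors = []
for lab in set(final_labels):
    xs = [h[2] for h, l in zip(homes, final_labels) if l == lab]
    ys = [h[3] for h, l in zip(homes, final_labels) if l == lab]
    a, b = fit_line(xs, ys)
    abs_errors += [abs(y - (a + b * x)) for x, y in zip(xs, ys)]
clustered_mae = mean(abs_errors)

print(f"global MAE:    {global_mae:,.0f}")
print(f"clustered MAE: {clustered_mae:,.0f}")
```

Because each cluster gets its own intercept and slope, the fitted coefficients can be read per submarket, which is the interpretability argument the abstract makes. A realistic evaluation would use held-out data and richer features; the numbers printed here say nothing about the paper's reported results.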
Similar Papers
Modern approaches to building interpretable models of the property market using machine learning on the base of mass cadastral valuation
Statistical Finance
Helps predict house prices accurately.
Predicting House Rental Prices in Ghana Using Machine Learning
Machine Learning (CS)
Predicts house rent prices accurately in Ghana.
The impact of economic policies on housing prices. Approximations and predictions in the UK, the US, France, and Switzerland from the 1980s to today
Statistical Finance
Predicts house prices using money and government data.