Using LLMs to Directly Guess Conditional Expectations Can Improve Efficiency in Causal Estimation
By: Chris Engh, P. M. Aronow
Potential Business Impact:
AI guesses help find what causes things better.
We propose a simple yet effective use of LLM-powered AI tools to improve causal estimation. In double machine learning, the accuracy of causal estimates of the effect of a treatment on an outcome in the presence of a high-dimensional confounder depends on the performance of estimators of conditional expectation functions. We show that predictions made by generative models trained on historical data can be used to improve the performance of these estimators relative to approaches that solely rely on adjusting for embeddings extracted from these models. We argue that the historical knowledge and reasoning capacities associated with these generative models can help overcome curse-of-dimensionality problems in causal inference problems. We consider a case study using a small dataset of online jewelry auctions, and demonstrate that inclusion of LLM-generated guesses as predictors can improve efficiency in estimation.
Similar Papers
LLM-based Agents for Automated Confounder Discovery and Subgroup Analysis in Causal Inference
Machine Learning (CS)
Helps doctors find best treatments by finding hidden causes.
CARE: Turning LLMs Into Causal Reasoning Expert
Machine Learning (CS)
Teaches computers to understand cause and effect.
Realizing LLMs' Causal Potential Requires Science-Grounded, Novel Benchmarks
Machine Learning (CS)
Helps AI understand cause and effect better.