Quantifying Symptom Causality in Clinical Decision Making: An Exploration Using CausaLM
By: Mehul Shetty, Connor Jordan
Potential Business Impact:
Helps doctors trust computer diagnoses more.
Current machine learning approaches to medical diagnosis often rely on correlational patterns between symptoms and diseases, risking misdiagnoses when symptoms are ambiguous or common across multiple conditions. In this work, we move beyond correlation to investigate the causal influence of key symptoms-specifically "chest pain" on diagnostic predictions. Leveraging the CausaLM framework, we generate counterfactual text representations in which target concepts are effectively "forgotten" enabling a principled estimation of the causal effect of that concept on a model's predicted disease distribution. By employing Textual Representation-based Average Treatment Effect (TReATE), we quantify how the presence or absence of a symptom shapes the model's diagnostic outcomes, and contrast these findings against correlation-based baselines such as CONEXP. Our results offer deeper insight into the decision-making behavior of clinical NLP models and have the potential to inform more trustworthy, interpretable, and causally-grounded decision support tools in medical practice.
Similar Papers
Causal Inference on Outcomes Learned from Text
Econometrics
Helps understand what words cause changes.
Technical Report: Facilitating the Adoption of Causal Inference Methods Through LLM-Empowered Co-Pilot
Machine Learning (CS)
Helps doctors find best treatments from patient data.
Text Mining Analysis of Symptom Patterns in Medical Chatbot Conversations
Machine Learning (CS)
Helps chatbots understand patient symptoms better.