Position: Thematic Analysis of Unstructured Clinical Transcripts with Large Language Models
By: Seungjun Yi , Joakim Nguyen , Terence Lim and more
Potential Business Impact:
Helps doctors understand patient stories faster.
This position paper examines how large language models (LLMs) can support thematic analysis of unstructured clinical transcripts, a widely used but resource-intensive method for uncovering patterns in patient and provider narratives. We conducted a systematic review of recent studies applying LLMs to thematic analysis, complemented by an interview with a practicing clinician. Our findings reveal that current approaches remain fragmented across multiple dimensions including types of thematic analysis, datasets, prompting strategies and models used, most notably in evaluation. Existing evaluation methods vary widely (from qualitative expert review to automatic similarity metrics), hindering progress and preventing meaningful benchmarking across studies. We argue that establishing standardized evaluation practices is critical for advancing the field. To this end, we propose an evaluation framework centered on three dimensions: validity, reliability, and interpretability.
Similar Papers
LLM-Assisted Thematic Analysis: Opportunities, Limitations, and Recommendations
Software Engineering
Helps researchers analyze text faster, but needs human checks.
Automated Thematic Analyses Using LLMs: Xylazine Wound Management Social Media Chatter Use Case
Artificial Intelligence
Computers find patterns in online talks.
Large Language Models in Thematic Analysis: Prompt Engineering, Evaluation, and Guidelines for Qualitative Software Engineering Research
Software Engineering
Helps computers find patterns in people's words.