Uncertainty-aware Semi-supervised Ensemble Teacher Framework for Multilingual Depression Detection
By: Mohammad Zia Ur Rehman, Velpuru Navya, Sanskar and more
Detecting depression from social media text remains challenging because of varied language styles, informal expression, and the scarcity of annotated data in many languages. To address these issues, we propose Semi-SMDNet, a Semi-Supervised Multilingual Depression detection Network that combines teacher-student pseudo-labelling, ensemble learning, and data augmentation. The framework uses an ensemble of teacher models whose predictions are combined through soft voting. An uncertainty-based threshold filters out low-confidence pseudo-labels to reduce noise and improve learning stability, and a confidence-weighted training method places more weight on reliable pseudo-labelled samples, which improves robustness across languages. Experiments on Arabic, Bangla, English, and Spanish datasets show that our approach consistently outperforms strong baselines and significantly narrows the performance gap between high-resource and low-resource settings. Detailed experiments and studies confirm that the framework is effective and applicable in varied settings, which makes it suitable for scalable, cross-language mental health monitoring where labelled resources are limited.
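The pipeline described in the abstract (soft voting over teachers, uncertainty-based filtering, confidence-weighted training) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the uncertainty measure, the threshold, or the weighting scheme, so the sketch assumes normalized entropy of the averaged prediction as the uncertainty, a hypothetical cutoff `max_uncertainty`, and the maximum averaged probability as the sample weight. All function names are illustrative.

```python
import numpy as np


def soft_vote(teacher_probs):
    """Average class probabilities over teachers: (T, N, C) -> (N, C)."""
    return np.mean(teacher_probs, axis=0)


def normalized_entropy(probs, eps=1e-12):
    """Entropy of each row, scaled to [0, 1] by dividing by log(C).

    Assumption: entropy stands in for the paper's uncertainty measure.
    """
    entropy = -np.sum(probs * np.log(probs + eps), axis=1)
    return entropy / np.log(probs.shape[1])


def select_pseudo_labels(teacher_probs, max_uncertainty=0.7):
    """Keep unlabeled samples whose ensemble uncertainty is low enough.

    max_uncertainty is a hypothetical threshold, not a value from the paper.
    Returns (labels, confidences, keep_mask); the first two cover only
    the retained samples.
    """
    avg = soft_vote(teacher_probs)
    keep = normalized_entropy(avg) <= max_uncertainty
    labels = avg.argmax(axis=1)
    confidences = avg.max(axis=1)
    return labels[keep], confidences[keep], keep


def weighted_cross_entropy(student_probs, labels, weights, eps=1e-12):
    """Cross-entropy where each sample is weighted by its confidence."""
    picked = student_probs[np.arange(len(labels)), labels]
    nll = -np.log(picked + eps)
    return float(np.sum(weights * nll) / np.sum(weights))


# Three teachers, three unlabeled posts, two classes (toy numbers).
teacher_probs = np.array([
    [[0.90, 0.10], [0.55, 0.45], [0.20, 0.80]],
    [[0.80, 0.20], [0.45, 0.55], [0.10, 0.90]],
    [[0.85, 0.15], [0.50, 0.50], [0.15, 0.85]],
])
labels, confidences, keep = select_pseudo_labels(teacher_probs)

# Hypothetical student predictions for the two retained samples.
student_probs = np.array([[0.7, 0.3], [0.4, 0.6]])
loss = weighted_cross_entropy(student_probs, labels, confidences)
```

In this toy input the teachers disagree on the second post, so its averaged prediction is maximally uncertain and it is excluded; the other two posts are kept and contribute to the student loss in proportion to the ensemble's confidence.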
Similar Papers
A Gold Standard Dataset and Evaluation Framework for Depression Detection and Explanation in Social Media using LLMs
Computation and Language
Finds sadness in online posts to help people.
Interpretable Depression Detection from Social Media Text Using LLM-Derived Embeddings
Computation and Language
Finds sad posts to help people feel better.
Generating Medically-Informed Explanations for Depression Detection using LLMs
Computation and Language
Finds depression early from online posts.