X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data
By: Xinquan Yang , Jinheng Xie , Yawen Huang and more
Long-tailed pulmonary anomalies in chest radiography present formidable diagnostic challenges. Despite the recent strides in diffusion-based methods for enhancing the representation of tailed lesions, the paucity of rare lesion exemplars curtails the generative capabilities of these approaches, thereby leaving the diagnostic precision less than optimal. In this paper, we propose a novel data synthesis pipeline designed to augment tail lesions utilizing a copious supply of conventional normal X-rays. Specifically, a sufficient quantity of normal samples is amassed to train a diffusion model capable of generating normal X-ray images. This pre-trained diffusion model is subsequently utilized to inpaint the head lesions present in the diseased X-rays, thereby preserving the tail classes as augmented training data. Additionally, we propose the integration of a Large Language Model Knowledge Guidance (LKG) module alongside a Progressive Incremental Learning (PIL) strategy to stabilize the inpainting fine-tuning process. Comprehensive evaluations conducted on the public lung datasets MIMIC and CheXpert demonstrate that the proposed method sets a new benchmark in performance.
Similar Papers
CXR-CML: Improved zero-shot classification of long-tailed multi-label diseases in Chest X-Rays
CV and Pattern Recognition
Helps doctors find rare diseases in X-rays.
Subtyping Breast Lesions via Generative Augmentation based Long-tailed Recognition in Ultrasound
CV and Pattern Recognition
Helps doctors find breast cancer types better.
Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models
CV and Pattern Recognition
Helps doctors understand X-rays by reading reports.