Score: 2

PULSE: A Unified Multi-Task Architecture for Cardiac Segmentation, Diagnosis, and Few-Shot Cross-Modality Clinical Adaptation

Published: December 3, 2025 | arXiv ID: 2512.03848v1

By: Hania Ghouse , Maryam Alsharqi , Farhad R. Nezami and more

BigTech Affiliations: Massachusetts Institute of Technology

Potential Business Impact:

Analyzes heart images for health and reports.

Business Areas:
Image Recognition Data and Analytics, Software

Cardiac image analysis remains fragmented across tasks: anatomical segmentation, disease classification, and grounded clinical report generation are typically handled by separate networks trained under different data regimes. No existing framework unifies these objectives within a single architecture while retaining generalization across imaging modalities and datasets. We introduce PULSE, a multi-task vision-language framework built on self-supervised representations and optimized through a composite supervision strategy that balances region overlap learning, pixel wise classification fidelity, and boundary aware IoU refinement. A multi-scale token reconstruction decoder enables anatomical segmentation, while shared global representations support disease classification and clinically grounded text output allowing the model to transition from pixels to structures and finally clinical reasoning within one architecture. Unlike prior task-specific pipelines, PULSE learns task-invariant cardiac priors, generalizes robustly across datasets, and can be adapted to new imaging modalities with minimal supervision. This moves the field closer to a scalable, foundation style cardiac analysis framework.

Country of Origin
πŸ‡ΈπŸ‡¦ πŸ‡ΊπŸ‡Έ United States, Saudi Arabia

Page Count
36 pages

Category
Computer Science:
CV and Pattern Recognition