Score: 3

Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images

Published: December 14, 2025 | arXiv ID: 2512.12662v1

By: Muhammad Umar Farooq, Abd Ur Rehman, Azka Rehman, and more

BigTech Affiliations: Stanford University

Potential Business Impact:

Helps doctors find thyroid problems better.

Business Areas:
Semantic Search, Internet Services

Accurate thyroid nodule segmentation in ultrasound images is critical for diagnosis and treatment planning. However, ambiguous boundaries between nodules and surrounding tissues, size variations, and the scarcity of annotated ultrasound data pose significant challenges for automated segmentation. Existing deep learning models struggle to incorporate contextual information from the thyroid gland and to generalize effectively across diverse cases. To address these challenges, we propose SSMT-Net, a Semi-Supervised Multi-Task Transformer-based Network. In an initial unsupervised phase, it leverages unlabeled data to strengthen the feature extraction capability of its Transformer-centric encoder. In the supervised phase, the model jointly optimizes nodule segmentation, gland segmentation, and nodule size estimation, integrating both local and global contextual features. Extensive evaluations on the TN3K and DDTI datasets demonstrate that SSMT-Net outperforms state-of-the-art methods, with higher accuracy and robustness, indicating its potential for real-world clinical applications.
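The supervised phase described above combines three objectives into one training signal. The sketch below shows one common way such a joint loss could be assembled. It is an illustration only: the abstract does not state which loss functions or weights SSMT-Net uses, so the soft Dice loss, the absolute-error size term, the scalar size target, and the weights `w_nodule`, `w_gland`, and `w_size` are all assumptions.

```python
from typing import Sequence


def dice_loss(pred: Sequence[float], target: Sequence[float], eps: float = 1e-6) -> float:
    """Soft Dice loss between predicted probabilities and a binary mask (both flattened)."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    return 1.0 - (2.0 * intersection + eps) / (total + eps)


def joint_loss(
    nodule_pred: Sequence[float],
    nodule_mask: Sequence[float],
    gland_pred: Sequence[float],
    gland_mask: Sequence[float],
    size_pred: float,
    size_true: float,
    w_nodule: float = 1.0,  # hypothetical weight, not taken from the paper
    w_gland: float = 0.5,   # hypothetical weight, not taken from the paper
    w_size: float = 0.1,    # hypothetical weight, not taken from the paper
) -> float:
    """Weighted sum of nodule segmentation, gland segmentation, and size-estimation losses."""
    nodule_term = dice_loss(nodule_pred, nodule_mask)
    gland_term = dice_loss(gland_pred, gland_mask)
    size_term = abs(size_pred - size_true)  # assumed L1 regression on a scalar size
    return w_nodule * nodule_term + w_gland * gland_term + w_size * size_term


if __name__ == "__main__":
    # Toy 4-pixel example: nodule prediction is perfect, gland prediction is partly off.
    loss = joint_loss(
        nodule_pred=[1.0, 0.0, 1.0, 0.0],
        nodule_mask=[1, 0, 1, 0],
        gland_pred=[1.0, 1.0, 0.0, 0.0],
        gland_mask=[1, 1, 1, 0],
        size_pred=2.0,
        size_true=2.0,
    )
    print(f"joint loss: {loss:.4f}")
```

Sharing one encoder across the three terms is what would let the gland and size tasks supply anatomical context to the nodule task. In practice the weights would have to be tuned, because the Dice terms and the size term are on different scales.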

Country of Origin
🇰🇷 🇶🇦 🇺🇸 United States, Korea, Republic of, Qatar

Page Count
13 pages

Category
Computer Science:
CV and Pattern Recognition