CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling
By: Trong-Thang Pham , Akash Awasthi , Saba Khan and more
Potential Business Impact:
Helps computers see like doctors read scans.
Understanding radiologists' eye movement during Computed Tomography (CT) reading is crucial for developing effective interpretable computer-aided diagnosis systems. However, CT research in this area has been limited by the lack of publicly available eye-tracking datasets and the three-dimensional complexity of CT volumes. To address these challenges, we present the first publicly available eye gaze dataset on CT, called CT-ScanGaze. Then, we introduce CT-Searcher, a novel 3D scanpath predictor designed specifically to process CT volumes and generate radiologist-like 3D fixation sequences, overcoming the limitations of current scanpath predictors that only handle 2D inputs. Since deep learning models benefit from a pretraining step, we develop a pipeline that converts existing 2D gaze datasets into 3D gaze data to pretrain CT-Searcher. Through both qualitative and quantitative evaluations on CT-ScanGaze, we demonstrate the effectiveness of our approach and provide a comprehensive assessment framework for 3D scanpath prediction in medical imaging.
Similar Papers
Eyes on the Image: Gaze Supervised Multimodal Learning for Chest X-ray Diagnosis and Report Generation
CV and Pattern Recognition
Helps doctors find sickness in X-rays better.
Imitating Radiological Scrolling: A Global-Local Attention Model for 3D Chest CT Volumes Multi-Label Anomaly Classification
CV and Pattern Recognition
Helps doctors find sickness in CT scans faster.
GazeTrack: High-Precision Eye Tracking Based on Regularization and Spatial Computing
CV and Pattern Recognition
Makes virtual reality eyes track more accurately.