Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension
By: Zhenhao Li , Long Yang , Xiaojie Yin and more
Potential Business Impact:
Fixes blurry medical scans for clearer pictures.
Computed tomography (CT) is a cornerstone imaging modality for non-invasive, high-resolution visualization of internal anatomical structures. However, when the scanned object exceeds the scanner's field of view (FOV), projection data are truncated, resulting in incomplete reconstructions and pronounced artifacts near FOV boundaries. Conventional reconstruction algorithms struggle to recover accurate anatomy from such data, limiting clinical reliability. Deep learning approaches have been explored for FOV extension, with diffusion generative models representing the latest advances in image synthesis. Yet, conventional diffusion models are computationally demanding and slow at inference due to their iterative sampling process. To address these limitations, we propose an efficient CT FOV extension framework based on the image-to-image Schr\"odinger Bridge (I$^2$SB) diffusion model. Unlike traditional diffusion models that synthesize images from pure Gaussian noise, I$^2$SB learns a direct stochastic mapping between paired limited-FOV and extended-FOV images. This direct correspondence yields a more interpretable and traceable generative process, enhancing anatomical consistency and structural fidelity in reconstructions. I$^2$SB achieves superior quantitative performance, with root-mean-square error (RMSE) values of 49.8\,HU on simulated noisy data and 152.0HU on real data, outperforming state-of-the-art diffusion models such as conditional denoising diffusion probabilistic models (cDDPM) and patch-based diffusion methods. Moreover, its one-step inference enables reconstruction in just 0.19s per 2D slice, representing over a 700-fold speedup compared to cDDPM (135s) and surpassing diffusionGAN (0.58s), the second fastest. This combination of accuracy and efficiency makes I$^2$SB highly suitable for real-time or clinical deployment.
Similar Papers
Taming Stable Diffusion for Computed Tomography Blind Super-Resolution
Image and Video Processing
Makes X-rays clearer with less radiation.
An Iterative Reconstruction Method for Dental Cone-Beam Computed Tomography with a Truncated Field of View
CV and Pattern Recognition
Improves dental X-rays by fixing missing parts.
Resolution-Independent Neural Operators for Multi-Rate Sparse-View CT
Image and Video Processing
Makes CT scans faster and clearer for everyone.