Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification
By: Pengfei Gu , Huimin Li , Haoteng Tang and more
Potential Business Impact:
Finds hidden disease signs in medical pictures.
Modern deep neural networks have shown remarkable performance in medical image classification. However, such networks either emphasize pixel-intensity features instead of fundamental anatomical structures (e.g., those encoded by topological invariants), or they capture only simple topological features via single-parameter persistence. In this paper, we propose a new topology-guided classification framework that extracts multi-scale and multi-filtration persistent topological features and integrates them into vision classification backbones. For an input image, we first compute cubical persistence diagrams (PDs) across multiple image resolutions/scales. We then develop a ``vineyard'' algorithm that consolidates these PDs into a single, stable diagram capturing signatures at varying granularities, from global anatomy to subtle local irregularities that may indicate early-stage disease. To further exploit richer topological representations produced by multiple filtrations, we design a cross-attention-based neural network that directly processes the consolidated final PDs. The resulting topological embeddings are fused with feature maps from CNNs or Transformers. By integrating multi-scale and multi-filtration topologies into an end-to-end architecture, our approach enhances the model's capacity to recognize complex anatomical structures. Evaluations on three public datasets show consistent, considerable improvements over strong baselines and state-of-the-art methods, demonstrating the value of our comprehensive topological perspective for robust and interpretable medical image classification.
Similar Papers
TopoImages: Incorporating Local Topology Encoding into Deep Learning Models for Medical Image Classification
CV and Pattern Recognition
Helps computers spot hidden patterns in pictures better.
Transferable Class Statistics and Multi-scale Feature Approximation for 3D Object Detection
CV and Pattern Recognition
Helps robots see objects with less computer power.
Multi-Modal Feature Fusion for Spatial Morphology Analysis of Traditional Villages via Hierarchical Graph Neural Networks
CV and Pattern Recognition
Helps computers understand how villages change over time.