Score: 1

A Lightweight Multi-Scale Attention Framework for Real-Time Spinal Endoscopic Instance Segmentation

Published: December 26, 2025 | arXiv ID: 2512.21984v1

By: Qi Lai , JunYan Li , Qiang Cai and more

Potential Business Impact:

Helps surgeons see inside bodies better during operations.

Business Areas:

Image Recognition Data and Analytics, Software

Real-time instance segmentation for spinal endoscopy is important for identifying and protecting critical anatomy during surgery, but it is difficult because of the narrow field of view, specular highlights, smoke/bleeding, unclear boundaries, and large scale changes. Deployment is also constrained by limited surgical hardware, so the model must balance accuracy and speed and remain stable under small-batch (even batch-1) training. We propose LMSF-A, a lightweight multi-scale attention framework co-designed across backbone, neck, and head. The backbone uses a C2f-Pro module that combines RepViT-style re-parameterized convolution (RVB) with efficient multi-scale attention (EMA), enabling multi-branch training while collapsing into a single fast path for inference. The neck improves cross-scale consistency and boundary detail using Scale-Sequence Feature Fusion (SSFF) and Triple Feature Encoding (TFE), which strengthens high-resolution features. The head adopts a Lightweight Multi-task Shared Head (LMSH) with shared convolutions and GroupNorm to reduce parameters and support batch-1 stability. We also release the clinically reviewed PELD dataset (61 patients, 610 images) with instance masks for adipose tissue, bone, ligamentum flavum, and nerve. Experiments show that LMSF-A is highly competitive (or even better than) in all evaluation metrics and much lighter than most instance segmentation methods requiring only 1.8M parameters and 8.8 GFLOPs, and it generalizes well to a public teeth benchmark. Code and dataset: https://github.com/hhwmortal/PELD-Instance-segmentation.

Effective Attention-Guided Multi-Scale Medical Network for Skin Lesion Segmentation

CV and Pattern Recognition

Finds skin cancer spots more accurately.

8 Dec 2025 1

88%

Large Language Model Evaluated Stand-alone Attention-Assisted Graph Neural Network with Spatial and Structural Information Interaction for Precise Endoscopic Image Segmentation

CV and Pattern Recognition

Helps doctors find tiny cancer spots in the gut.

9 Aug 2025 1

88%

Cross-Layer Feature Self-Attention Module for Multi-Scale Object Detection

CV and Pattern Recognition

Finds objects of all sizes in pictures better.

16 Oct 2025 0

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

10 pages

A Lightweight Multi-Scale Attention Framework for Real-Time Spinal Endoscopic Instance Segmentation

Helps surgeons see inside bodies better during operations.

Technical Abstract

Effective Attention-Guided Multi-Scale Medical Network for Skin Lesion Segmentation

Large Language Model Evaluated Stand-alone Attention-Assisted Graph Neural Network with Spatial and Structural Information Interaction for Precise Endoscopic Image Segmentation

Cross-Layer Feature Self-Attention Module for Multi-Scale Object Detection