Score: 0

Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction

Published: December 24, 2025 | arXiv ID: 2512.21287v1

By: Suren Bandara

Potential Business Impact:

Finds table lines in messy, blurry pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Structured data extraction from tables plays a crucial role in document image analysis for scanned documents and digital archives. Although many methods have been proposed to detect table structures and extract cell contents, accurately identifying table segment boundaries (rows and columns) remains challenging, particularly in low-resolution or noisy images. In many real-world scenarios, table data are incomplete or degraded, limiting the adaptability of transformer-based methods to noisy inputs. Mask-based edge detection techniques have shown greater robustness under such conditions, as their sensitivity can be adjusted through threshold tuning; however, existing approaches typically apply masks directly to images, leading to noise sensitivity, resolution loss, or high computational cost. This paper proposes a novel multi-scale signal-processing method for detecting table edges from table masks. Row and column transitions are modeled as one-dimensional signals and processed using Gaussian convolution with progressively increasing variances, followed by statistical thresholding to suppress noise while preserving stable structural edges. Detected signal peaks are mapped back to image coordinates to obtain accurate segment boundaries. Experimental results show that applying the proposed approach to column edge detection improves Cell-Aware Segmentation Accuracy (CASA) a layout-aware metric evaluating both textual correctness and correct cell placement from 67% to 76% on the PubLayNet-1M benchmark when using TableNet with PyTesseract OCR. The method is robust to resolution variations through zero-padding and scaling strategies and produces optimized structured tabular outputs suitable for downstream analysis.

RefineSeg: Dual Coarse-to-Fine Learning for Medical Image Segmentation

CV and Pattern Recognition

Teaches computers to draw body parts from messy sketches.

4 Aug 2025 1

86%

PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction

CV and Pattern Recognition

Helps computers find and understand tables in documents.

11 Dec 2025 0

85%

Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation

CV and Pattern Recognition

Finds tiny details in medical pictures better.

5 Dec 2025 0

View PDF Login to Bookmark

Page Count

6 pages

Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction

Finds table lines in messy, blurry pictures.

Technical Abstract

RefineSeg: Dual Coarse-to-Fine Learning for Medical Image Segmentation

PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction

Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation