CoAtNeXt: An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification

Published: September 11, 2025 | arXiv ID: 2509.09242v1

By: Mustafa Yurdakul, Sakir Tasdemir

Potential Business Impact:

Helps doctors find stomach sickness faster.

Business Areas:

Image Recognition, Data and Analytics, Software

Background and objective: Early diagnosis of gastric diseases is crucial to prevent fatal outcomes. Although histopathologic examination remains the diagnostic gold standard, it is performed entirely manually, making evaluations labor-intensive and prone to variability among pathologists. Critical findings may be missed, and the lack of standard procedures reduces consistency. These limitations highlight the need for automated, reliable, and efficient methods for gastric tissue analysis.

Methods: In this study, a novel hybrid model named CoAtNeXt was proposed for the classification of gastric tissue images. The model is built upon the CoAtNet architecture by replacing its MBConv layers with enhanced ConvNeXtV2 blocks. Additionally, the Convolutional Block Attention Module (CBAM) is integrated to improve local feature extraction through channel and spatial attention mechanisms. The architecture was scaled to achieve a balance between computational efficiency and classification performance. CoAtNeXt was evaluated on two publicly available datasets, HMU-GC-HE-30K for eight-class classification and GasHisSDB for binary classification, and was compared against ten Convolutional Neural Network (CNN) and ten Vision Transformer (ViT) models.

Results: CoAtNeXt achieved 96.47% accuracy, 96.60% precision, 96.47% recall, 96.45% F1 score, and 99.89% AUC on HMU-GC-HE-30K. On GasHisSDB, it reached 98.29% accuracy, 98.07% precision, 98.41% recall, 98.23% F1 score, and 99.90% AUC. It outperformed all CNN and ViT models tested and surpassed previous studies in the literature.

Conclusion: Experimental results show that CoAtNeXt is a robust architecture for histopathological classification of gastric tissue images, performing well on both binary and multiclass tasks. This highlights its potential to assist pathologists by enhancing diagnostic accuracy and reducing workload.
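The accuracy, precision, recall, and F1 figures reported in the abstract are standard classification metrics. As a rough illustration of how such numbers are derived from predictions, here is a minimal pure-Python sketch. The function name `macro_metrics`, the toy labels, and the choice of macro averaging are all assumptions made for this example; the abstract does not say which averaging scheme the authors used, and this is not their evaluation code.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Return accuracy plus macro-averaged precision, recall, and F1.

    Hypothetical helper for illustration only. Macro averaging is an
    assumption; the abstract does not state the averaging scheme.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1   # correct prediction for class t
        else:
            fp[p] += 1   # class p was predicted but is wrong
            fn[t] += 1   # class t was missed

    def safe_div(a, b):
        # Treat an undefined ratio (zero denominator) as 0.0.
        return a / b if b else 0.0

    precisions, recalls, f1s = [], [], []
    for c in labels:
        prec = safe_div(tp[c], tp[c] + fp[c])
        rec = safe_div(tp[c], tp[c] + fn[c])
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(safe_div(2 * prec * rec, prec + rec))

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


if __name__ == "__main__":
    # Toy binary example with made-up labels (0 and 1), not real data.
    y_true = [0, 0, 1, 1]
    y_pred = [0, 1, 1, 1]
    print(macro_metrics(y_true, y_pred))
```

The same function works unchanged for an eight-class problem, since the per-class counts are keyed by whatever labels appear in the inputs.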

Country of Origin

🇹🇷 Turkey

Page Count

19 pages