Score: 0

Graph Embedding with Mel-spectrograms for Underwater Acoustic Target Recognition

Published: December 12, 2025 | arXiv ID: 2512.11545v1

By: Sheng Feng, Shuqing Ma, Xiaoqian Zhu

Underwater acoustic target recognition (UATR) is extremely challenging due to the complexity of ship-radiated noise and the variability of ocean environments. Although deep learning (DL) approaches have achieved promising results, most existing models implicitly assume that underwater acoustic data lie in a Euclidean space. This assumption, however, is unsuitable for the inherently complex topology of underwater acoustic signals, which exhibit non-stationary, non-Gaussian, and nonlinear characteristics. To overcome this limitation, this paper proposes the UATR-GTransformer, a non-Euclidean DL model that integrates Transformer architectures with graph neural networks (GNNs). The model comprises three key components: a Mel patchify block, a GTransformer block, and a classification head. The Mel patchify block partitions the Mel-spectrogram into overlapping patches, while the GTransformer block employs a Transformer Encoder to capture mutual information between split patches to generate Mel-graph embeddings. Subsequently, a GNN enhances these embeddings by modeling local neighborhood relationships, and a feed-forward network (FFN) further performs feature transformation. Experiments results based on two widely used benchmark datasets demonstrate that the UATR-GTransformer achieves performance competitive with state-of-the-art methods. In addition, interpretability analysis reveals that the proposed model effectively extracts rich frequency-domain information, highlighting its potential for applications in ocean engineering.

A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition

Sound

Helps identify underwater sounds with little data.

17 Apr 2025 1

88%

Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator

CV and Pattern Recognition

Cleans up blurry underwater pictures for better views.

5 Dec 2025 0

87%

Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition

Audio and Speech Processing

Helps scientists hear whale songs in noisy oceans.

29 Oct 2025 0

View PDF Login to Bookmark

Graph Embedding with Mel-spectrograms for Underwater Acoustic Target Recognition

Technical Abstract

A Multi-task Learning Balanced Attention Convolutional Neural Network Model for Few-shot Underwater Acoustic Target Recognition

Underwater Image Reconstruction Using a Swin Transformer-Based Generator and PatchGAN Discriminator

Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition