Score: 0

Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model

Published: March 25, 2025 | arXiv ID: 2503.19386v2

By: Peishan Huang, Dong Li

Potential Business Impact:

Sends pictures better by describing them with words.

Business Areas:

Semantic Web Internet Services

In recent years, the rapid development of machine learning has brought reforms and challenges to traditional communication systems. Semantic communication has appeared as an effective strategy to effectively extract relevant semantic signals semantic segmentation labels and image features for image transmission. However, the insufficient number of extracted semantic features of images will potentially result in a low reconstruction accuracy, which hinders the practical applications and still remains challenging for solving. In order to fill this gap, this letter proposes a multi-text transmission semantic communication (Multi-SC) system, which uses the visual language model (VLM) to assist in the transmission of image semantic signals. Unlike previous image transmission semantic communication systems, the proposed system divides the image into multiple blocks and extracts multiple text information from the image using a modified large language and visual assistant (LLaVA), and combines semantic segmentation tags with semantic text for image recovery. Simulation results show that the proposed text semantics diversity scheme can significantly improve the reconstruction accuracy compared with related works.

VLF-MSC: Vision-Language Feature-Based Multimodal Semantic Communication System

CV and Pattern Recognition

Sends pictures and words together, saving space.

13 Nov 2025 0

91%

Semantic-Clipping: Efficient Vision-Language Modeling with Semantic-Guidedd Visual Selection

CV and Pattern Recognition

Helps computers understand pictures better by focusing on important parts.

14 Mar 2025 0

90%

Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks

Artificial Intelligence

Helps car AI understand traffic better with less data.

5 May 2025 1

View PDF Login to Bookmark

Country of Origin

🇲🇴 Macao

Page Count

6 pages

Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model

Sends pictures better by describing them with words.

Technical Abstract

VLF-MSC: Vision-Language Feature-Based Multimodal Semantic Communication System

Semantic-Clipping: Efficient Vision-Language Modeling with Semantic-Guidedd Visual Selection

Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks