Score: 0

Cross-Modal Prototype Augmentation and Dual-Grained Prompt Learning for Social Media Popularity Prediction

Published: August 22, 2025 | arXiv ID: 2508.16147v1

By: Ao Zhou , Mingsheng Tu , Luping Wang and more

Potential Business Impact:

Predicts social media post popularity better.

Business Areas:
Social News Media and Entertainment

Social Media Popularity Prediction is a complex multimodal task that requires effective integration of images, text, and structured information. However, current approaches suffer from inadequate visual-textual alignment and fail to capture the inherent cross-content correlations and hierarchical patterns in social media data. To overcome these limitations, we establish a multi-class framework , introducing hierarchical prototypes for structural enhancement and contrastive learning for improved vision-text alignment. Furthermore, we propose a feature-enhanced framework integrating dual-grained prompt learning and cross-modal attention mechanisms, achieving precise multimodal representation through fine-grained category modeling. Experimental results demonstrate state-of-the-art performance on benchmark metrics, establishing new reference standards for multimodal social media analysis.

Country of Origin
🇨🇳 China

Page Count
8 pages

Category
Computer Science:
Information Retrieval