Score: 1

Improving Product Search Relevance with EAR-MP: A Solution for the CIKM 2025 AnalytiCup

Published: October 27, 2025 | arXiv ID: 2510.23018v2

By: JaeEun Lim , Soomin Kim , Jaeyong Seo and more

Potential Business Impact:

Helps online stores understand shoppers in any language.

Business Areas:
Semantic Search Internet Services

Multilingual e-commerce search is challenging due to linguistic diversity and the noise inherent in user-generated queries. This paper documents the solution employed by our team (EAR-MP) for the CIKM 2025 AnalytiCup, which addresses two core tasks: Query-Category (QC) relevance and Query-Item (QI) relevance. Our approach first normalizes the multilingual dataset by translating all text into English, then mitigates noise through extensive data cleaning and normalization. For model training, we build on DeBERTa-v3-large and improve performance with label smoothing, self-distillation, and dropout. In addition, we introduce task-specific upgrades, including hierarchical token injection for QC and a hybrid scoring mechanism for QI. Under constrained compute, our method achieves competitive results, attaining an F1 score of 0.8796 on QC and 0.8744 on QI. These findings underscore the importance of systematic data preprocessing and tailored training strategies for building robust, resource-efficient multilingual relevance systems.

Country of Origin
🇰🇷 🇯🇵 Korea, Republic of, Japan

Page Count
6 pages

Category
Computer Science:
Information Retrieval