Improving Product Search Relevance with EAR-MP: A Solution for the CIKM 2025 AnalytiCup
By: JaeEun Lim , Soomin Kim , Jaeyong Seo and more
Potential Business Impact:
Helps online shoppers find products in any language.
Multilingual e-commerce search is challenging due to linguistic diversity and the noise inherent in user-generated queries. This paper documents the solution employed by our team (EAR-MP) for the CIKM 2025 AnalytiCup, which addresses two core tasks: Query-Category (QC) relevance and Query-Item (QI) relevance. Our approach first normalizes the multilingual dataset by translating all text into English, then mitigates noise through extensive data cleaning and normalization. For model training, we build on DeBERTa-v3-large and improve performance with label smoothing, self-distillation, and dropout. In addition, we introduce task-specific upgrades, including hierarchical token injection for QC and a hybrid scoring mechanism for QI. Under constrained compute, our method achieves competitive results, attaining an F1 score of 0.8796 on QC and 0.8744 on QI. These findings underscore the importance of systematic data preprocessing and tailored training strategies for building robust, resource-efficient multilingual relevance systems.
Similar Papers
Improving Product Search Relevance with EAR-MP: A Solution for the CIKM 2025 AnalytiCup
Information Retrieval
Helps online stores understand shoppers in any language.
A Data-Centric Approach to Multilingual E-Commerce Product Search: Case Study on Query-Category and Query-Item Relevance
Information Retrieval
Helps online stores understand shoppers in any language.
Analyticup E-commerce Product Search Competition Technical Report from Team Tredence_AICOE
Information Retrieval
Helps online shoppers find products in any language.