Score: 1

Alibaba International E-commerce Product Search Competition DILAB Team Technical Report

Published: October 21, 2025 | arXiv ID: 2510.18499v1

By: Hyewon Lee , Junghyun Oh , Minkyung Song and more

Potential Business Impact:

Helps online stores find products in any language.

Business Areas:
Semantic Search Internet Services

This study presents the multilingual e-commerce search system developed by the DILAB team, which achieved 5th place on the final leaderboard with a competitive overall score of 0.8819, demonstrating stable and high-performing results across evaluation metrics. To address challenges in multilingual query-item understanding, we designed a multi-stage pipeline integrating data refinement, lightweight preprocessing, and adaptive modeling. The data refinement stage enhanced dataset consistency and category coverage, while language tagging and noise filtering improved input quality. In the modeling phase, multiple architectures and fine-tuning strategies were explored, and hyperparameters optimized using curated validation sets to balance performance across query-category (QC) and query-item (QI) tasks. The proposed framework exhibited robustness and adaptability across languages and domains, highlighting the effectiveness of systematic data curation and iterative evaluation for multilingual search systems. The source code is available at https://github.com/2noweyh/DILAB-Alibaba-Ecommerce-Search.

Country of Origin
🇰🇷 Korea, Republic of

Repos / Data Links

Page Count
6 pages

Category
Computer Science:
Machine Learning (CS)