Alibaba International E-commerce Product Search Competition DILAB Team Technical Report
By: Hyewon Lee , Junghyun Oh , Minkyung Song and more
Potential Business Impact:
Helps online stores find products in any language.
This study presents the multilingual e-commerce search system developed by the DILAB team, which achieved 5th place on the final leaderboard with a competitive overall score of 0.8819, demonstrating stable and high-performing results across evaluation metrics. To address challenges in multilingual query-item understanding, we designed a multi-stage pipeline integrating data refinement, lightweight preprocessing, and adaptive modeling. The data refinement stage enhanced dataset consistency and category coverage, while language tagging and noise filtering improved input quality. In the modeling phase, multiple architectures and fine-tuning strategies were explored, and hyperparameters optimized using curated validation sets to balance performance across query-category (QC) and query-item (QI) tasks. The proposed framework exhibited robustness and adaptability across languages and domains, highlighting the effectiveness of systematic data curation and iterative evaluation for multilingual search systems. The source code is available at https://github.com/2noweyh/DILAB-Alibaba-Ecommerce-Search.
Similar Papers
Alibaba International E-commerce Product Search Competition DcuRAGONs Team Technical Report
Information Retrieval
Helps online shoppers find products faster.
Analyticup E-commerce Product Search Competition Technical Report from Team Tredence_AICOE
Information Retrieval
Helps online shoppers find products in any language.
A Data-Centric Approach to Multilingual E-Commerce Product Search: Case Study on Query-Category and Query-Item Relevance
Information Retrieval
Helps online stores understand shoppers in any language.