Evaluating Embedding Models and Pipeline Optimization for AI Search Quality
By: Philip Zhong, Kent Chen, Don Wang
Potential Business Impact:
Makes AI-powered search find the right information more often.
We evaluate the performance of various text embedding models and pipeline configurations for AI-driven search systems. We compare sentence-transformer and generative embedding models (e.g., All-MPNet, BGE, GTE, and Qwen) across embedding dimensions, indexing methods (Milvus HNSW/IVF), and chunking strategies. A custom evaluation dataset of 11,975 query-chunk pairs was synthesized from US City Council meeting transcripts using a local large language model (LLM). The data pipeline includes preprocessing, automated question generation per chunk, manual validation, and continuous integration/continuous deployment (CI/CD) integration. We measure retrieval accuracy with reference-based metrics: Top-K Accuracy and Normalized Discounted Cumulative Gain (NDCG). Our results demonstrate that higher-dimensional embeddings significantly boost search quality (e.g., Qwen3-Embedding-8B at 4096 dimensions achieves a Top-3 accuracy of about 0.571, versus 0.412 for GTE-large at 1024 dimensions), and that neural re-rankers (e.g., a BGE cross-encoder) further improve ranking accuracy (Top-3 accuracy up to 0.527). Finer-grained chunking (512-character versus 2000-character chunks) also improves accuracy. We discuss the impact of these factors and outline future directions for pipeline automation and evaluation.
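Since each synthesized query is paired with the single chunk it was generated from, both reported metrics take a simple form. The sketch below is a minimal illustration, not the paper's evaluation code (the rank bookkeeping is an assumption): Top-K Accuracy is the fraction of queries whose gold chunk appears in the top K results, and NDCG with one relevant item per query reduces to 1/log2(rank + 1):

    import math

    def top_k_accuracy(gold_ranks, k):
        # gold_ranks[i] is the 1-based rank of query i's gold chunk,
        # or None if it was not retrieved at all.
        return sum(1 for r in gold_ranks if r is not None and r <= k) / len(gold_ranks)

    def ndcg_at_k(gold_ranks, k):
        # With one relevant chunk per query, the ideal DCG is 1 (gold chunk
        # at rank 1), so NDCG@k is 1/log2(rank + 1) when the gold chunk
        # lands within the top k, and 0 otherwise.
        gains = [1.0 / math.log2(r + 1) if r is not None and r <= k else 0.0
                 for r in gold_ranks]
        return sum(gains) / len(gains)

    ranks = [1, 3, None, 2]          # hypothetical results for four queries
    print(top_k_accuracy(ranks, 3))  # 0.75
    print(ndcg_at_k(ranks, 3))       # (1.0 + 0.5 + 0.0 + 0.631) / 4 ≈ 0.533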
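The retrieve-then-rerank pipeline itself can be sketched as follows. This is a hedged illustration rather than the paper's implementation: the embedding and re-ranker checkpoints are real Hugging Face model names, but the collection layout, the HNSW parameters (M, efConstruction, ef), and the candidate depth of 20 are assumptions made for the example:

    from pymilvus import connections, Collection, CollectionSchema, FieldSchema, DataType
    from sentence_transformers import SentenceTransformer, CrossEncoder

    embedder = SentenceTransformer("sentence-transformers/all-mpnet-base-v2")  # 768-dim
    reranker = CrossEncoder("BAAI/bge-reranker-base")  # BGE cross-encoder

    # Build an HNSW-indexed Milvus collection over transcript chunks.
    connections.connect(host="localhost", port="19530")
    schema = CollectionSchema([
        FieldSchema("id", DataType.INT64, is_primary=True, auto_id=True),
        FieldSchema("text", DataType.VARCHAR, max_length=2048),
        FieldSchema("vector", DataType.FLOAT_VECTOR, dim=768),
    ])
    col = Collection("council_chunks", schema)
    col.create_index("vector", {"index_type": "HNSW", "metric_type": "COSINE",
                                "params": {"M": 16, "efConstruction": 200}})

    chunks = ["...512-character transcript chunk...", "...another chunk..."]
    col.insert([chunks, embedder.encode(chunks).tolist()])
    col.load()

    # Stage 1: approximate nearest-neighbor search over the HNSW index.
    query = "What did the council decide about the zoning variance?"
    hits = col.search(data=[embedder.encode(query).tolist()], anns_field="vector",
                      param={"metric_type": "COSINE", "params": {"ef": 64}},
                      limit=20, output_fields=["text"])[0]

    # Stage 2: re-score (query, chunk) pairs with the cross-encoder, keep top 3.
    texts = [hit.entity.get("text") for hit in hits]
    scores = reranker.predict([(query, t) for t in texts])
    top3 = [t for _, t in sorted(zip(scores, texts), reverse=True)][:3]

Swapping the IVF index in for HNSW only changes the create_index and search parameter dictionaries; the two-stage structure stays the same.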