
Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact

Published: November 17, 2025 | arXiv ID: 2511.13057v1

By: Satyanarayan Pati

Potential Business Impact:

Makes search engines use less computer memory.

Business Areas:

Semantic Search Internet Services

Dense retrieval models have become a standard for state-of-the-art information retrieval. However, their high-dimensional, high-precision (float32) vector embeddings create significant storage and memory challenges for real-world deployment. To address this, we conduct a rigorous empirical study on the BEIR SciFact benchmark, evaluating the trade-offs between two primary compression strategies: (1) Dimensionality Reduction via deep Autoencoders (AE), compressing the original 384-dimensional vectors into latent spaces ranging from 384 down to 12 dimensions, and (2) Precision Reduction via Quantization (float16, int8, and binary). We systematically compare each method by measuring the "performance loss" (or gain) relative to a float32 baseline across a full suite of retrieval metrics (nDCG, MAP, MRR, Recall, Precision) at various k cutoffs. Our results show that int8 scalar quantization provides the most effective "sweet spot," achieving 4x compression with a negligible (~1-2%) drop in nDCG@10. In contrast, Autoencoders degrade gracefully but suffer a more significant performance loss at the equivalent 4x compression ratio (AE-96). Binary quantization was found to be unsuitable for this task due to catastrophic performance drops. This work provides a practical guide for deploying efficient, high-performance retrieval systems.
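To make the compression ratios in the abstract concrete, here is a minimal sketch of the three precision-reduction schemes applied to 384-dimensional vectors. It is an illustration only, not the paper's implementation: the embeddings are synthetic random vectors, the int8 step assumes a single symmetric max-abs scale over the whole corpus, the binary step assumes sign thresholding at zero, and the helper names (to_int8, to_binary, bytes_per_vector) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 384

# Synthetic stand-in for float32 embeddings (assumption: not real SciFact data).
emb = rng.standard_normal((1000, dim)).astype(np.float32)
emb /= np.linalg.norm(emb, axis=1, keepdims=True)


def to_int8(x):
    """Symmetric scalar quantization with one corpus-wide scale (an assumed scheme)."""
    scale = np.abs(x).max() / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale


def to_binary(x):
    """Keep only the sign of each dimension, packed 8 dimensions per byte."""
    return np.packbits(x > 0, axis=1)


def bytes_per_vector(a):
    """Storage cost of one row of the array, in bytes."""
    return a.nbytes // a.shape[0]


emb_f16 = emb.astype(np.float16)
emb_i8, scale = to_int8(emb)
emb_bin = to_binary(emb)

baseline = bytes_per_vector(emb)  # 384 dims * 4 bytes = 1536
for name, arr in [("float16", emb_f16), ("int8", emb_i8), ("binary", emb_bin)]:
    size = bytes_per_vector(arr)
    print(f"{name}: {size} bytes/vector, {baseline // size}x compression")

# Dequantizing int8 recovers each value to within half a quantization step.
recon = emb_i8.astype(np.float32) * scale
max_err = float(np.abs(recon - emb).max())
```

Under these assumptions float16 halves storage, int8 gives the 4x ratio cited above, and binary gives 32x. A 96-dimensional float32 latent vector (AE-96) also occupies 384 bytes, which is why the abstract treats it as the 4x counterpart to int8.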

Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact

Information Retrieval

Shrinks computer search data, keeping it fast.

17 Nov 2025 1

90%

Optimization of embeddings storage for RAG systems using quantization and dimensionality reduction techniques

Information Retrieval

Shrinks AI's memory needs, keeping it smart.

30 Apr 2025 0

88%

Vector Quantization using Gaussian Variational Autoencoder

Machine Learning (CS)

Makes images easier for computers to understand.

7 Dec 2025 1


Page Count

16 pages
