OrchANN: A Unified I/O Orchestration Framework for Skewed Out-of-Core Vector Search
By: Chengying Huan, Lizheng Chen, Zhengyi Yang, and more
Potential Business Impact:
Finds similar items faster, using less storage.
Approximate nearest neighbor search (ANNS) at billion scale is fundamentally an out-of-core problem: vectors and indexes live on SSD, so performance is dominated by I/O rather than compute. Under skewed semantic embeddings, existing out-of-core systems break down: a uniform local index mismatches cluster scales, static routing misguides queries and inflates the number of probed partitions, and pruning is incomplete at the cluster level and lossy at the vector level, triggering "fetch-to-discard" reranking on raw vectors. We present OrchANN, an out-of-core ANNS engine that uses an I/O orchestration model for unified I/O governance along the route-access-verify pipeline. OrchANN selects a heterogeneous local index per cluster via offline auto-profiling, maintains a query-aware in-memory navigation graph that adapts to skewed workloads, and applies multi-level pruning with geometric bounds to filter both clusters and vectors before issuing SSD reads. Across five standard datasets under strict out-of-core constraints, OrchANN outperforms four baselines (DiskANN, Starling, SPANN, and PipeANN) in both QPS and latency while reducing SSD accesses. Furthermore, OrchANN delivers up to 17.2x higher QPS and 25.0x lower latency than competing systems without sacrificing accuracy.
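The cluster-level half of that pruning can be pictured with a standard triangle-inequality bound: a cluster whose geometric lower bound already exceeds the current k-th best distance cannot improve the result set, so its SSD-resident partition need not be read. The sketch below is only an illustration of that general idea under assumed inputs (per-cluster radii, the threshold value, and the function name are hypothetical); it is not OrchANN's implementation.

```python
# Minimal sketch, assuming clusters are summarized by a centroid and a
# coverage radius. Not OrchANN's code; names and values are illustrative.
import numpy as np

def prune_clusters(query, centroids, radii, kth_best_dist):
    """Return indices of clusters that could still hold a closer neighbor.

    For any vector v in a cluster with centroid c and radius r, the triangle
    inequality gives ||q - v|| >= ||q - c|| - r. Clusters whose lower bound
    is already >= the current k-th best distance are skipped, avoiding the
    SSD read for their partition.
    """
    dist_to_centroids = np.linalg.norm(centroids - query, axis=1)
    lower_bounds = np.maximum(dist_to_centroids - radii, 0.0)
    return np.nonzero(lower_bounds < kth_best_dist)[0]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dim, n_clusters = 64, 32
    centroids = rng.normal(size=(n_clusters, dim))
    radii = rng.uniform(0.5, 2.0, size=n_clusters)  # hypothetical coverage radii
    query = rng.normal(size=dim)
    survivors = prune_clusters(query, centroids, radii, kth_best_dist=8.0)
    print(f"{len(survivors)}/{n_clusters} clusters survive the geometric bound")
```

A vector-level analogue would apply a similar bound per candidate before fetching raw vectors, which is how the abstract's "fetch-to-discard" reranking is avoided; the exact bounds OrchANN uses are described in the paper, not here.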
Similar Papers
Breaking the Storage-Compute Bottleneck in Billion-Scale ANNS: A GPU-Driven Asynchronous I/O Framework
Databases
Makes searching huge data much faster.
Approximate Nearest Neighbor Search of Large Scale Vectors on Distributed Storage
Databases
Finds similar items in huge online lists faster.
Scalable Disk-Based Approximate Nearest Neighbor Search with Page-Aligned Graph
Machine Learning (CS)
Finds information faster on huge computer files.