Score: 0

Evaluating Retrieval-Augmented Generation Strategies for Large Language Models in Travel Mode Choice Prediction

Published: August 24, 2025 | arXiv ID: 2508.17527v1

By: Yiming Xu, Junfeng Jiao

Potential Business Impact:

Helps predict how people travel better.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Accurately predicting travel mode choice is essential for effective transportation planning, yet traditional statistical and machine learning models are constrained by rigid assumptions, limited contextual reasoning, and reduced generalizability. This study explores the potential of Large Language Models (LLMs) as a more flexible and context-aware approach to travel mode choice prediction, enhanced by Retrieval-Augmented Generation (RAG) to ground predictions in empirical data. We develop a modular framework for integrating RAG into LLM-based travel mode choice prediction and evaluate four retrieval strategies: basic RAG, RAG with balanced retrieval, RAG with a cross-encoder for re-ranking, and RAG with balanced retrieval and cross-encoder for re-ranking. These strategies are tested across three LLM architectures (OpenAI GPT-4o, o4-mini, and o3) to examine the interaction between model reasoning capabilities and retrieval methods. Using the 2023 Puget Sound Regional Household Travel Survey data, we conduct a series of experiments to evaluate model performance. The results demonstrate that RAG substantially enhances predictive accuracy across a range of models. Notably, the GPT-4o model combined with balanced retrieval and cross-encoder re-ranking achieves the highest accuracy of 80.8%, exceeding that of conventional statistical and machine learning baselines. Furthermore, LLM-based models exhibit superior generalization abilities relative to these baselines. Findings highlight the critical interplay between LLM reasoning capabilities and retrieval strategies, demonstrating the importance of aligning retrieval strategies with model capabilities to maximize the potential of LLM-based travel behavior modeling.

Retrieval Augmented Generation with Multi-Modal LLM Framework for Wireless Environments

Networking and Internet Architecture

Makes wireless internet faster and more reliable.

9 Mar 2025 1

93%

Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey

Computation and Language

Tests how AI uses outside facts to answer questions.

21 Apr 2025 0

92%

Rethinking Retrieval-Augmented Generation for Medicine: A Large-Scale, Systematic Expert Evaluation and Practical Insights

Computation and Language

Makes AI doctors more truthful and helpful.

10 Nov 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

24 pages

Evaluating Retrieval-Augmented Generation Strategies for Large Language Models in Travel Mode Choice Prediction

Helps predict how people travel better.

Technical Abstract

Retrieval Augmented Generation with Multi-Modal LLM Framework for Wireless Environments

Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey

Rethinking Retrieval-Augmented Generation for Medicine: A Large-Scale, Systematic Expert Evaluation and Practical Insights