Quantifying and Mitigating Selection Bias in LLMs: A Transferable LoRA Fine-Tuning and Efficient Majority Voting Approach

Published: November 17, 2025 | arXiv ID: 2511.21709v1

By: Blessed Guda, Lawrence Francis, Gabrial Zencha Ashungafac and more

Potential Business Impact:

Makes AI answer questions more fairly.

Business Areas:

A/B Testing Data and Analytics

Multiple Choice Question (MCQ) answering is a widely used method for evaluating the performance of Large Language Models (LLMs). However, LLMs often exhibit selection bias in MCQ tasks: their choices are influenced by factors such as answer position or option symbols rather than by the content. This bias undermines the reliability of MCQ as an evaluation framework. Most existing selection bias metrics require answer labels and measure divergences between the prediction and answer distributions, but they do not fully capture the consistency of a model's predictions across different orderings of the answer choices. Existing mitigation strategies also have notable limitations: majority voting, though effective, is computationally prohibitive, while calibration-based methods require validation sets and often fail to generalize across datasets. To address these gaps, we make three key contributions: (1) a new unsupervised, label-free Permutation Bias Metric (PBM) that directly quantifies inconsistencies in model predictions across answer permutations, providing a more precise measure of selection bias; (2) an efficient majority voting approach called Batch Question-Context KV caching (BaQCKV), which significantly reduces computational costs while preserving bias mitigation effectiveness; and (3) an unsupervised Low-Rank Adaptation (LoRA-1) fine-tuning strategy, based on our proposed metric and BaQCKV, that mitigates selection bias and offers a computationally efficient alternative that maintains model generalizability. Experiments across multiple MCQ benchmarks demonstrate that our approaches reduce bias, increasing consistency in accuracy while minimizing computational costs.
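To make the idea of permutation consistency concrete, here is a minimal Python sketch of plain majority voting over answer orderings, together with a simple label-free inconsistency score. It is an illustration under stated assumptions, not the paper's method: the `predict` interface, the helper names, and the scoring formula are all hypothetical, and the score shown is a generic stand-in rather than the PBM definition, which the abstract does not spell out. The sketch also queries the model once per ordering, which is exactly the cost that BaQCKV is described as reducing.

```python
from collections import Counter
from itertools import permutations
from typing import Callable, List, Sequence

# Hypothetical model interface (an assumption, not from the paper):
# takes a question and an ordered sequence of options, returns the
# index of the option the model picks in that ordering.
Predictor = Callable[[str, Sequence[str]], int]


def permutation_votes(predict: Predictor, question: str,
                      options: Sequence[str]) -> List[str]:
    """Query the model on every ordering and record the chosen option text."""
    votes = []
    for order in permutations(options):
        picked_index = predict(question, order)
        # Map the positional choice back to the option's content.
        votes.append(order[picked_index])
    return votes


def majority_vote(votes: Sequence[str]) -> str:
    """Return the option chosen most often across orderings."""
    return Counter(votes).most_common(1)[0][0]


def inconsistency(votes: Sequence[str]) -> float:
    """Fraction of orderings that disagree with the majority choice.

    Illustrative stand-in only; this is NOT the paper's PBM formula.
    No answer labels are needed to compute it.
    """
    top_count = Counter(votes).most_common(1)[0][1]
    return 1.0 - top_count / len(votes)


# Two toy predictors for demonstration.
def always_first(question: str, options: Sequence[str]) -> int:
    """A fully position-biased model: always picks the first slot."""
    return 0


def content_based(question: str, options: Sequence[str]) -> int:
    """A position-invariant model: always finds the same option text."""
    return list(options).index("Paris")


if __name__ == "__main__":
    q = "What is the capital of France?"
    opts = ["Paris", "Rome", "Oslo"]
    for name, model in [("always_first", always_first),
                        ("content_based", content_based)]:
        v = permutation_votes(model, q, opts)
        print(name, majority_vote(v), round(inconsistency(v), 3))
```

With n options this loop makes n! model calls per question, which illustrates why the abstract calls naive majority voting computationally prohibitive.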

Benchmarking and Mitigating MCQA Selection Bias of Large Vision-Language Models

CV and Pattern Recognition

Fixes AI's tendency to pick wrong answers.

20 Sep 2025 0

More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering

Computation and Language

Helps AI better answer tricky questions by thinking.

25 Nov 2025 1

Improving Score Reliability of Multiple Choice Benchmarks with Consistency Evaluation and Altered Answer Choices

Computation and Language

Measures how well AI answers questions reliably.

26 Nov 2025 0

Country of Origin

🇺🇸 United States

Page Count

20 pages
