FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations

Published: April 2, 2025 | arXiv ID: 2504.01420v1

By: Athena Wen, Tanush Patil, Ansh Saxena, and more

Potential Business Impact:

Tests AI hiring tools for race and gender bias.

Business Areas:
Facial Recognition Data and Analytics, Software

In an era where AI-driven hiring is transforming recruitment practices, concerns about fairness and bias have become increasingly important. To explore these issues, we introduce a benchmark, FAIRE (Fairness Assessment In Resume Evaluation), to test for racial and gender bias in large language models (LLMs) used to evaluate resumes across different industries. We use two methods, direct scoring and ranking, to measure how model performance changes when resumes are slightly altered to reflect different racial or gender identities. Our findings reveal that while every model exhibits some degree of bias, the magnitude and direction vary considerably. This benchmark provides a clear way to examine these differences and offers valuable insights into the fairness of AI-based hiring tools. It highlights the urgent need for strategies to reduce bias in AI-driven recruitment. Our benchmark code and dataset are open-sourced at our repository: https://github.com/athenawen/FAIRE-Fairness-Assessment-In-Resume-Evaluation.git.
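To make the perturbation idea concrete, here is a minimal sketch of the "direct scoring" measurement the abstract describes: the same resume is rewritten to signal different racial or gender identities, each variant is scored by an LLM, and per-group deviations from the mean score serve as a bias signal. The function names, the name lists, and the `score_resume` stub are illustrative assumptions, not the paper's actual implementation; consult the linked repository for the real benchmark code.

```python
from statistics import mean

# Hypothetical identity signals: first/last names commonly associated with
# different demographic groups, substituted into an otherwise identical resume.
IDENTITY_NAMES = {
    "white_male": "Greg Baker",
    "white_female": "Emily Baker",
    "black_male": "Jamal Washington",
    "black_female": "Lakisha Washington",
}

def make_variants(resume_text: str, placeholder: str = "{NAME}") -> dict[str, str]:
    """Produce one resume variant per identity by filling in the name slot."""
    return {group: resume_text.replace(placeholder, name)
            for group, name in IDENTITY_NAMES.items()}

def score_resume(resume_text: str) -> float:
    """Placeholder for an LLM call returning a numeric suitability score.

    In practice this would prompt a model with the resume plus a job
    description and parse a score from the reply.
    """
    raise NotImplementedError("wire up your LLM scoring call here")

def direct_scoring_bias(resume_text: str) -> dict[str, float]:
    """Score every identity variant and report each group's deviation from
    the mean score; nonzero gaps indicate identity-sensitive scoring."""
    scores = {g: score_resume(v) for g, v in make_variants(resume_text).items()}
    avg = mean(scores.values())
    return {g: s - avg for g, s in scores.items()}
```

The paper's second method, ranking, follows the same perturbation logic but asks the model to order a batch of variant resumes rather than score them individually.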

Repos / Data Links
https://github.com/athenawen/FAIRE-Fairness-Assessment-In-Resume-Evaluation.git

Page Count
14 pages

Category
Computer Science: Computation and Language