Score: 0

Labels have Human Values: Value Calibration of Subjective Tasks

Published: January 10, 2026 | arXiv ID: 2601.06631v1

By: Mohammed Fayiz Parappan, Ricardo Henao

Potential Business Impact:

Teaches computers to understand different opinions.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Building NLP systems for subjective tasks requires one to ensure their alignment to contrasting human values. We propose the MultiCalibrated Subjective Task Learner framework (MC-STL), which clusters annotations into identifiable human value clusters by three approaches (similarity of annotator rationales, expert-value taxonomies or rater's sociocultural descriptors) and calibrates predictions for each value cluster by learning cluster-specific embeddings. We demonstrate MC-STL on several subjective learning settings, including ordinal, binary, and preference learning predictions, and evaluate it on multiple datasets covering toxic chatbot conversations, offensive social media posts, and human preference alignment. The results show that MC-STL consistently outperforms the baselines that ignore the latent value structure of the annotations, delivering gains in discrimination, value-specific calibration, and disagreement-aware metrics.

Country of Origin
πŸ‡ΊπŸ‡Έ United States

Page Count
19 pages

Category
Computer Science:
Computation and Language