Labels have Human Values: Value Calibration of Subjective Tasks
By: Mohammed Fayiz Parappan, Ricardo Henao
Potential Business Impact:
Teaches computers to understand different opinions.
Building NLP systems for subjective tasks requires aligning them with contrasting human values. We propose the MultiCalibrated Subjective Task Learner framework (MC-STL), which groups annotations into identifiable human value clusters using one of three approaches (similarity of annotator rationales, expert value taxonomies, or raters' sociocultural descriptors) and calibrates predictions for each value cluster by learning cluster-specific embeddings. We demonstrate MC-STL in several subjective learning settings, including ordinal, binary, and preference prediction, and evaluate it on multiple datasets covering toxic chatbot conversations, offensive social media posts, and human preference alignment. The results show that MC-STL consistently outperforms baselines that ignore the latent value structure of the annotations, with gains in discrimination, value-specific calibration, and disagreement-aware metrics.
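To make "value-specific calibration" concrete, the sketch below computes a binned expected calibration error (ECE) separately for each value cluster in a binary task. It is an illustration only, not the MC-STL implementation: the abstract does not state which calibration metric the authors use, and the function names, the 10-bin default, and the toy cluster labels "a" and "b" are all assumptions made here.

```python
from collections import defaultdict


def expected_calibration_error(probs, labels, n_bins=10):
    """Binned ECE for binary predictions (assumed metric, not from the paper).

    Returns the sample-weighted mean gap between the average predicted
    probability and the observed positive rate within each bin.
    """
    bins = defaultdict(list)
    for p, y in zip(probs, labels):
        # Clamp p == 1.0 into the last bin.
        idx = min(int(p * n_bins), n_bins - 1)
        bins[idx].append((p, y))

    total = len(probs)
    ece = 0.0
    for items in bins.values():
        mean_conf = sum(p for p, _ in items) / len(items)
        pos_rate = sum(y for _, y in items) / len(items)
        ece += (len(items) / total) * abs(pos_rate - mean_conf)
    return ece


def ece_per_cluster(probs, labels, clusters, n_bins=10):
    """Group predictions by (hypothetical) value-cluster id and score each group."""
    grouped = defaultdict(lambda: ([], []))
    for p, y, c in zip(probs, labels, clusters):
        grouped[c][0].append(p)
        grouped[c][1].append(y)
    return {
        c: expected_calibration_error(ps, ys, n_bins)
        for c, (ps, ys) in grouped.items()
    }


# Toy data: cluster "a" is close to calibrated, cluster "b" is not.
probs = [0.9, 0.9, 0.1, 0.1]
labels = [1, 1, 0, 1]
clusters = ["a", "a", "b", "b"]
per_cluster = ece_per_cluster(probs, labels, clusters)
```

A model can look well calibrated when all annotations are pooled and still be poorly calibrated for one cluster, which is why the score is reported per cluster here.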
Similar Papers
Taking a SEAT: Predicting Value Interpretations from Sentiment, Emotion, Argument, and Topic Annotations
Computation and Language
AI learns how people see the world differently.
Humans Hallucinate Too: Language Models Identify and Correct Subjective Annotation Errors With Label-in-a-Haystack Prompts
Computation and Language
Helps computers understand feelings and right from wrong.
Training and Evaluating with Human Label Variation: An Empirical Study
Machine Learning (CS)
Teaches computers to learn from many different opinions.