Cultural Compass: A Framework for Organizing Societal Norms to Detect Violations in Human-AI Conversations
By: Myra Cheng , Vinodkumar Prabhakaran , Alice Oh and more
Potential Business Impact:
Teaches AI to follow rules in different countries.
Generative AI models ought to be useful and safe across cross-cultural contexts. One critical step toward this goal is understanding how AI models adhere to sociocultural norms. While this challenge has gained attention in NLP, existing work lacks both nuance and coverage in understanding and evaluating models' norm adherence. We address these gaps by introducing a taxonomy of norms that clarifies their contexts (e.g., distinguishing between human-human norms that models should recognize and human-AI interactional norms that apply to the human-AI interaction itself), specifications (e.g., relevant domains), and mechanisms (e.g., modes of enforcement). We demonstrate how our taxonomy can be operationalized to automatically evaluate models' norm adherence in naturalistic, open-ended settings. Our exploratory analyses suggest that state-of-the-art models frequently violate norms, though violation rates vary by model, interactional context, and country. We further show that violation rates also vary by prompt intent and situational framing. Our taxonomy and demonstrative evaluation pipeline enable nuanced, context-sensitive evaluation of cultural norm adherence in realistic settings.
Similar Papers
Explainable Ethical Assessment on Human Behaviors by Generating Conflicting Social Norms
Computers and Society
AI learns right from wrong by seeing good and bad reasons.
Exploring Artificial Intelligence and Culture: Methodology for a comparative study of AI's impact on norms, trust, and problem-solving across academic and business environments
Human-Computer Interaction
Shows how people and AI learn together.
Reasoning Shapes Alignment: Investigating Cultural Alignment in Large Reasoning Models with Cultural Norms
Artificial Intelligence
Teaches computers to understand different cultures.