Unified Game Moderation: Soft-Prompting and LLM-Assisted Label Transfer for Resource-Efficient Toxicity Detection
By: Zachary Yang, Domenico Tullo, Reihaneh Rabbany
Potential Business Impact:
Finds toxic players faster, across many games and languages, with a single model.
Toxicity detection in gaming communities faces significant scaling challenges when expanding across multiple games and languages, particularly in real-time environments where computational efficiency is crucial. Building on our previous work on ToxBuster, a BERT-based real-time toxicity detection system, we present two key contributions to address these challenges. First, we introduce a soft-prompting approach that enables a single model to handle multiple games effectively by incorporating game-context tokens, matching the performance of more complex methods such as curriculum learning while offering superior scalability. Second, we develop an LLM-assisted label transfer framework using GPT-4o-mini to extend support to seven additional languages. Evaluations on real game chat data in French, German, Portuguese, and Russian yield macro F1-scores ranging from 32.96% to 58.88%, with particularly strong performance in German, which surpasses the English benchmark of 45.39%. In production, this unified approach significantly reduces computational resources and maintenance overhead compared to maintaining a separate model for each game and language combination. At Ubisoft, the model identifies an average of 50 players per game per day engaging in sanctionable behavior.
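To make the game-context idea concrete, here is a minimal sketch of how a single BERT classifier could be conditioned on the game via added context tokens. The token names ([GAME_A], [GAME_B], ...), the base checkpoint, and the binary label set are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal sketch: one toxicity classifier shared across games,
# conditioned by a game-context token prepended to each chat line.
# Assumptions: token names, checkpoint, and two-class labels are illustrative.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

GAME_TOKENS = ["[GAME_A]", "[GAME_B]", "[GAME_C]"]  # one context token per supported game

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # toxic vs. non-toxic; the real label set may be finer
)

# Register the game tokens so their embeddings can be learned during fine-tuning.
tokenizer.add_special_tokens({"additional_special_tokens": GAME_TOKENS})
model.resize_token_embeddings(len(tokenizer))

def classify(chat_line: str, game_token: str) -> torch.Tensor:
    """Prepend the game-context token so the same model serves every game."""
    inputs = tokenizer(f"{game_token} {chat_line}", return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return logits.softmax(dim=-1)

print(classify("gg ez, uninstall", "[GAME_A]"))
```

Because only the small set of game-token embeddings is game-specific, adding a new game means adding one token rather than training and serving another model.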
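The LLM-assisted label transfer step can be sketched as follows: GPT-4o-mini is asked to project the existing label schema onto chat written in a new language, producing training pairs for the multilingual classifier. The label taxonomy, prompt wording, and helper name transfer_label below are hypothetical, assuming the standard OpenAI chat-completions client.

```python
# Minimal sketch: LLM-assisted label transfer for a new language with GPT-4o-mini.
# Assumptions: label taxonomy and prompt are illustrative, not the production prompt;
# requires OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

LABELS = ["non-toxic", "insult", "hate-speech", "threat"]  # hypothetical taxonomy

def transfer_label(chat_line: str, language: str) -> str:
    """Ask GPT-4o-mini to assign one label from the schema to foreign-language chat."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        temperature=0,
        messages=[
            {
                "role": "system",
                "content": f"You label in-game chat written in {language}. "
                           f"Answer with exactly one of: {', '.join(LABELS)}.",
            },
            {"role": "user", "content": chat_line},
        ],
    )
    return response.choices[0].message.content.strip()

# The resulting (chat_line, label) pairs become training data for the
# shared multilingual classifier sketched above.
print(transfer_label("du bist so schlecht, verlass das spiel", "German"))
```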
Similar Papers
Context-Aware Toxicity Detection in Multiplayer Games: Integrating Domain-Adaptive Pretraining and Match Metadata
Computation and Language
Helps games catch toxic players more accurately by using match context.
Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development
Computers and Society
Uses AI to assess and develop students' prompt-writing skills.
Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling
Computation and Language
Makes toxicity detection more robust across different user personas.