Combating Toxic Language: A Review of LLM-Based Strategies for Software Engineering
By: Hao Zhuo, Yicheng Yang, Kewen Peng
Potential Business Impact:
Finds and cleans up toxic language in software development discussions and tools.
Large Language Models (LLMs) have become integral to software engineering (SE), where they are increasingly used in development workflows. However, their widespread use raises concerns about the presence and propagation of toxic language, that is, harmful or offensive content that can foster exclusionary environments. This paper provides a comprehensive review of recent research on toxicity detection and mitigation, focusing on both SE-specific and general-purpose datasets. We examine annotation and preprocessing techniques, assess detection methodologies, and evaluate mitigation strategies, particularly those leveraging LLMs. Additionally, we conduct an ablation study demonstrating the effectiveness of LLM-based rewriting for reducing toxicity. By synthesizing existing work and identifying open challenges, this review highlights key areas for future research to ensure the responsible deployment of LLMs in SE and beyond.
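To make the mitigation strategy concrete, the sketch below shows one common way LLM-based rewriting can be set up: a toxic code-review comment is passed to an LLM with an instruction to preserve the technical content while removing offensive language. This is an illustrative assumption on my part, using the OpenAI Python client (openai>=1.0); the paper does not prescribe this model, prompt, or function name, and in an ablation one would typically compare toxicity scores of the comment before and after rewriting.

# Minimal sketch of LLM-based detox rewriting (illustrative only).
# Assumes the OpenAI Python client; model name, prompt, and helper
# names are placeholders, not the paper's exact setup.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

DETOX_PROMPT = (
    "Rewrite the following software-engineering comment so that it keeps "
    "its technical meaning but removes any toxic, offensive, or demeaning "
    "language. Return only the rewritten comment."
)

def rewrite_toxic_comment(comment: str, model: str = "gpt-4o-mini") -> str:
    """Ask an LLM for a non-toxic paraphrase of a code-review comment."""
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system", "content": DETOX_PROMPT},
            {"role": "user", "content": comment},
        ],
        temperature=0.2,  # stay close to the original technical meaning
    )
    return response.choices[0].message.content.strip()

if __name__ == "__main__":
    original = "This patch is garbage, did you even read the docs?"
    print(rewrite_toxic_comment(original))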
Similar Papers
Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Software Engineering
Tests if AI coding helpers are safe.
Guardians and Offenders: A Survey on Harmful Content Generation and Safety Mitigation of LLM
Computation and Language
Makes AI safer and less likely to say bad things.