Increasing the Robustness of the Fine-tuned Multilingual Machine-Generated Text Detectors
By: Dominik Macko, Robert Moro, Ivan Srba
Potential Business Impact:
Finds fake writing online to stop lies.
Since the proliferation of LLMs, there have been concerns about their misuse for harmful content creation and spreading. Recent studies justify such fears, providing evidence of LLM vulnerabilities and high potential of their misuse. Humans are no longer able to distinguish between high-quality machine-generated and authentic human-written texts. Therefore, it is crucial to develop automated means to accurately detect machine-generated content. It would enable to identify such content in online information space, thus providing an additional information about its credibility. This work addresses the problem by proposing a robust fine-tuning process of LLMs for the detection task, making the detectors more robust against obfuscation and more generalizable to out-of-distribution data.
Similar Papers
Robust and Fine-Grained Detection of AI Generated Texts
Computation and Language
Finds AI writing mixed with human writing.
AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models
Computation and Language
Finds fake writing made by computers.
mdok of KInIT: Robustly Fine-tuned LLM for Binary and Multiclass AI-Generated Text Detection
Computation and Language
Finds fake writing from smart computer programs.