Score: 0

Anomaly Detection in Human Language via Meta-Learning: A Few-Shot Approach

Published: July 26, 2025 | arXiv ID: 2507.20019v1

By: Saurav Singla , Aarav Singla , Advik Gupta and more

Potential Business Impact:

Finds bad words in texts with little examples.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We propose a meta learning framework for detecting anomalies in human language across diverse domains with limited labeled data. Anomalies in language ranging from spam and fake news to hate speech pose a major challenge due to their sparsity and variability. We treat anomaly detection as a few shot binary classification problem and leverage meta-learning to train models that generalize across tasks. Using datasets from domains such as SMS spam, COVID-19 fake news, and hate speech, we evaluate model generalization on unseen tasks with minimal labeled anomalies. Our method combines episodic training with prototypical networks and domain resampling to adapt quickly to new anomaly detection tasks. Empirical results show that our method outperforms strong baselines in F1 and AUC scores. We also release the code and benchmarks to facilitate further research in few-shot text anomaly detection.

Page Count
35 pages

Category
Computer Science:
Computation and Language