Score: 1

PANER: A Paraphrase-Augmented Framework for Low-Resource Named Entity Recognition

Published: October 20, 2025 | arXiv ID: 2510.17720v1

By: Nanda Kumar Rengarajan, Jun Yan, Chun Wang

Potential Business Impact:

Helps computers find specific words in text.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Named Entity Recognition (NER) is a critical task that requires substantial annotated data, making it challenging in low-resource scenarios where label acquisition is expensive. While zero-shot and instruction-tuned approaches have made progress, they often fail to generalize to domain-specific entities and do not effectively utilize limited available data. We present a lightweight few-shot NER framework that addresses these challenges through two key innovations: (1) a new instruction tuning template with a simplified output format that combines principles from prior IT approaches to leverage the large context window of recent state-of-the-art LLMs; (2) introducing a strategic data augmentation technique that preserves entity information while paraphrasing the surrounding context, thereby expanding our training data without compromising semantic relationships. Experiments on benchmark datasets show that our method achieves performance comparable to state-of-the-art models on few-shot and zero-shot tasks, with our few-shot approach attaining an average F1 score of 80.1 on the CrossNER datasets. Models trained with our paraphrasing approach show consistent improvements in F1 scores of up to 17 points over baseline versions, offering a promising solution for groups with limited NER training data and compute power.

Meta-Pretraining for Zero-Shot Cross-Lingual Named Entity Recognition in Low-Resource Philippine Languages

Computation and Language

Teaches computers to understand new languages faster.

2 Sep 2025 0

88%

Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation

Computation and Language

Helps computers understand rare languages better.

7 Apr 2025 1

87%

Positional Attention for Efficient BERT-Based Named Entity Recognition

Computation and Language

Makes computers find important words faster.

3 May 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇦 Canada

Page Count

9 pages

PANER: A Paraphrase-Augmented Framework for Low-Resource Named Entity Recognition

Helps computers find specific words in text.

Technical Abstract

Meta-Pretraining for Zero-Shot Cross-Lingual Named Entity Recognition in Low-Resource Philippine Languages

Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation

Positional Attention for Efficient BERT-Based Named Entity Recognition