Combining Distantly Supervised Models with In Context Learning for Monolingual and Cross-Lingual Relation Extraction
By: Vipul Rathore , Malik Hammad Faisal , Parag Singla and more
Potential Business Impact:
Helps computers find relationships in text better.
Distantly Supervised Relation Extraction (DSRE) remains a long-standing challenge in NLP, where models must learn from noisy bag-level annotations while making sentence-level predictions. While existing state-of-the-art (SoTA) DSRE models rely on task-specific training, their integration with in-context learning (ICL) using large language models (LLMs) remains underexplored. A key challenge is that the LLM may not learn relation semantics correctly, due to noisy annotation. In response, we propose HYDRE -- HYbrid Distantly Supervised Relation Extraction framework. It first uses a trained DSRE model to identify the top-k candidate relations for a given test sentence, then uses a novel dynamic exemplar retrieval strategy that extracts reliable, sentence-level exemplars from training data, which are then provided in LLM prompt for outputting the final relation(s). We further extend HYDRE to cross-lingual settings for RE in low-resource languages. Using available English DSRE training data, we evaluate all methods on English as well as a newly curated benchmark covering four diverse low-resource Indic languages -- Oriya, Santali, Manipuri, and Tulu. HYDRE achieves up to 20 F1 point gains in English and, on average, 17 F1 points on Indic languages over prior SoTA DSRE models. Detailed ablations exhibit HYDRE's efficacy compared to other prompting strategies.
Similar Papers
Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training
Computation and Language
Teaches computers to find facts with little data.
LLM-OREF: An Open Relation Extraction Framework Based on Large Language Models
Computation and Language
Computers learn new facts without human help.
GLiDRE: Generalist Lightweight model for Document-level Relation Extraction
Computation and Language
Helps computers understand relationships between words in long texts.