LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval
By: Muhammad Rafsan Kabir , Rafeed Mohammad Sultan , Fuad Rahman and more
Potential Business Impact:
Finds legal information in police documents faster.
Natural Language Processing (NLP) and computational linguistic techniques are increasingly being applied across various domains, yet their use in legal and regulatory tasks remains limited. To address this gap, we develop an efficient bilingual question-answering framework for regulatory documents, specifically the Bangladesh Police Gazettes, which contain both English and Bangla text. Our approach employs modern Retrieval Augmented Generation (RAG) pipelines to enhance information retrieval and response generation. In addition to conventional RAG pipelines, we propose an advanced RAG-based approach that improves retrieval performance, leading to more precise answers. This system enables efficient searching for specific government legal notices, making legal information more accessible. We evaluate both our proposed and conventional RAG systems on a diverse test set on Bangladesh Police Gazettes, demonstrating that our approach consistently outperforms existing methods across all evaluation metrics.
Similar Papers
All for law and law for all: Adaptive RAG Pipeline for Legal Research
Computation and Language
Helps lawyers find correct legal answers faster.
All for law and law for all: Adaptive RAG Pipeline for Legal Research
Computation and Language
Helps lawyers find correct legal answers faster.
Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task
Computation and Language
Helps computers answer questions in any language.