An Artificial Intelligence Driven Semantic Similarity-Based Pipeline for Rapid Literature
By: Abhiyan Dhakal, Kausik Paudel, Sanjog Sigdel
Potential Business Impact:
Finds important research papers automatically.
We propose an automated pipeline for performing literature reviews using semantic similarity. Unlike traditional systematic review systems or optimization based methods, this work emphasizes minimal overhead and high relevance by using transformer based embeddings and cosine similarity. By providing a paper title and abstract, it generates relevant keywords, fetches relevant papers from open access repository, and ranks them based on their semantic closeness to the input. Three embedding models were evaluated. A statistical thresholding approach is then applied to filter relevant papers, enabling an effective literature review pipeline. Despite the absence of heuristic feedback or ground truth relevance labels, the proposed system shows promise as a scalable and practical tool for preliminary research and exploratory analysis.
Similar Papers
Intelligent Scientific Literature Explorer using Machine Learning (ISLE)
Information Retrieval
Helps scientists find and understand research faster.
An Auditable Pipeline for Fuzzy Full-Text Screening in Systematic Reviews: Integrating Contrastive Semantic Highlighting and LLM Judgment
Computation and Language
Helps doctors quickly find important health study facts.
Academic Literature Recommendation in Large-scale Citation Networks Enhanced by Large Language Models
Applications
Finds the best science papers for researchers.