What's In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models
By: Abhipsha Das , Nicholas Lourie , Siavash Golkar and more
Potential Business Impact:
Helps scientists find answers in millions of papers.
The scientific literature's exponential growth makes it increasingly challenging to navigate and synthesize knowledge across disciplines. Large language models (LLMs) are powerful tools for understanding scientific text, but they fail to capture detailed relationships across large bodies of work. Unstructured approaches, like retrieval augmented generation, can sift through such corpora to recall relevant facts; however, when millions of facts influence the answer, unstructured approaches become cost prohibitive. Structured representations offer a natural complement -- enabling systematic analysis across the whole corpus. Recent work enhances LLMs with unstructured or semistructured representations of scientific concepts; to complement this, we try extracting structured representations using LLMs. By combining LLMs' semantic understanding with a schema of scientific concepts, we prototype a system that answers precise questions about the literature as a whole. Our schema applies across scientific fields and we extract concepts from it using only 20 manually annotated abstracts. To demonstrate the system, we extract concepts from 30,000 papers on arXiv spanning astrophysics, fluid dynamics, and evolutionary biology. The resulting database highlights emerging trends and, by visualizing the knowledge graph, offers new ways to explore the ever-growing landscape of scientific knowledge. Demo: abby101/surveyor-0 on HF Spaces. Code: https://github.com/chiral-carbon/kg-for-science.
Similar Papers
Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces
Digital Libraries
Helps organize science papers faster with smart AI.
From Symbolic to Neural and Back: Exploring Knowledge Graph-Large Language Model Synergies
Computation and Language
Makes computers smarter by connecting facts.
Are Large Language Models Effective Knowledge Graph Constructors?
Computation and Language
Helps computers build better knowledge maps.