From keywords to semantics: Perceptions of large language models in data discovery
By: Maura E Halstead, Mark A. Green, Caroline Jay and more
Potential Business Impact:
Lets researchers find scientific data without knowing the exact wording.
Current approaches to data discovery match keywords between metadata and queries. This matching requires researchers to know the exact wording that other researchers previously used, which makes the process difficult and can cause relevant data to be missed. Large Language Models (LLMs) could enhance data discovery by removing this requirement and allowing researchers to ask questions in natural language. However, we do not currently know whether researchers would accept LLMs for data discovery. Using a human-centered artificial intelligence (HCAI) focus, we ran focus groups (N = 27) to understand researchers' perspectives on LLMs for data discovery. Our conceptual model shows that the potential benefits are not enough for researchers to use LLMs instead of current technology. Barriers prevent researchers from fully accepting LLMs, but transparency features could overcome them. Our model can help developers incorporate features that increase acceptance of LLMs for data discovery.
Similar Papers
Current and Future Use of Large Language Models for Knowledge Work
Human-Computer Interaction
Helps people use AI to do work faster.
LLM/Agent-as-Data-Analyst: A Survey
Artificial Intelligence
Computers understand and analyze all kinds of data.
LLM-Based Information Extraction to Support Scientific Literature Research and Publication Workflows
Digital Libraries
Helps find important ideas in science papers.