Unveiling Challenges for LLMs in Enterprise Data Engineering
By: Jan-Micha Bodensohn, Ulf Brackmann, Liane Vogel and more
Potential Business Impact:
Shows where LLMs struggle with large, complex company data, which supports their adoption in industry.
Large Language Models (LLMs) have demonstrated significant potential for automating data engineering tasks on tabular data, giving enterprises a valuable opportunity to reduce the high costs associated with manual data handling. However, the enterprise domain introduces unique challenges that existing LLM-based approaches for data engineering often overlook, such as large table sizes, more complex tasks, and the need for internal knowledge. To bridge these gaps, we identify key enterprise-specific challenges related to data, tasks, and background knowledge and conduct a comprehensive study of their impact on recent LLMs for data engineering. Our analysis reveals that LLMs face substantial limitations in real-world enterprise scenarios, resulting in significant accuracy drops. Our findings contribute to a systematic understanding of LLMs for enterprise data engineering to support their adoption in industry.
Similar Papers
LLM-Powered Knowledge Graphs for Enterprise Intelligence and Analytics
Artificial Intelligence
Connects all your work data for smarter answers.
Research Challenges in Relational Database Management Systems for LLM Queries
Databases
Makes computer databases understand and use smart language.
Large Language Models in the Data Science Lifecycle: A Systematic Mapping Study
Computers and Society
Helps computers do data science tasks better.