Large Language Models (LLMs) for Source Code Analysis: applications, models and datasets

Published: March 21, 2025 | arXiv ID: 2503.17502v1

By: Hamed Jelodar, Mohammad Meymani, Roozbeh Razavi-Far

Potential Business Impact:

Helps computers understand and fix computer code.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Large language models (LLMs) and transformer-based architectures are increasingly used for source code analysis. As software systems grow in complexity, integrating LLMs into code analysis workflows becomes essential for improving efficiency, accuracy, and automation. This paper explores the role of LLMs in different code analysis tasks, focusing on three key aspects: 1) what they can analyze and their applications, 2) which models are used, and 3) which datasets are used and the challenges they face. To this end, we survey scholarly articles that explore the use of LLMs for source code analysis to uncover research developments, current trends, and the intellectual structure of this emerging field. Additionally, we summarize limitations and highlight essential tools, datasets, and key challenges, which could be valuable for future work.

Large Language Model (LLM) for Software Security: Code Analysis, Malware Analysis, Reverse Engineering

Cryptography and Security

Helps computers find computer viruses faster.

7 Apr 2025

92%

Large Language Models for Code Generation: A Comprehensive Survey of Challenges, Techniques, Evaluation, and Applications

Software Engineering

Lets anyone write computer programs with plain English.

3 Mar 2025

92%

Large Language Models in the Data Science Lifecycle: A Systematic Mapping Study

Computers and Society

Helps computers do data science tasks better.

12 Aug 2025

Country of Origin

🇨🇦 Canada

Page Count

24 pages
