Relative Positioning Based Code Chunking Method For Rich Context Retrieval In Repository Level Code Completion Task With Code Language Model
By: Imranur Rahman, Md Rayhanur Rahman
Potential Business Impact:
Helps computers finish writing code faster.
Code completion can help developers improve efficiency and ease the development lifecycle. Although code completion is available in modern integrated development environments (IDEs), research lacks in determining what makes a good context for code completion based on the information available to the IDEs for the large language models (LLMs) to perform better. In this paper, we describe an effective context collection strategy to assist the LLMs in performing better at code completion tasks. The key idea of our strategy is to preprocess the repository into smaller code chunks and later use syntactic and semantic similarity-based code chunk retrieval with relative positioning. We found that code chunking and relative positioning of the chunks in the final context improve the performance of code completion tasks.
Similar Papers
Beyond More Context: How Granularity and Order Drive Code Completion Quality
Software Engineering
Helps computers write better code by finding good examples.
Impact-driven Context Filtering For Cross-file Code Completion
Software Engineering
Helps computers write better code by picking good examples.
Completion by Comprehension: Guiding Code Generation with Multi-Granularity Understanding
Software Engineering
Helps computers write better code by understanding its structure.