BanglaForge: LLM Collaboration with Self-Refinement for Bangla Code Generation
By: Mahir Labib Dihan, Sadif Ahmed, Md Nafiu Rahman
Potential Business Impact:
Helps computers write code from Bengali words.
Bangla is a low-resource language for code generation, lacking large-scale annotated datasets and tools to transform natural language specifications into executable programs. This makes Bangla-to-code generation a challenging task requiring innovative solutions. To address this, we introduce BanglaForge, a novel framework for generating code from Bangla function descriptions. BanglaForge leverages a retrieval-augmented dual-model collaboration paradigm with self-refinement, combining in-context learning, llm-based translation, systematic prompt engineering, and iterative self-refinement based on execution feedback, where a coder generates initial solutions and a reviewer enhances them for robustness. On the BLP-2025 Bangla Code Generation benchmark, BanglaForge achieves a competitive Pass@1 accuracy of 84.00%, demonstrating the effectiveness of retrieval, model collaboration, and self-refinement for low-resource Bangla code generation.
Similar Papers
Retriv at BLP-2025 Task 2: Test-Driven Feedback-Guided Framework for Bangla-to-Python Code Generation
Computation and Language
Helps computers write code from Bengali instructions.
TigerCoder: A Novel Suite of LLMs for Code Generation in Bangla
Computation and Language
Helps computers write computer code in Bangla.
Enhancing LLM Code Generation Capabilities through Test-Driven Development and Code Interpreter
Software Engineering
Helps computers write Bengali code easily.