Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development
By: Yuxuan Wan , Tingshuo Liang , Jiakai Xu and more
Potential Business Impact:
Builds complete websites from just a description.
Developing full-stack web applications is complex and time-intensive, demanding proficiency across diverse technologies and frameworks. Although recent advances in multimodal large language models (MLLMs) enable automated webpage generation from visual inputs, current solutions remain limited to front-end tasks and fail to deliver fully functional applications. In this work, we introduce TDDev, the first test-driven development (TDD)-enabled LLM-agent framework for end-to-end full-stack web application generation. Given a natural language description or design image, TDDev automatically derives executable test cases, generates front-end and back-end code, simulates user interactions, and iteratively refines the implementation until all requirements are satisfied. Our framework addresses key challenges in full-stack automation, including underspecified user requirements, complex interdependencies among multiple files, and the need for both functional correctness and visual fidelity. Through extensive experiments on diverse application scenarios, TDDev achieves a 14.4% improvement on overall accuracy compared to state-of-the-art baselines, demonstrating its effectiveness in producing reliable, high-quality web applications without requiring manual intervention.
Similar Papers
TENET: Leveraging Tests Beyond Validation for Code Generation
Software Engineering
Helps AI write better code by testing it.
Leveraging Test Driven Development with Large Language Models for Reliable and Verifiable Spreadsheet Code Generation: A Research Framework
Software Engineering
Makes computer code more correct and trustworthy.
Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation
Software Engineering
Helps computers write code from tests.