
VSA: Visual-Structural Alignment for UI-to-Code

Published: December 23, 2025 | arXiv ID: 2512.20034v1

By: Xian Wu, Ming Zhang, Zhiyu Fang and more

The automation of user interface development has the potential to accelerate software delivery by mitigating intensive manual implementation. Despite the advancements in Large Multimodal Models for design-to-code translation, existing methodologies predominantly yield unstructured, flat codebases that lack compatibility with component-oriented libraries such as React or Angular. Such outputs typically exhibit low cohesion and high coupling, complicating long-term maintenance. In this paper, we propose Visual-Structural Alignment (VSA), a multi-stage paradigm designed to synthesize organized frontend assets through visual-structural alignment. Our approach first employs a spatial-aware transformer to reconstruct the visual input into a hierarchical tree representation. Moving beyond basic layout extraction, we integrate an algorithmic pattern-matching layer to identify recurring UI motifs and encapsulate them into modular templates. These templates are then processed via a schema-driven synthesis engine, ensuring the Large Language Model generates type-safe, prop-drilled components suitable for production environments. Experimental results indicate that our framework yields a substantial improvement in code modularity and architectural consistency over state-of-the-art benchmarks, effectively bridging the gap between raw pixels and scalable software engineering.
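The pattern-matching step described in the abstract, which finds recurring UI motifs in a hierarchical tree, can be illustrated with a minimal sketch. This is not the paper's algorithm: the `Node` type, the `signature` function, and the `find_motifs` helper are hypothetical names chosen for illustration, and the sketch assumes that two subtrees count as the same motif when their element kinds and nesting match exactly.

```python
from __future__ import annotations

from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Node:
    """One element of a hypothetical layout tree (not the paper's actual schema)."""
    kind: str
    children: list[Node] = field(default_factory=list)


def signature(node: Node) -> str:
    """Structural signature: the node kind plus its children's signatures, in order."""
    inner = ",".join(signature(child) for child in node.children)
    return f"{node.kind}({inner})"


def find_motifs(root: Node, min_count: int = 2) -> dict[str, int]:
    """Count every non-leaf subtree shape and keep those seen at least min_count times."""
    counts: dict[str, int] = defaultdict(int)
    stack = [root]
    while stack:
        node = stack.pop()
        if node.children:  # leaves are too small to be useful templates
            counts[signature(node)] += 1
        stack.extend(node.children)
    return {sig: n for sig, n in counts.items() if n >= min_count}


def card() -> Node:
    """A repeated 'card' shape: image, heading, button."""
    return Node("div", [Node("img"), Node("h3"), Node("button")])


# A page with a header and three structurally identical cards.
page = Node("main", [Node("header", [Node("h1")]), card(), card(), card()])
motifs = find_motifs(page)
print(motifs)
```

Each signature that survives the threshold is a candidate for extraction into a reusable component template. Exact structural matching is the simplest possible criterion; a real system would likely need to tolerate small variations between instances.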

Modular Layout Synthesis (MLS): Front-end Code via Structure Normalization and Constrained Generation

Information Retrieval

Builds websites faster with smarter code.

22 Dec 2025 0

87%

Enhancing Visual Sentiment Analysis via Semiotic Isotopy-Guided Dataset Construction

CV and Pattern Recognition

Helps computers understand feelings in pictures better.

16 Dec 2025 0

86%

VISCA: Inferring Component Abstractions for Automated End-to-End Testing

Software Engineering

Helps computers test websites better.

4 Jun 2025 1
