ScriptViT: Vision Transformer-Based Personalized Handwriting Generation
By: Sajjan Acharya, Rajendra Baskota
Potential Business Impact:
Makes computer handwriting look like real people's writing.
Styled handwriting generation aims to synthesize handwritten text that looks both realistic and aligned with a specific writer's style. While recent approaches involving GAN, transformer and diffusion-based models have made progress, they often struggle to capture the full spectrum of writer-specific attributes, particularly global stylistic patterns that span long-range spatial dependencies. As a result, capturing subtle writer-specific traits such as consistent slant, curvature or stroke pressure, while keeping the generated text accurate is still an open problem. In this work, we present a unified framework designed to address these limitations. We introduce a Vision Transformer-based style encoder that learns global stylistic patterns from multiple reference images, allowing the model to better represent long-range structural characteristics of handwriting. We then integrate these style cues with the target text using a cross-attention mechanism, enabling the system to produce handwritten images that more faithfully reflect the intended style. To make the process more interpretable, we utilize Salient Stroke Attention Analysis (SSAA), which reveals the stroke-level features the model focuses on during style transfer. Together, these components lead to handwriting synthesis that is not only more stylistically coherent, but also easier to understand and analyze.
Similar Papers
Quo Vadis Handwritten Text Generation for Handwritten Text Recognition?
CV and Pattern Recognition
Makes old handwriting easier for computers to read.
HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition
CV and Pattern Recognition
Helps computers read messy handwriting better.
Autoregressive Styled Text Image Generation, but Make it Reliable
CV and Pattern Recognition
Makes computers write text that looks like real handwriting.