POSESTITCH-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation
By: Abhinav Joshi , Vaibhav Sharma , Sanjeet Singh and more
Potential Business Impact:
Teaches computers to translate sign language faster.
Sign language translation remains a challenging task due to the scarcity of large-scale, sentence-aligned datasets. Prior arts have focused on various feature extraction and architectural changes to support neural machine translation for sign languages. We propose POSESTITCH-SLT, a novel pre-training scheme that is inspired by linguistic-templates-based sentence generation technique. With translation comparison on two sign language datasets, How2Sign and iSign, we show that a simple transformer-based encoder-decoder architecture outperforms the prior art when considering template-generated sentence pairs in training. We achieve BLEU-4 score improvements from 1.97 to 4.56 on How2Sign and from 0.55 to 3.43 on iSign, surpassing prior state-of-the-art methods for pose-based gloss-free translation. The results demonstrate the effectiveness of template-driven synthetic supervision in low-resource sign language settings.
Similar Papers
SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision
Computation and Language
Translates sign language to many spoken languages.
Sign Language Translation with Sentence Embedding Supervision
Computation and Language
Teaches computers to translate sign language without labels.
Spatio-temporal Sign Language Representation and Translation
Computation and Language
Translates sign language into written words.