Bangla Sign Language Translation: Dataset Creation Challenges, Benchmarking and Prospects

Published: November 26, 2025 | arXiv ID: 2511.21533v1

By: Husne Ara Rubaiyeat, Hasan Mahmud, Md Kamrul Hasan

Potential Business Impact:

Helps computers understand Bangla sign language.

Business Areas:

Semantic Web Internet Services

Bangla Sign Language Translation (BdSLT) has been severely constrained because the language itself is very low-resource. Creating a standard sentence-level dataset for BdSLT is of immense importance for developing AI-based assistive tools for deaf and hard-of-hearing people in the Bangla-speaking community. In this paper, we present a dataset, IsharaKhobor, and two subsets of it to enable research. We also present the challenges encountered in developing the dataset and suggest a way forward by benchmarking with landmark-based raw and RQE embeddings. We perform ablations on vocabulary restriction and on canonicalization of that vocabulary within the dataset, which resulted in two more datasets, IsharaKhobor_small and IsharaKhobor_canonical_small. The dataset is publicly available at: www.kaggle.com/datasets/hasanssl/isharakhobor [1].

IsharaKotha: A Comprehensive Avatar-based Bangla Sign Language Corpus

Human-Computer Interaction

Translates Bangla text into sign language animations.

21 Nov 2025

91%

Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research

Computation and Language

Helps computers understand Bangla sign language sentences.

11 Nov 2025

88%

Isharah: A Large-Scale Multi-Scene Dataset for Continuous Sign Language Recognition

CV and Pattern Recognition

Helps computers understand real-world sign language.

4 Jun 2025

Page Count

14 pages
