From Data to Dialogue: Unlocking Language for All
By: Dakota Ellis , Samy Bakikerali , Wanshan Chen and more
Potential Business Impact:
Helps people learn languages faster with fewer words.
Traditional linguists have proposed the use of a General Service List (GSL) to assist new language learners in identifying the most important words in English. This process requires linguistic expertise, subjective input, and a considerable amount of time. We attempt to create our own GSL and evaluate its practicality against the industry standard (The NGSL). We found creating a Specialized Word List (SWL), or a word list specific to a subset of the overall corpus, to be the most practical way for language-learners to optimize the process. The SWL's that we created using our model outperformed the industry standard, reaching the 95% coverage required for language comprehension with fewer words comparatively. By restricting the SWL process to objective criteria only, it can be automated, scaled, and tailored to the needs of language-learners across the globe.
Similar Papers
Linguists should learn to love speech-based deep learning models
Computation and Language
Helps computers understand talking, not just writing.
Cross-Lingual Interleaving for Speech Language Models
Computation and Language
Helps computers understand many languages from talking.
Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review
Computation and Language
Helps computers talk in less common languages.