Neural Decoding of Overt Speech from ECoG Using Vision Transformers and Contrastive Representation Learning
By: Mohamed Baha Ben Ticha , Xingchen Ran , Guillaume Saldanha and more
Potential Business Impact:
Lets paralyzed people talk by reading brain signals.
Speech Brain Computer Interfaces (BCIs) offer promising solutions to people with severe paralysis unable to communicate. A number of recent studies have demonstrated convincing reconstruction of intelligible speech from surface electrocorticographic (ECoG) or intracortical recordings by predicting a series of phonemes or words and using downstream language models to obtain meaningful sentences. A current challenge is to reconstruct speech in a streaming mode by directly regressing cortical signals into acoustic speech. While this has been achieved recently using intracortical data, further work is needed to obtain comparable results with surface ECoG recordings. In particular, optimizing neural decoders becomes critical in this case. Here we present an offline speech decoding pipeline based on an encoder-decoder deep neural architecture, integrating Vision Transformers and contrastive learning to enhance the direct regression of speech from ECoG signals. The approach is evaluated on two datasets, one obtained with clinical subdural electrodes in an epileptic patient, and another obtained with the fully implantable WIMAGINE epidural system in a participant of a motor BCI trial. To our knowledge this presents a first attempt to decode speech from a fully implantable and wireless epidural recording system offering perspectives for long-term use.
Similar Papers
Efficient Transformer-Integrated Deep Neural Architectures for Robust EEG Decoding of Complex Visual Imagery
Human-Computer Interaction
Lets people control robot arms with their thoughts.
Reconstructing Unseen Sentences from Speech-related Biosignals for Open-vocabulary Neural Communication
Human-Computer Interaction
Lets brains speak any new sentence.
A Penny for Your Thoughts: Decoding Speech from Inexpensive Brain Signals
Sound
Reads thoughts to make speech from brain waves.