Score: 0

NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation

Published: November 17, 2025 | arXiv ID: 2511.12851v1

By: Kang Yin, Hye-Bin Shin

Potential Business Impact:

Helps doctors understand brain wave reports better.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Clinical electroencephalogram (EEG) reports encode domain-specific linguistic conventions that general-purpose language models (LMs) fail to capture. We introduce NeuroLex, a lightweight domain-adaptive language model trained purely on EEG report text from the Harvard Electroencephalography Database. Unlike existing biomedical LMs, NeuroLex is tailored to the linguistic and diagnostic characteristics of EEG reporting, enabling it to serve as both an independent textual model and a decoder backbone for multimodal EEG-language systems. Using span-corruption pretraining and instruction-style fine-tuning on report polishing, paragraph summarization, and terminology question answering, NeuroLex learns the syntax and reasoning patterns characteristic of EEG interpretation. Comprehensive evaluations show that it achieves lower perplexity, higher extraction and summarization accuracy, better label efficiency, and improved robustness to negation and factual hallucination compared with general models of the same scale. With an EEG-aware linguistic backbone, NeuroLex bridges biomedical text modeling and brain-computer interface applications, offering a foundation for interpretable and language-driven neural decoding.

Large Language Models for EEG: A Comprehensive Survey and Taxonomy

Signal Processing

Lets computers understand brain signals like words.

2 Jun 2025 1

89%

NeuroLingua: A Language-Inspired Hierarchical Framework for Multimodal Sleep Stage Classification Using EEG and EOG

Machine Learning (CS)

Helps machines understand sleep stages better.

12 Nov 2025 1

88%

EEGAgent: A Unified Framework for Automated EEG Analysis Using Large Language Models

Machine Learning (CS)

Lets computers understand brain waves for health.

13 Nov 2025 0

View PDF Login to Bookmark

Page Count

4 pages

NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation

Helps doctors understand brain wave reports better.

Technical Abstract

Large Language Models for EEG: A Comprehensive Survey and Taxonomy

NeuroLingua: A Language-Inspired Hierarchical Framework for Multimodal Sleep Stage Classification Using EEG and EOG

EEGAgent: A Unified Framework for Automated EEG Analysis Using Large Language Models