An evaluation of DeepSeek Models in Biomedical Natural Language Processing
By: Zaifu Zhan , Shuang Zhou , Huixue Zhou and more
Potential Business Impact:
Helps computers understand medical writing better.
The advancement of Large Language Models (LLMs) has significantly impacted biomedical Natural Language Processing (NLP), enhancing tasks such as named entity recognition, relation extraction, event extraction, and text classification. In this context, the DeepSeek series of models have shown promising potential in general NLP tasks, yet their capabilities in the biomedical domain remain underexplored. This study evaluates multiple DeepSeek models (Distilled-DeepSeek-R1 series and Deepseek-LLMs) across four key biomedical NLP tasks using 12 datasets, benchmarking them against state-of-the-art alternatives (Llama3-8B, Qwen2.5-7B, Mistral-7B, Phi-4-14B, Gemma-2-9B). Our results reveal that while DeepSeek models perform competitively in named entity recognition and text classification, challenges persist in event and relation extraction due to precision-recall trade-offs. We provide task-specific model recommendations and highlight future research directions. This evaluation underscores the strengths and limitations of DeepSeek models in biomedical NLP, guiding their future deployment and optimization.
Similar Papers
DeepSeek in Healthcare: A Survey of Capabilities, Risks, and Clinical Applications of Open-Source Large Language Models
Computation and Language
Helps computers solve math and medical problems.
A Review of DeepSeek Models' Key Innovative Techniques
Machine Learning (CS)
Makes smart computer programs better and cheaper.
DeepSeek performs better than other Large Language Models in Dental Cases
Computation and Language
Helps dentists understand patient history better.