Investigating Recent Large Language Models for Vietnamese Machine Reading Comprehension
By: Anh Duc Nguyen, Hieu Minh Phi, Anh Viet Ngo, and more
Potential Business Impact:
Helps computers read and answer questions about Vietnamese text.
Large Language Models (LLMs) have shown remarkable proficiency in Machine Reading Comprehension (MRC) tasks; however, their effectiveness for low-resource languages like Vietnamese remains largely unexplored. In this paper, we fine-tune and evaluate two state-of-the-art LLMs, Llama 3 (8B parameters) and Gemma (7B parameters), on ViMMRC, a Vietnamese MRC dataset. Using Quantized Low-Rank Adaptation (QLoRA), we fine-tune these models efficiently and compare their performance against strong LLM-based baselines. Although our fine-tuned models are smaller than GPT-3 and GPT-3.5, they outperform both traditional BERT-based approaches and these larger models, demonstrating the effectiveness of our fine-tuning process and showing that modern LLMs can surpass older models like BERT while remaining suitable for deployment in resource-constrained environments. Through extensive analyses, we explore various aspects of model performance and provide valuable insights into adapting LLMs for low-resource languages such as Vietnamese. Our study contributes to the advancement of natural language processing in low-resource languages, and we make our fine-tuned models publicly available at https://huggingface.co/iaiuet.
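For readers curious about what QLoRA fine-tuning looks like in practice, the sketch below shows one common setup using the Hugging Face transformers and peft libraries: the base model is loaded in 4-bit precision and small low-rank adapter matrices are trained on top of it. This is a minimal illustration under assumed settings; the model ID, target modules, and LoRA hyperparameters (r, alpha, dropout) are placeholders, not the exact configuration reported in the paper.

```python
# Minimal QLoRA setup sketch (illustrative; not the paper's exact configuration).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "meta-llama/Meta-Llama-3-8B"  # or "google/gemma-7b"; assumed base model

# Load the base model quantized to 4-bit (the "Q" in QLoRA).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Attach low-rank adapters; only these small matrices are updated during training.
lora_config = LoraConfig(
    r=16,                      # adapter rank (assumed value)
    lora_alpha=32,             # scaling factor (assumed value)
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of total parameters
```

From here, the adapted model can be trained on ViMMRC-style question-answer prompts with a standard supervised fine-tuning loop, which is what makes 7B-8B models practical to adapt on modest hardware.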
Similar Papers
Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks
Computation and Language
Helps computers understand difficult Ukrainian exam questions.
Resource-Efficient Fine-Tuning of LLaMA-3.2-3B for Medical Chain-of-Thought Reasoning
Computation and Language
Makes AI better at medical questions while using less computing power.
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
Computation and Language
Tests how well AI understands less common languages.