ixi-GEN: Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining
By: Seonwu Kim, Yohan Na, Kihun Kim, and more
Potential Business Impact:
Makes small AI models work much better for businesses.
The emergence of open-source large language models (LLMs) has expanded opportunities for enterprise applications; however, many organizations still lack the infrastructure to deploy and maintain large-scale models. As a result, small LLMs (sLLMs) have become a practical alternative, despite their inherent performance limitations. While Domain Adaptive Continual Pretraining (DACP) has been previously explored as a method for domain adaptation, its utility in commercial applications remains under-examined. In this study, we validate the effectiveness of applying a DACP-based recipe across diverse foundation models and service domains. Through extensive experiments and real-world evaluations, we demonstrate that DACP-applied sLLMs achieve substantial gains in target domain performance while preserving general capabilities, offering a cost-efficient and scalable solution for enterprise-level deployment.
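The abstract describes DACP as continuing a base model's pretraining objective on an in-domain corpus before deployment. The paper itself does not publish its training code here, so the following is only a minimal sketch of that general idea using Hugging Face Transformers; the model name, corpus file, and hyperparameters are illustrative assumptions rather than the authors' recipe.

```python
# Minimal sketch of domain-adaptive continual pretraining (DACP):
# resume causal-LM training of a small open model on an in-domain corpus.
# Model name, corpus path, and hyperparameters are assumptions for illustration.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

MODEL_NAME = "meta-llama/Llama-3.2-1B"  # any small base model (assumption)

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)

# Plain-text domain corpus, one document per line (hypothetical file).
dataset = load_dataset("text", data_files={"train": "domain_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

tokenized = dataset["train"].map(tokenize, batched=True, remove_columns=["text"])

# Same next-token (causal LM) objective as pretraining; no masking.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)

args = TrainingArguments(
    output_dir="dacp-checkpoints",
    per_device_train_batch_size=4,
    gradient_accumulation_steps=8,
    learning_rate=1e-5,   # a conservative LR helps limit forgetting of general skills
    num_train_epochs=1,
    logging_steps=50,
    save_strategy="epoch",
)

Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=collator,
).train()
```

In practice, recipes in this vein also mix some general-domain text back into the continual-pretraining corpus to preserve general capabilities, which is the trade-off the abstract highlights.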
Similar Papers
Continual Pre-Training is (not) What You Need in Domain Adaption
Computation and Language
Teaches AI to understand laws better.
DACP: Domain-Adaptive Continual Pre-Training of Large Language Models for Phone Conversation Summarization
Computation and Language
Makes AI better at summarizing messy conversations.
Less Data, More Security: Advancing Cybersecurity LLMs Specialization via Resource-Efficient Domain-Adaptive Continuous Pre-training with Minimal Tokens
Computation and Language
Teaches computers to find computer security problems.