Score: 1

Ministral 3

Published: January 13, 2026 | arXiv ID: 2601.08584v1

By: Alexander H. Liu , Kartik Khandelwal , Sandeep Subramanian and more

Potential Business Impact:

Computers understand pictures and solve hard problems.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We introduce the Ministral 3 series, a family of parameter-efficient dense language models designed for compute and memory constrained applications, available in three model sizes: 3B, 8B, and 14B parameters. For each model size, we release three variants: a pretrained base model for general-purpose use, an instruction finetuned, and a reasoning model for complex problem-solving. In addition, we present our recipe to derive the Ministral 3 models through Cascade Distillation, an iterative pruning and continued training with distillation technique. Each model comes with image understanding capabilities, all under the Apache 2.0 license.

Repos / Data Links

Page Count
14 pages

Category
Computer Science:
Computation and Language