Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
By: NVIDIA, Aaron Blakeman, and more
Potential Business Impact:
Makes AI models more accurate and faster to run, while handling much longer documents and agent workflows.
We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid Mamba-Transformer language model. Nemotron 3 Nano was pretrained on 25 trillion text tokens, including more than 3 trillion unique tokens that are new relative to Nemotron 2, followed by supervised fine-tuning and large-scale RL across diverse environments. Nemotron 3 Nano achieves better accuracy than our previous-generation Nemotron 2 Nano while activating fewer than half as many parameters per forward pass. It delivers up to 3.3x higher inference throughput than similarly sized open models such as GPT-OSS-20B and Qwen3-30B-A3B-Thinking-2507, while also scoring higher on popular benchmarks. Nemotron 3 Nano demonstrates enhanced agentic, reasoning, and chat abilities and supports context lengths up to 1M tokens. We release both the pretrained Nemotron 3 Nano 30B-A3B Base and the post-trained Nemotron 3 Nano 30B-A3B checkpoints on Hugging Face.
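The "30B-A3B" naming reflects sparse expert activation: the model stores roughly 30B parameters but routes each token through only about 3B of them per forward pass. Below is a minimal sketch of top-k expert routing, the standard mechanism behind this kind of sparse activation. The class name, dimensions, expert count, and k here are illustrative assumptions, not the released model's actual architecture or configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Sparsely activated MLP layer: each token is routed to k of E experts,
    so only a fraction of the layer's parameters run per forward pass.
    All sizes below are illustrative, not Nemotron 3 Nano's configuration."""

    def __init__(self, d_model=512, d_ff=1024, n_experts=8, k=2):
        super().__init__()
        self.k = k
        # Router scores every expert per token; only the top-k are executed.
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                        # x: (tokens, d_model)
        logits = self.router(x)                  # (tokens, n_experts)
        weights, idx = logits.topk(self.k, dim=-1)
        weights = F.softmax(weights, dim=-1)     # normalize over the chosen k
        out = torch.zeros_like(x)
        for slot in range(self.k):               # run only the selected experts
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e         # tokens whose slot-th pick is expert e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * self.experts[e](x[mask])
        return out

tokens = torch.randn(16, 512)
print(TopKMoE()(tokens).shape)   # torch.Size([16, 512])
```

In a full hybrid stack of the kind the abstract describes, layers like this would be interleaved with Mamba-style sequence mixers and a small number of attention layers; those components are omitted here for brevity.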
Similar Papers
NVIDIA Nemotron 3: Efficient and Open Intelligence
Computation and Language
Makes computers smarter and faster for many tasks.
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Computation and Language
Makes computers think faster for hard problems.