Nemotron 3 Super

Released 2026-03 · reasoning · 1.0M tokens · 7 benchmarks · Open weight

Editorial notes

Lanzado 11 marzo 2026 en GTC 2026. Hybrid Mamba-Transformer MoE 120B / 12B activos (es el Super 120B-A12B). LatentMoE architecture. 5x throughput NVFP4 Blackwell. Supera GPT-OSS-120B con +10% throughput/GPU. #1 DeepResearch Bench. Scores del model card oficial nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 (AIME25 90.2, LiveCodeBench v5 81.2, HLE sin tools 18.3, SWE-V scaffold OpenHands 60.5). Auditoria 2026-06-08: MMLU-Pro corregido 83.3->83.7 y GPQA-D 79.4->79.2 (transcripcion imprecisa vs el 83.73/79.23 del card).

Spec sheet

Company: Nvidia
Country: US
Type: reasoning
Release: 2026-03
Context: 1.0M tokens
Params total: 120B
Params active (MoE): 12B
License: nvidia-open-model
Quants: BF16, Q8_0, Q5_K_M, Q4_K_M
Pricing (openrouter): $0.09/$0.45/M
Slug: nemotron-3-super

Quick install

2 tools

ollama.com

ollama run nemotron-3:super

Note: ~70 GB VRAM Q4_K_M

docs.vllm.ai

vllm serve nvidia/Nemotron-3-Super --tensor-parallel-size 2

Benchmarks (7)

Reasoning 3

Coding 2

Math 1

AIME-2025

American Invitational Mathematics Examination 2025.

90.2

Instruction 1

Arena-Hard

Hard prompts from the Arena — 500 challenging tasks.

73.9

Cite this model

BibTeX · APA

BibTeX

@misc{frontier-nemotron-3-super,
  title  = {Nemotron 3 Super},
  author = {{Nvidia}},
  year   = {2026},
  note   = {Frontier Benchmarks AI atlas. Accessed 2026-06-17},
  url    = {https://frontierbenchmarks.com/models/nemotron-3-super}
}

APA

Nvidia (2026). Nemotron 3 Super [Large language model]. Frontier Benchmarks AI. Retrieved 2026-06-17, from https://frontierbenchmarks.com/models/nemotron-3-super

Citation reflects the atlas page, not the original model paper. For the paper, see the "Resources" section above.

Battle vs another model ← All models More from Nvidia Methodology