NVIDIA Nemotron Labs 3 Elastic 12B A2B NVFP4

monology · released 2026-06-20 · other license

NVIDIA Nemotron Labs 3 Elastic 12B A2B NVFP4 by monology is a 7.33B-parameter open-weight local LLM. License: other. This page on slopllm.com lists benchmarks, VRAM requirements per quantization, GPU compatibility, pricing, and community reviews.

Key specs

Type: Local open-weight
Parameters: 7.33B total · MoE, active parameter count not listed
Architecture: nemotron_h
Context window: 262K tokens
Knowledge cutoff: not listed
Modalities: text
Recommended backends: not listed
Minimum viable rig: not listed

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro
BFCL v3 (tool use)
Composite score
Community rating: No reviews yet

VRAM & disk per quantization

Quant    VRAM    Disk    RAM  Context
Q4_K_M   5.8 GB  4.3 GB  -    262K
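
As a rough sanity check on the Q4_K_M row, the listed figures can be turned into an average bits-per-weight value and a VRAM headroom value. This is only a sketch. It assumes 1 GB means 10^9 bytes, that the 7.33B total counts every stored weight, and that the VRAM figure covers the full weight file plus runtime buffers; the page states none of this. The function names and the 8 GB example card are hypothetical and chosen for illustration.

```python
# Figures copied from the listing; everything else is an assumption.
TOTAL_PARAMS = 7.33e9   # "7.33B total"
DISK_GB = 4.3           # Q4_K_M disk size
VRAM_GB = 5.8           # Q4_K_M VRAM requirement


def implied_bits_per_weight(disk_gb: float, total_params: float) -> float:
    """Average bits stored per parameter, assuming 1 GB = 1e9 bytes."""
    return disk_gb * 1e9 * 8 / total_params


def vram_headroom_gb(vram_gb: float, disk_gb: float) -> float:
    """VRAM beyond the weight file, assuming the weights are loaded in full."""
    return vram_gb - disk_gb


def fits_on_gpu(vram_needed_gb: float, gpu_vram_gb: float) -> bool:
    """True when the card has at least the listed VRAM requirement."""
    return gpu_vram_gb >= vram_needed_gb


if __name__ == "__main__":
    print(f"bits per weight: {implied_bits_per_weight(DISK_GB, TOTAL_PARAMS):.2f}")
    print(f"VRAM headroom:   {vram_headroom_gb(VRAM_GB, DISK_GB):.1f} GB")
    print(f"fits on 8 GB:    {fits_on_gpu(VRAM_GB, 8.0)}")
```

Under those assumptions the row works out to roughly 4.7 bits per weight, with about 1.5 GB of VRAM beyond the weight file. The listing does not say what context length the VRAM figure assumes, so the headroom should not be read as a KV-cache budget for the full 262K window.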