NVIDIA Nemotron Labs 3 Elastic 12B A2B NVFP4 by monology — 7.33B open-weight local LLM. License: other. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopllm.com.
Key specs
| Type | Local open-weight |
|---|
| Parameters | 7.33B total · MoE, — active |
|---|
| Architecture | nemotron_h |
|---|
| Context window | 262K tokens |
|---|
| Knowledge cutoff | — |
|---|
| Modalities | text |
|---|
| Recommended backends | — |
|---|
| Minimum viable rig | — |
|---|
Benchmark scores
| GPQA Diamond | — |
|---|
| SWE-bench Verified | — |
|---|
| AIME | — |
|---|
| MMLU-Pro | — |
|---|
| BFCL v3 (tool use) | — |
|---|
| Composite score | — |
|---|
| Community rating | No reviews yet |
|---|
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|
| Q4_K_M | 5.8 GB | 4.3 GB | — | 262K |