FastContext 1.0 4B RL W4a16 G128

useful-quants · released 2026-06-23 · MIT license

FastContext 1.0 4B RL W4a16 G128 by useful-quants is a 4.05B-parameter open-weight local LLM released under the MIT license. slopllm.com lists its benchmarks, VRAM requirements per quantization, GPU compatibility, pricing, and community reviews.

Key specs

Type: Local open-weight
Parameters: 4.05B total
Architecture: qwen3
Context window: 262K tokens
Knowledge cutoff:
Modalities: text
Recommended backends:
Minimum viable rig:

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro
BFCL v3 (tool use)
Composite score
Community rating: No reviews yet

VRAM & disk per quantization

Quant    VRAM     Disk     RAM   Context
Q4_K_M   3.8 GB   2.3 GB   -     262K
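
As a rough sanity check on figures like these, the weight-only footprint can be estimated from the parameter count and the bits stored per weight. The sketch below is an illustration, not the site's or the publisher's method. It assumes the "W4a16 G128" in the model name means 4-bit weights with one 16-bit scale per group of 128 weights; the function name and defaults are hypothetical. It ignores zero-points, any layers kept at higher precision, file metadata, and the KV cache, so real disk and VRAM numbers will usually come out higher.

```python
def estimate_weight_gb(
    n_params: float,
    weight_bits: int = 4,     # assumed: "W4" = 4-bit weights
    group_size: int = 128,    # assumed: "G128" = one scale per 128 weights
    scale_bits: int = 16,     # assumed: each group scale stored in 16 bits
) -> float:
    """Rough weight-only size in decimal gigabytes (1 GB = 1e9 bytes)."""
    # Effective bits per weight = payload bits + amortized per-group scale.
    bits_per_weight = weight_bits + scale_bits / group_size
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # 4.05B parameters, taken from the "Parameters" field of the spec sheet.
    print(f"{estimate_weight_gb(4.05e9):.2f} GB")
```

Under these assumptions the estimate lands a little above 2 GB, below the 2.3 GB disk figure listed for Q4_K_M. A gap in that direction is expected, since the listed row is a different quantization format and the sketch leaves out everything except the quantized weights and their scales.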