Qwen3 8B Sftep2 Bal Klge Easysft 16bit Vllm

didula-wso2 · released 2026-06-23 · apache-2.0 license

Qwen3 8B Sftep2 Bal Klge Easysft 16bit Vllm by didula-wso2 — 8.19B open-weight local LLM. License: apache-2.0. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopllm.com.

Key specs

TypeLocal open-weight
Parameters8.19B total
Architectureqwen3
Context window41K tokens
Knowledge cutoff
Modalitiestext, vision
Recommended backends
Minimum viable rig

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro
BFCL v3 (tool use)
Composite score
Community ratingNo reviews yet

VRAM & disk per quantization

QuantVRAMDiskRAMContext
Q4_K_M6.3 GB4.8 GB41K