Qwen3-Coder 30B-A3B Instruct

30B-A3B

Alibaba Qwen3

Agentic coding MoE with 3.3B active parameters and a 256K (262,144-token) native context. Top open coder for 16–24 GB cards.

2.1M HF downloads · 1,123 likes · Qwen/Qwen3-Coder-30B-A3B-Instruct · stats from 6/25/2026
Consumer GPU · Mac / Apple Silicon

Max Context: 262K
Quant Variants: 3
Best Quality: GGUF Q5_K_M
Accuracy Retained: 99.0%

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

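| Format | Level | Bits per weight (BPW) | VRAM | Perplexity (PPL) loss | Speed |
|--------|--------|------|---------|------|-----------|
| GGUF | Q4_K_M | 4.85 | 19.2 GB | 2.3% | 92 tok/s |
| GGUF | Q5_K_M | 5.68 | 22.0 GB | 1.0% | 80 tok/s |
| AWQ | INT4 | 4 | 17.8 GB | 3.2% | 115 tok/s |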
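The table figures can be sanity-checked with a little arithmetic. The Python sketch below is illustrative only. It estimates the weights-only footprint as parameters × BPW / 8 and picks the lowest-PPL-loss variant that fits a given VRAM budget. The 30.5B total parameter count is an assumption taken from the upstream model card, not from this page. The names `Quant`, `weights_only_gb`, and `best_fit` are hypothetical helpers, not part of any library.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: ~30.5B total parameters (from the upstream model card, not this page).
TOTAL_PARAMS = 30.5e9


@dataclass(frozen=True)
class Quant:
    fmt: str
    level: str
    bpw: float           # bits per weight
    vram_gb: float       # VRAM as listed for the RTX 4090
    ppl_loss_pct: float  # perplexity loss, percent
    tok_per_s: float     # listed inference speed


# Rows copied from the quantization table above.
QUANTS = [
    Quant("GGUF", "Q4_K_M", 4.85, 19.2, 2.3, 92),
    Quant("GGUF", "Q5_K_M", 5.68, 22.0, 1.0, 80),
    Quant("AWQ", "INT4", 4.0, 17.8, 3.2, 115),
]


def weights_only_gb(bpw: float, params: float = TOTAL_PARAMS) -> float:
    """Weights-only size in decimal GB: params * bits per weight / 8 bits per byte."""
    return params * bpw / 8 / 1e9


def best_fit(budget_gb: float, quants=QUANTS) -> Optional[Quant]:
    """Return the lowest-PPL-loss variant whose listed VRAM fits the budget, else None."""
    fitting = [q for q in quants if q.vram_gb <= budget_gb]
    return min(fitting, key=lambda q: q.ppl_loss_pct) if fitting else None


if __name__ == "__main__":
    for q in QUANTS:
        est = weights_only_gb(q.bpw)
        print(f"{q.fmt} {q.level}: weights ~{est:.1f} GB, listed {q.vram_gb} GB")
    for budget in (16, 20, 24):
        pick = best_fit(budget)
        print(f"{budget} GB ->", pick.level if pick else "no listed variant fits")
```

With a 24 GB budget the sketch selects Q5_K_M, which agrees with the "Best Quality" tile. The listed VRAM figures sit above the weights-only estimates; the gap is presumably runtime overhead such as KV cache and activations, which this page does not break down.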