Qwen3 235B-A22B Instruct

235B-A22B

Alibaba Qwen3

Qwen3 flagship MoE (22B active / 235B total). Q4_K_M ~142GB; rivals DeepSeek-R1 class models.

692 HF downloads10 likesQwen/Qwen3-235B-A22B-GGUF· stats from 6/25/2026
Pro GPU

41K

Max Context

3

Quant Variants

GGUF Q4_K_M

Best Quality

97.8%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.85142.0 GB2.2%10 tok/s
CalcHF
GGUFQ3_K_M3.87115.0 GB4.8%12 tok/s
CalcHF
AWQINT44125.0 GB3.2%14 tok/s
CalcHF