Back to Quant Hub

DeepSeek-R1-Distill-Qwen-14B

14B

DeepSeek

R1 reasoning distilled into 14B. Huge community interest; excellent chain-of-thought.

47.6K HF downloads236 likesbartowski/DeepSeek-R1-Distill-Qwen-14B-GGUF· stats from 6/24/2026
Consumer GPU

131K

Max Context

3

Quant Variants

EXL2 4.65bpw

Best Quality

98.0%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.8510.2 GB2.8%95 tok/s
EXL24.65bpw4.659.8 GB2.0%128 tok/s
AWQINT449.2 GB3.6%118 tok/s