DeepSeek-R1

671B MoE

DeepSeek

DeepSeek-R1 reasoning model built on V3 MoE. Chain-of-thought at frontier scale — use distill variants for local GPUs.

7.2M HF downloads13414 likesdeepseek-ai/DeepSeek-R1· stats from 6/25/2026
Pro GPU

164K

Max Context

2

Quant Variants

GGUF Q4_K_M

Best Quality

98.2%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.85385.0 GB1.8%4 tok/s
CalcHF
GGUFQ3_K_M3.87310.0 GB4.2%5 tok/s
CalcHF