
StarCoder2 15B

15B

BigCode

BigCode's open code model, trained on 600+ programming languages. Well suited to polyglot development.

Consumer GPU

Max Context: 16K
Quant Variants: 2
Best Quality: GGUF Q4_K_M
Accuracy Retained: 96.8%

Quantization Variants

Per-quant VRAM, quality loss (perplexity increase), and inference speed on an RTX 4090

Format  Level   BPW   VRAM     PPL Loss  Speed
GGUF    Q4_K_M  4.85  10.5 GB  3.2%      92 tok/s
GPTQ    INT4    4     9.2 GB   4.8%      115 tok/s
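
As a rough sanity check on the VRAM column, the size of the quantized weights alone can be estimated as parameters × BPW / 8 bytes. The sketch below is illustrative only: it assumes a nominal parameter count of 15 billion (taken from the "15B" label, not an exact figure) and 1 GB = 1e9 bytes, and the helper name `weight_memory_gb` is hypothetical. Whatever remains after subtracting the weights from the listed VRAM would presumably cover the KV cache, activations, and runtime overhead, though the page does not break that down.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: nominal parameter count from the "15B" label, not an exact figure.
N_PARAMS = 15e9

# BPW and VRAM values copied from the table above.
variants = {
    "GGUF Q4_K_M": {"bpw": 4.85, "vram_gb": 10.5},
    "GPTQ INT4": {"bpw": 4.0, "vram_gb": 9.2},
}

for name, v in variants.items():
    weights = weight_memory_gb(N_PARAMS, v["bpw"])
    remainder = v["vram_gb"] - weights  # listed VRAM not accounted for by weights
    print(f"{name}: weights ~{weights:.2f} GB, remainder ~{remainder:.2f} GB")
```

Under these assumptions the weights account for most of the listed VRAM in both rows, which suggests the table's figures are plausible for short-context inference; longer contexts would likely need more memory for the KV cache.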