Too big
#1
by ShunAonuma - opened
Why is it a 9 GB model after 4-bit quantization?
The 4-bit GGUF versions (like Q4_K_M) are about 5 GB.
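A quick back-of-envelope check can show why a "4-bit" file might still be larger than expected. The sketch below estimates file size as parameters times bits per weight; the 8B parameter count and the average bits-per-weight figures are assumptions for illustration, not values confirmed for this model.

```python
# Rough estimate of a quantized model's file size, ignoring metadata overhead.
def quantized_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size in GB: parameters * bits / 8 bits-per-byte / 1e9."""
    return n_params * bits_per_weight / 8 / 1e9

# Q4_K_M mixes 4- and 6-bit blocks, so it averages closer to ~4.8 bits/weight
# than a flat 4. Assuming a hypothetical 8B-parameter model:
print(round(quantized_size_gb(8e9, 4.8), 1))  # ~4.8 GB, near the ~5 GB GGUF
print(round(quantized_size_gb(8e9, 9.0), 1))  # a 9 GB file implies ~9 bits/weight
```

If a download is ~9 GB, it is likely not a true 4-bit artifact (e.g. it may be an 8-bit or fp16 variant, or include unquantized layers), since genuine 4-bit GGUFs of this size class land around 5 GB.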