Watch precision drop from bf16 to int4: the model shrinks, the lights stay on. Near-bf16 quality, half the disk, fully local.
ollama pull gemma4:e2b-it-qat
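
Once the pull finishes, a quick way to poke at the model from code is Ollama's local HTTP API. The sketch below is a minimal example, not the only way to do it: it assumes the Ollama server is running on its default port (11434) and that the tag above is installed. The helper names `build_request` and `ask` are made up for illustration.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"
# The tag pulled above.
MODEL = "gemma4:e2b-it-qat"


def build_request(prompt: str, model: str = MODEL) -> urllib.request.Request:
    """Build a non-streaming /api/generate request (hypothetical helper)."""
    body = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    return urllib.request.Request(
        OLLAMA_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str) -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

With the server up, `print(ask("Explain int4 quantization in one sentence."))` prints the reply. Nothing leaves your machine, since the request only goes to localhost.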