Ornith-1.0-35B Q3_K_M: 17 GB VRAM, all benchmarks pass, extreme quantization holds up
Ornith-1.0-35B has been quantized to Q3_K_M, coming in at 16.8 GB on disk and ~17 GiB of VRAM once loaded. It was validated with KL divergence probes and passes the behavior suite 14/14; it loses only 16 points of top-1 agreement vs Q6_K while halving memory usage. Single-GPU th...
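
The post does not say how the KL divergence probes or the top-1 agreement figure were computed. As a rough sketch of what such a comparison usually looks like, assuming you have per-token logits from the reference quantization (Q6_K here) and the test quantization (Q3_K_M) on the same prompts, the two metrics could be computed as below. The function names and the toy logits are hypothetical and are not taken from the original evaluation.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one token position."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def kl_divergence(ref_logits, test_logits):
    """KL(P_ref || P_test) in nats for a single token position."""
    log_p = log_softmax(ref_logits)
    log_q = log_softmax(test_logits)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))

def argmax(values):
    """Index of the largest value (first one wins on ties)."""
    return max(range(len(values)), key=values.__getitem__)

def compare(ref_rows, test_rows):
    """Return (mean KL, top-1 agreement fraction) across token positions.

    ref_rows / test_rows: lists of logit vectors, one per token position,
    produced by the reference and test models on identical input.
    """
    if len(ref_rows) != len(test_rows) or not ref_rows:
        raise ValueError("need the same non-zero number of positions")
    n = len(ref_rows)
    mean_kl = sum(kl_divergence(r, t) for r, t in zip(ref_rows, test_rows)) / n
    agreement = sum(argmax(r) == argmax(t) for r, t in zip(ref_rows, test_rows)) / n
    return mean_kl, agreement

if __name__ == "__main__":
    # Toy logits (hypothetical): the second position flips its top token.
    ref = [[2.0, 1.0, 0.0], [0.0, 3.0, 1.0]]
    test = [[2.0, 1.0, 0.0], [3.0, 0.0, 1.0]]
    mean_kl, agreement = compare(ref, test)
    print(f"mean KL: {mean_kl:.4f} nats, top-1 agreement: {agreement:.0%}")
```

Under this reading, a 16-point drop in top-1 agreement would mean the fraction of positions where both quantizations pick the same most-likely token is 16 percentage points lower than the baseline figure; whether the original numbers were measured this way is an assumption.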