Skip to content

GTPQModel v3.0.0

Latest
Compare
Choose a tag to compare
@Qubitium Qubitium released this 14 Apr 14:15
· 76 commits to main since this release
a0c7753

🎉 New ground-breaking GPTQ v2 quantization option for improved model quantization accuracy validated by GSM8K_PLATINUM benchmarks vs original gptq.
✨ New Phi4-MultiModal model support.
✨ New Nvidia Nemotron Ultra model support.
✨ New Dream model support. New experimental multi-gpu quantization support. Reduced vram usage. Faster quantization.

What's Changed

New Contributors

Full Changelog: v2.2.0...v3.0.0