This is one of the “smartest” models you can fit on a 24GB GPU now, with no offloading and very little quantization loss. It feels big and insightful, like a better (albeit dry) Llama 3.3 70B with thinking, and with more STEM world knowledge than QwQ 32B, but it comfortably fits thanks to the new exl3 quantization!

[Chart: Quantization Loss]

You need to use a backend that supports exl3, like (at the moment) text-gen-web-ui or (soon) TabbyAPI.
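Once one of those backends is up, it serves an OpenAI-compatible API (TabbyAPI is built around that), so any standard client can talk to it. A minimal sketch; the port, API key, and model name here are assumptions, so swap in whatever your own config uses:

```python
# Minimal sketch: querying an exl3 model through an OpenAI-compatible
# endpoint such as the one TabbyAPI serves. The port (5000), API key,
# and model name are placeholders/assumptions -- adjust to your config.
import requests

resp = requests.post(
    "http://localhost:5000/v1/chat/completions",
    headers={"Authorization": "Bearer YOUR_TABBY_API_KEY"},
    json={
        "model": "your-exl3-model",  # placeholder model name
        "messages": [
            {"role": "user", "content": "Explain exl3 quantization in one paragraph."}
        ],
        "max_tokens": 512,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```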

  • brucethemoose@lemmy.worldOP · 3 days ago
    ^ What was said: it's not supported yet, though theoretically you could give it a shot.

    Basically, exl3 means you can run 32B models entirely on GPU without a ton of quantization loss, if you can get it working on your machine. But exl2/exl3 is less popular, largely because it's PyTorch based and hence more finicky to set up (no single-file GGUFs, no Macs, no easy install, especially on AMD).
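    To put rough numbers on "fits on a 24GB card," here's a back-of-envelope sketch; the bits-per-weight values and the overhead note are illustrative assumptions, not measurements:

    ```python
    # Rough VRAM math for a 32B-parameter model, to show why ~3-4 bpw
    # exl3 quants fit on a 24GB card. Overhead/KV-cache sizes are rough
    # assumptions, not measurements.
    PARAMS = 32e9      # 32B parameters
    GIB = 1024**3

    for bpw in (3.0, 3.5, 4.0, 5.0):
        weights_gib = PARAMS * bpw / 8 / GIB
        print(f"{bpw:.1f} bpw -> ~{weights_gib:.1f} GiB of weights "
              f"(plus a few GiB for KV cache and CUDA overhead)")

    # Approximate output:
    # 3.0 bpw -> ~11.2 GiB, 3.5 bpw -> ~13.0 GiB,
    # 4.0 bpw -> ~14.9 GiB, 5.0 bpw -> ~18.6 GiB
    ```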