Niche Model of the Day: Nemotron 49B 3bpw exl3

brucethemoose@lemmy.world · edit-2 3 days ago

Niche Model of the Day: Nemotron 49B 3bpw exl3

brucethemoose@lemmy.world · edit-2 3 days ago

That, and exl2 has ROCm support.

There was always the bugaboo of uttering a prayer to get rocm flash attention working (come on, AMD…), but exl3 has plans to switch to flashinfer, which should eliminate that issue.

Niche Model of the Day: Nemotron 49B 3bpw exl3

Niche Model of the Day: Nemotron 49B 3bpw exl3

turboderp/Llama-3.3-Nemotron-Super-49B-v1-exl3 at 3.0bpw