Microsoft just released BitNet! (github.com)
thickertoofan@lemm.ee to LocalLLaMA@sh.itjust.works · English · 6 days ago · 6 comments
hendrik@palaver.p3x.de · English · 6 days ago
Nice. Any additional info on how difficult it was to train this and whether we can expect more? They have a 3B model in the demo video, but it doesn’t seem like they released that… I mean, I’d like something a bit larger.