When DeepSeek V4 and R2?

pepperfree@sh.itjust.works · 8 days ago

When DeepSeek V4 and R2?

Omega@discuss.online · 8 days ago

I wonder if a good fine tuned model beats every general purpose LLM if you need it for a really specific purpose

laz@lemmy.dbzer0.com · 8 days ago

Yes it does

kata1yst@sh.itjust.works · 8 days ago

Of course, and this is why the new hotness is a Mixture of Experts for one model that is effectively a bunch of experts arguing over the answer, or else on a different scale there’s the Combination of Agents where different specialized agents perform specialized tasks.

pepperfree@sh.itjust.works · 8 days ago

There is new project which they share fine-tuned modernbert on some task. Here is the org https://huggingface.co/adaptive-classifier