Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

MoEs have a lot of technical complexity and aren't well supported in the open source world. We plan to release a MoE soon(ish).

I do think that MoEs are clearly the future. I think we will release more MoEs moving forward once we have the tech in place to do so efficiently. For all use cases except local usage, I think that MoEs are clearly superior to dense models.



Even local, MoE are just so much faster, and they let you pick a large/less quantized model and still get a useful speed.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: