Hacker News

Right now I get 59 tok/sec on GPT-OSS 120B using Unsloth's dynamic 4-bit quants, via llama.cpp https://news.ycombinator.com/item?id=45881049

