Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Groq is claiming 284 tokens/second on Llama 3.1 70b, so they’re in the same ballpark.

https://groq.com/12-hours-later-groq-is-running-llama-3-inst...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: