Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> it technically requires less GPU processing to run

Not when you have to scale. There's a reason why every LLM SaaS aggressively rate limits and even then still experiences regular outages.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: