
The 13B and 30B models run quite well on a 4090 at 4-bit quantization.


Ah dang, I missed that I was still using 8-bit mode. I'll look into that, thanks!
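
For anyone wondering why the 8-bit vs 4-bit switch matters so much here, a rough sketch of the arithmetic follows. It assumes the nominal parameter counts (13 and 30 billion), counts only the weights (no activations, KV cache, or quantization metadata), and uses the 4090's 24 GB of VRAM; the helper names are made up for illustration. Real usage will be somewhat higher than these figures.

```python
# Back-of-the-envelope estimate of weight memory only.
# Assumptions: nominal parameter counts, no runtime overhead.
GIB = 2**30
VRAM_GIB = 24  # RTX 4090 memory capacity


def weight_gib(params_billion: float, bits: int) -> float:
    """Approximate size of the weights in GiB at the given bit width."""
    return params_billion * 1e9 * bits / 8 / GIB


def fits(params_billion: float, bits: int, vram_gib: float = VRAM_GIB) -> bool:
    """True if the weights alone are smaller than the available VRAM."""
    return weight_gib(params_billion, bits) < vram_gib


if __name__ == "__main__":
    for params in (13, 30):
        for bits in (4, 8):
            size = weight_gib(params, bits)
            verdict = "fits" if fits(params, bits) else "does not fit"
            print(f"{params}B @ {bits}-bit: {size:.1f} GiB ({verdict})")
```

Under these assumptions the 30B weights come to about 14 GiB at 4-bit but about 28 GiB at 8-bit, which is more than the card has, so 8-bit mode would need offloading for that size.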



