Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The hallucination part is somewhat solvable, but the inference cost is huge. Especially in the chat format where you want long history, you have to pay for all the previous tokens in each round of dialogue. The cost is about $0.01 per page, so it can become $0.10 per reply after 10 pages of history accumulated. Even short lines of text have the same overhead.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: