Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Repetition penalty is a matter of, generate a token, then multiply that logit by the penalty. (If the logit is negative, divide instead of multiply.)

https://github.com/shawwn/llama has an implementation (check the commit history).



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: