
Yes, you are totally correct, but I believe this term is omitted from the cross-entropy loss function used in machine learning because it is a constant with respect to the model parameters, so it does not contribute to the optimization.
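A rough sketch of the argument, assuming the term in question is the entropy of the target distribution, H(p), in the identity KL(p || q) = H(p, q) - H(p). The function names and the example numbers are made up for illustration. The check is that the gradient with respect to the logits is the same whether or not H(p) is subtracted:

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(p, logits):
    # H(p, q) = -sum_i p_i * log(q_i), with q = softmax(logits).
    q = softmax(logits)
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q) if pi > 0)

def entropy(p):
    # H(p) = -sum_i p_i * log(p_i); depends only on the targets.
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def kl_divergence(p, logits):
    # KL(p || q) = H(p, q) - H(p).
    return cross_entropy(p, logits) - entropy(p)

def numeric_grad(loss, p, logits, eps=1e-6):
    # Central finite differences with respect to each logit.
    grads = []
    for i in range(len(logits)):
        up = list(logits)
        up[i] += eps
        down = list(logits)
        down[i] -= eps
        grads.append((loss(p, up) - loss(p, down)) / (2 * eps))
    return grads

# Hypothetical target distribution and model logits.
p = [0.7, 0.2, 0.1]
logits = [0.5, -1.0, 2.0]

grad_ce = numeric_grad(cross_entropy, p, logits)
grad_kl = numeric_grad(kl_divergence, p, logits)
```

Under that assumption, `grad_ce` and `grad_kl` agree up to floating-point error, and both match the usual `softmax(logits) - p`. The two losses differ only by a value that does not depend on the logits, so they have the same minimizer.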

Please correct me if I'm wrong.


