Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think a Model-specific SemVer needs to be created to be clearer as to what degree of change has taken place, in the age of model weights.

Something that distinguishes between a completely new pre-training process/architecture, and standard RLHF cycles/optimizations.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: