Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Looking forward to your progress! Just checked the paper and it says the underlying backbone is still DETR. My guess would be that SAM3 uses more video frames during the training process and caused the dilution of sparse engineering-paper-like data.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: