rocauc's comments

As someone who works on a platform that users have used to label 1B images, I'm bullish that SAM 3 can automate at least 90% of the work. Data prep is flipped to models being human-assisted instead of humans being model-assisted (see "autolabel" https://blog.roboflow.com/sam3/). I'm optimistic that the majority of users can now start by deploying a model and then curate data, instead of the inverse.


A brief history:

SAM 1 - Visual prompt to create pixel-perfect masks in an image. No video. No class names. No open vocabulary.

SAM 2 - Visual prompting for tracking on images and video. No open vocab.

SAM 3 - Open vocab concept segmentation on images and video.

Roboflow has been long on zero / few shot concept segmentation. We've opened up a research preview exploring a SAM 3 native direction for creating your own model: https://rapid.roboflow.com/


The model supports batch inference, so all prompts are sent to the model, and we parse the results.


I tried it on transparent glass mugs, and it does pretty well. At least better than other available models: https://i.imgur.com/OBfx9JY.png

Curious if you find interesting results - https://playground.roboflow.com


Yes. But also note that redistribution of SAM 3 requires using the same SAM 3 license downstream. So libraries that attempt to, e.g., relicense the model as AGPL are non-compliant.


Yes. It's a custom license with export restrictions and an Acceptable Use Policy that prohibits military use. The custom license permits commercial use.


If this is what's in the consumer space, I'd imagine the government has something much more advanced. It's probably a foregone conclusion that they are recording the entire country (maybe the world) and storing everyone's movements, or are getting close to it.


yes, downdetectorsdowndetectorsdowndetectorsdowndetector is available.



Is there a length limit for domain names? :)


Yes, according to RFC 1035 section 2.3.4 [0], it's 255 octets. Long answer written by a human: https://superuser.com/a/1843870

[0] https://www.rfc-editor.org/rfc/rfc1035#section-2.3.4
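
For anyone who wants to check a name against those limits, here is a rough Python sketch. It assumes the 255 octets are counted in DNS wire format (one length byte per label plus a final zero byte for the root) and also applies the 63-octet per-label limit from the same RFC section. The function names are my own, and it assumes ASCII input (internationalized names would need to be punycode-encoded first).

```python
MAX_NAME_OCTETS = 255   # RFC 1035 section 2.3.4: limit on a full name
MAX_LABEL_OCTETS = 63   # RFC 1035 section 2.3.4: limit on a single label


def split_labels(name: str) -> list[str]:
    """Split a dotted name into labels, ignoring one optional trailing dot."""
    if name.endswith("."):
        name = name[:-1]
    return name.split(".")


def wire_length(name: str) -> int:
    """Octets the name takes in wire format.

    Each label costs one length byte plus its own bytes, and the name
    ends with a single zero byte for the root label.
    """
    labels = split_labels(name)
    return sum(1 + len(label.encode("ascii")) for label in labels) + 1


def fits_rfc1035(name: str) -> bool:
    """True if every label and the whole name are within the RFC limits."""
    labels = split_labels(name)
    for label in labels:
        if not 1 <= len(label.encode("ascii")) <= MAX_LABEL_OCTETS:
            return False
    return wire_length(name) <= MAX_NAME_OCTETS


if __name__ == "__main__":
    name = "downdetectorsdowndetectorsdowndetectorsdowndetector.com"
    print(name, wire_length(name), fits_rfc1035(name))
```

Counted this way, the longest dotted name you can actually type (without the trailing dot) is 253 characters, since the leading length byte and the root byte are not visible in the text form.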


i've reached semantic satiation


Time for updetector.com! (On the plus side, this could detect if itself was up!)


The bike lane compliant vehicle category is exciting. Infinite Machine (infinitemachine.com) made me aware of this category with their Olto model, which is at a (surprisingly) superior price point.


Not nearly enough gradient for a vibe coded site :)


One of the most common uses for edge AI not listed in this course is computer vision. You similarly want real-time inference for processing video. Another open source project that makes it easy to use SOTA vision models on the edge is inference: https://github.com/roboflow/inference

