Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Are there general LLMs that can translate text 'on image', as opposed to spitting out the translation as text? I feel like it would be easier to reason about in images that are very text-heavy like some of these diagrams, but from what I remember, last time I tried, chatgpt and claude would only give me a text translation


In this specific case , it is a svg, so you can ask to translate the svg source.


Google Lens does exactly that.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: