conradkay's comments

Top LLMs are still very bad at poker; see this breakdown of a recent Kaggle experiment: https://www.youtube.com/watch?v=jyv1bv7JKIQ

What do you mean by sweep training here?


I think in the past it was more obvious. Rails switched to SQLite as the default somewhat recently


Yeah, that's the one prominent example but, like you said, also just rather recently. Since "the network is slow, duh" has always been true, I wonder why.


My guess would be that performance improvements (mostly hardware, from Moore's law and the proliferation of SSDs, but also SQLite itself) mean far fewer websites need to run on more than one computer, and most are fine on a $5/month VPS.

Stuff like https://litestream.io/ and SQLite adding STRICT tables probably helped too.



I'd imagine that's it. With WAL you can probably hit >1000 writes a second
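For anyone who wants to check that number on their own box, here's a rough sketch using Python's built-in sqlite3 module. The table name, row payload, and the `measure_wal_writes` helper are made up for illustration, and `synchronous=NORMAL` is my assumption about a typical WAL setup rather than anything claimed above. It commits after every insert, so the rate it reports is a pessimistic one and will vary a lot with disk and OS.

```python
import os
import sqlite3
import tempfile
import time


def measure_wal_writes(n_writes: int = 2000) -> tuple[str, float]:
    """Insert n_writes rows with one commit each; return (journal_mode, writes_per_sec)."""
    # WAL needs a real file; an in-memory database reports "memory" instead.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "bench.db")
        conn = sqlite3.connect(path)
        try:
            # The PRAGMA returns the journal mode that is actually in effect.
            mode = conn.execute("PRAGMA journal_mode=WAL").fetchone()[0]
            # Assumption: NORMAL is a common pairing with WAL (fewer fsyncs per commit).
            conn.execute("PRAGMA synchronous=NORMAL")
            conn.execute(
                "CREATE TABLE events (id INTEGER PRIMARY KEY, payload TEXT)"
            )
            conn.commit()

            start = time.perf_counter()
            for i in range(n_writes):
                conn.execute(
                    "INSERT INTO events (payload) VALUES (?)", (f"row {i}",)
                )
                conn.commit()  # one transaction per write, the worst case
            elapsed = time.perf_counter() - start
        finally:
            conn.close()
        return mode, n_writes / elapsed


if __name__ == "__main__":
    mode, rate = measure_wal_writes()
    print(f"journal_mode={mode}, about {rate:.0f} single-row commits per second")
```

Batching several inserts into one transaction should push the rate much higher, but the per-commit figure is the fairer comparison for a typical web request.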


Electron apps are usually 150-300MB


Sam Altman posted a comparison to Gemini 3 and Opus 4.5

https://x.com/sama/status/1999185784012947900


I see, thanks for this.


They mean it used to be $15 per million input tokens and $75 per million output tokens


I wonder how much of it was just a https://en.wikipedia.org/wiki/Clever_Hans situation


Almost all of it was exactly this.

That's why it's never been done again. Because it was never done in the first place.


And at least in Hans' case, the general public could instantly verify the result, as opposed to having it interpreted by a biased handler.


It's a 200 billion dollar company, roughly what Anthropic is raising at


I don't see that much reason to be skeptical since this basically lines up with the trend we've been seeing in their performance.


Good article from 2023, not much data though if that's what you're looking for:

https://nymag.com/intelligencer/article/ai-artificial-intell...

unwalled: https://archive.ph/Z6t35

Generally seems similar today, just on a bigger scale and with much more focus on coding

Here in the US, DataAnnotation seems to be the most heavily marketed company offering these jobs

