Hacker Newsnew | past | comments | ask | show | jobs | submit | adw's commentslogin

The quality of the error messages matters a _lot_ (agents read those too!) and Python is particularly good there.

Especially since Python 3.14 shipped big improvements to error messages: https://docs.python.org/3/whatsnew/3.14.html#whatsnew314-imp...

> to the extent that our systems' world models are effectively indistinguishable from the real world.

https://genius.com/Jorge-luis-borges-on-exactitude-in-scienc...


> But is this "stochastic nature" inherent to the LLM?

At any kind of reasonable scale, yes. CUDA accelerators, like most distributed systems, are nondeterministic, even at zero temperature (which you don't want) with fixed seed.


> The issues starts when the software starts living longer

There's going to be a bifurcation; caricaturing it, "operating system kernels" and "disposable code". In the latter case, you don't maintain it; you dispose of it and vibe-code up a new one.


What this is saying is "you need an objective criterion you can use as a success metric" (aka a verifiable reward in RL terms). "Design of verifiers" is a specific form of domain expertise.

This applies to exploits, but it applies _extremely_ generally.

The increased interest in TLA+, Lean, etc comes from the same place; these are languages which are well suited to expressing deterministic success criteria, and it appears that (for a very wide range of problems across the whole of software) given a clear enough, verifiable enough objective, you can point the money cannon at it until the problem is solved.

The economic consequences of that are going to be very interesting indeed.


Property tax valuation.


More like a 2001 Renault Clio. Camrys are already bloatware.


Switching tools is _very easy_.


The Prime credit card is Chase.


Thanks for the clarification. I had forgotten the distinction.


Inside scoop: the pub group who owned that pub (still going, owns four in Cambridge and environs) was cofounded by Steve Early, a Cambridge computer scientist who wrote his own POS software, so it was very much a case of "yeah, that sounds like fun, I'll add it". (Until tax and primary rate risk made it not fun, so it was removed.)

The POS software's on GitHub: https://github.com/sde1000/quicktill


That's brilliant insight, thank you. I enjoyed reading Steve's extensive read me.

Also I'm planning a trip to Cambridge so I've bookmarked one of the pubs for a visit.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: