Hacker News | lewdwig's comments

The standard skeptical position (“LLMs have no theory of mind”) assumes a single unified self that either does or doesn’t model other minds. But this paper suggests models have access to a space of potential personas which they traverse based on conversational dynamics, and that steering away from the default persona increases the model’s tendency to identify as other entities. So it’s less “no theory of mind” and more “too many potential minds, insufficiently anchored.”

A language that has not reached 1.0 and has repeatedly changed its IO implementation in non-backwards-compatible ways is certainly a courageous choice for production code.


So, I'm noodling around with writing a borrow checker for Zig, and while you don't get to appreciate this working with Zig at a day-to-day level, the internals of the Zig compiler are AMAZING. Also, the IO refactor will (I think) let me implement aliasing checking (alias xor mutable).
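
A toy sketch of the alias-xor-mutable rule, in Python rather than Zig for brevity; the names and the flat list-of-borrows representation are made up for illustration, and a real checker would run over the compiler's IR and track live ranges rather than a snapshot like this.

    from collections import defaultdict

    def check_alias_xor_mutable(borrows):
        """borrows: list of (value, kind) pairs live at the same point,
        with kind either "shared" or "mut"."""
        by_value = defaultdict(list)
        for value, kind in borrows:
            by_value[value].append(kind)

        errors = []
        for value, kinds in by_value.items():
            muts = kinds.count("mut")
            shared = kinds.count("shared")
            # The rule: many shared aliases OR exactly one mutable alias, never both.
            if muts > 1 or (muts == 1 and shared > 0):
                errors.append(f"{value}: {muts} mutable and {shared} shared aliases overlap")
        return errors

    # One mutable and one shared alias of `buf` alive at once -> flagged.
    print(check_alias_xor_mutable([("buf", "mut"), ("buf", "shared"), ("tmp", "shared")]))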


In my experience, migrating small-scale projects takes from minutes to single digit hours.

The standard library is changing. The core language semantics - not so much. You can update from std.ArrayListUnmanaged to std.array_list.Aligned with two greps.
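
For what it's worth, the mechanical part of that kind of rename is a few lines of script. A sketch, assuming the rename really is a straight identifier swap (whether std.ArrayListUnmanaged maps one-for-one onto std.array_list.Aligned depends on the Zig release you're targeting):

    from pathlib import Path

    OLD = "std.ArrayListUnmanaged"
    NEW = "std.array_list.Aligned"

    # Rewrite every Zig source file under src/ that still uses the old name.
    for path in Path("src").rglob("*.zig"):
        text = path.read_text()
        if OLD in text:
            path.write_text(text.replace(OLD, NEW))
            print(f"updated {path}")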


Right? People must really like the design choices in Zig to do that instead of choosing another language. It's very interesting just because of that.


It's certainly not a choice I would have made, but there's sufficient precedent for it now (TigerBeetle, Ghostty, etc) that I can understand it.


also Bun


also Roc


This one is far from prod-ready, however.


the upside is absolutely worth it


One thing that seems to mark most nerds is a tendency towards being utopian about tech in general but deeply sceptical of specific tech.


I don’t hate git either, but you’ll meet very few people who will claim its UX is optimal. JJ’s interaction model is much simpler than git’s, and the difficulty I’ve found is that the better you know git, the harder it is to unlearn all its quirks.


To Broadcom you’re not a customer; you’re a mark, a patsy, a stooge, a _victim_. Their aim is to establish exactly what they can get away with, how far they can abuse you before you’ll just walk away.


But this is where all/most “platforms” go. As the product offering flounders over time, your quality talent (engineering and business) boils off to other opportunities. Then the short term value extraction methodologies show up, and everyone looks on in horror as the platform is “destroyed” through “mismanaged” consumer relationships.

Working in agtech, I’ve always wondered if this isn’t just the disenfranchised farmer story.

Give a farmer 1,000 acres to farm, and if they’re playing the long game, they’ll intermix their high value crops with responsible crop rotations. Managed well, this business can go on indefinitely.

But tell them they have 5 years left to farm the ground and that the land will be of no value after that, and they’ll grow the most expensive crop they can every year, soil quality be damned. It makes the most sense from a value-extraction point of view.

Broadcom seems to be the kind of farmer that buys up forsaken land and extracts as much value as possible before it finally fails.


I have noticed that LLMs are actually pretty decent at redteaming code, so I’ve made it a habit of periodically getting them to do that for the code they generate. A good loop is: (a) generate code, (b) add test coverage for the code (to 70-80%), (c) redteam the code for possible performance/security concerns, (d) add regression tests for the issues uncovered and then fix the code.
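
A minimal sketch of that loop, assuming a hypothetical run_agent() helper that wraps whatever coding agent you use and a hypothetical coverage_percent() that parses your coverage tool's output; the prompts and the 75% threshold are illustrative only.

    def run_agent(prompt: str) -> str:
        """Hypothetical helper: send a prompt to your coding agent, return its reply."""
        raise NotImplementedError("wire this up to the agent/CLI of your choice")

    def coverage_percent() -> float:
        """Hypothetical helper: run the tests under coverage, return the total %."""
        raise NotImplementedError("e.g. parse your coverage tool's report here")

    def review_loop(feature: str, target_coverage: float = 75.0) -> None:
        # (a) generate the code
        run_agent(f"Implement the following feature: {feature}")

        # (b) add tests until coverage lands in the 70-80% range
        while coverage_percent() < target_coverage:
            run_agent("Add unit tests for the code you just wrote, "
                      "focusing on untested branches.")

        # (c) redteam: ask the model to attack its own output
        findings = run_agent(
            "Review the new code as an adversary: list concrete performance "
            "and security problems, worst first."
        )

        # (d) regression tests for each finding, then fix until they pass
        run_agent(
            "For each of these findings, add a failing regression test, then change "
            f"the code until it passes:\n{findings}"
        )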


The glaring thing most people seem to miss is that LLM-generated code is like a ToS: unless you work in a more enterprise team setting, you are not going to catch 90% of the issues...

If this had been used before the release behind the Tea spill fiasco, to name only one, it would never have been a fiasco. Just saying...


I’m sure this’ll be misreported and wilfully misinterpreted because of the current fractious state of the AI discourse. But the lawsuit was to do with piracy, not the copyright compliance of LLMs, and in any case they settled out of court, thus presumably admitting no wrongdoing, so conveniently no legal precedent is established either way.

I would not be surprised if investors made their last round of funding contingent on settling this matter out of court precisely to ensure no precedents are set.


TBH I’m surprised it’s taken them this long to change their mind on this, because I find it incredibly frustrating to know that current gen agentic coding systems are incapable of actually learning anything from their interactions with me - especially when they make the same stupid mistakes over and over.


Okay, they're not going to be learning in real time. It's not like you're having your data taken and then getting something out of it - you're not. What you're talking about is context.

Data gathered for training still has to be used in training, i.e. in a new model that, presumably, takes months to develop and train.

Not to mention your drop-in-the-bucket contribution will have next to no influence in the next model. It won't catch things specific to YOUR workflow, just common stuff across many users.


> Not to mention your drop-in-the-bucket contribution will have next to no influence in the next model. It won't catch things specific to YOUR workflow, just common stuff across many users.

I wonder about this. In the future, if I correct Claude when it makes fundamental mistakes about some topic like an exotic programming language, wouldn't those corrections be very valuable? It seems like it should consider the signal to noise ratio in these cases (where there are few external resources for it to mine) to be quite high and factor that in during its next training cycle.


They wouldn’t be able to learn much from interactions anyway.

The learning metric won’t be you; it will be some global, shitty metric that will make the service mediocre over time.


Or get more value from the users with the same subscription price. I doubt they are giving any discounts.


It's actually pretty clever (albeit shitty/borderline evil): start off by saying you're different from the competitors because you care a lot about privacy and safety, and that's why you're charging higher prices than the rest. Then, once you have a solid user base, slowly turn up the heat, step by step, so you end up with higher prices yet the same benefits as the competitors.


With code, I’m much more interested in it being correct and good rather than creative or novel. I see it as my job to be the arbiter of taste, because the models are equally happy to create code I’d consider excellent or terrible on command.


There are nascent signs of emergent world models in current LLMs; the problem is that they decohere very quickly because the models lack any kind of hierarchical long-term memory.

A lot of the structurally important things the model knows about your code get lost whenever the context gets compressed.

Solving this problem will mark the next big leap in agentic coding, I think.

