
Generally, we humans come up with new things by remixing old ideas. Where else would they come from? We are synthesizing priors into something novel. If you break the problem space apart enough, I don't see why some LLM can't do the same.

LLMs cannot synthesize text; they can only concatenate or mix it statistically. Synthesis requires logical reasoning. That's not how LLMs work.

Yes it is; LLMs perform logical multi-step reasoning all the time, see math proofs, coding, etc. And whether you call it synthesis or statistical mixing is just semantics. Do LLMs truly understand? Who knows, probably not, but they do more than you make it out to be.

Novel problems are usually a composite of simpler and/or older problems that have been solved before. Decomposition means you can rip most novel problems apart and solve the chunks. LLMs do just fine with that.

Claude Code was released just over a year ago; agentic coding came into its own maybe in May or June of last year. Maybe give it a minute?

It’s been a minute and a half and I don’t see any evidence that you can task an agent swarm to produce useful software without your input or review. I’ve seen a few experiments that failed, and I’ve seen manic garbage, but not yet anything useful outside of the agent operator's imagination.

Agent swarms are what, a couple of months old? What are you even talking about? Yes, people/humans still drive this stuff, but if you think there isn't useful software out there that can be handily implemented with current-gen agents with very little or no review, then I don't know what to tell you, apart from "you're mistaken". And I say that as someone who uses these tools heavily but otherwise has no stake in them. The copium in this space is real. Everyone is special and irreplaceable, until another step change pushes them out.

The next thing after agent swarms will be swarm colonies, and people will go "it's been a month since agentic swarm colonies, give it a month or two". People have been moving the goalposts like that for a couple of years now, and it's starting to grow stale. This is self-driving cars, which were going to be working in 2016 and replace 80% of drivers by 2017, all over again. People are falling for hype instead of admitting that while it appears somewhat useful, nobody has any clue if it's 97% useful or just 3% useful, but so far it's looking like the latter.

I generally agree, but counterpoint: Waymo is successfully running robocabs in many cities today.

When does it come to Mumbai?

They're launching in London this year. So... 2035?

The whole point is that an agent swarm doesn’t need a month, supposedly.

We're talking about whether the human users have caught up with usage of tech, not the speed of the tech itself.

No offense, but this reads like LLM output.

Immediately regretted the reply after I looked at the history

The whole account is an LLM slop account. To what end I don’t know, but it has been happening more and more.

I just sent Opus a NYC night satellite view and it described it just as expected. Seems like you have a tooling problem, not a model problem.

Would be curious about your setup; this was mine:

    # Framework imports weren't shown in the original snippet;
    # assuming a LangChain-style create_agent here.
    from langchain.agents import create_agent

    # Agent wired to Opus with a bare-bones system prompt.
    satellite_imagery_analysis_agent = create_agent(
        model="claude-opus-4-6",
        system_prompt="your task is to analyze satellite images",
    )

    # The image is passed as a plain URL inside the user message text.
    response = satellite_imagery_analysis_agent.invoke({
        "messages": [
            {
                "role": "user",
                "content": "What do you see in this satellite image? https://images.unsplash.com/photo-1446776899648-aa78eefe8ed0...",
            }
        ]
    })

With this output:

# Satellite Image Analysis

I can see this image shows an *aerial/satellite view of a coastline*. Here are the key features I can identify:

## Geographic Features
- *Ocean/Sea*: A large body of deep blue water dominates a significant portion of the image
- *Coastline*: A clearly defined boundary between land and water with what appears to be a rugged or natural shoreline
- *Beach/Shore*: Light-colored sandy or rocky coastal areas visible along the water's edge

## Terrain
- *Varied topography*: The land area shows a mix of greens and browns, suggesting:
  - Vegetated areas (green patches)
  - Arid or bare terrain (brown/tan areas)
- *Possible cliffs or elevated terrain* along portions of the coast

## Atmospheric Conditions
- *Cloud cover*: There appear to be some clouds or haze in parts of the image
- Generally clear conditions allowing good visibility of surface features

## Notable Observations
- The color contrast between the *turquoise/shallow nearshore waters* and the *deeper blue offshore waters* suggests varying ocean depths (bathymetry)
- The coastline geometry suggests this could be a *peninsula, island, or prominent headland*
- The landscape appears relatively *semi-arid* based on the vegetation patterns

---

Note: Without precise geolocation metadata, I'm providing a general analysis based on visible features. The image appears to capture a scenic coastal region, possibly in a Mediterranean, subtropical, or tropical climate zone.

Would you like me to focus on any specific aspect of this image?
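
For what it's worth, here is roughly the same request made directly against the Anthropic SDK, with the image passed as an explicit content block rather than as a bare URL in the prompt text (whether a framework fetches such URLs for the model varies by stack). This is a minimal sketch, not the commenter's actual setup; the model name and truncated image URL are copied from the snippet above.

    import anthropic

    client = anthropic.Anthropic()  # assumes ANTHROPIC_API_KEY is set in the environment

    response = client.messages.create(
        model="claude-opus-4-6",  # model name copied from the snippet above
        max_tokens=1024,
        messages=[{
            "role": "user",
            "content": [
                {
                    # URL-based image source; the model receives the image itself,
                    # not just a link in the text.
                    "type": "image",
                    "source": {
                        "type": "url",
                        # URL truncated in the original thread; substitute the full one.
                        "url": "https://images.unsplash.com/photo-1446776899648-aa78eefe8ed0...",
                    },
                },
                {"type": "text", "text": "What do you see in this satellite image?"},
            ],
        }],
    )
    print(response.content[0].text)

If Opus describes the image correctly here but not through the agent framework, that points at the tooling layer rather than the model.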


I don't understand why "it's just predicting words, bro" is still seen as a valuable argument. A LOT has to happen to accurately predict the next word(s) for any given topic.

If that's supposed to be a dismissal, it's not a good one.


Because people think it's "intelligent" since it's manipulating words, and you get people like Andrew Yang and Elon Musk getting one-shotted by it.

Yes, it can solve a lot of things, but an LLM isn't going to put everyone out of work, the thing after the LLM will.


You sound exactly like Andrew Yang, the one you are criticizing, with confident-sounding predictions but no substance.

The burden of proof lies on the side making claims about what AI will do, not the ones denying it.

And even visible in the UI via Ctrl+o.

Sounds to me like a bunch of physical and therefore measurable (and tangible) properties and some placebo effect on top.

LLMs have finally freed me from the shackles of yak shaving. Some dumb inconsequential tooling thing doesn't work? Agent will take care of it in a background session and I can get back to building things I do care about.

I'm finding that in several kinds of projects ranging from spare-time amusements to serious work, LLMs have become useful to me by (1) engaging me in a conversation that elicits thoughts and ideas from me more quickly than I come up with them without the conversation, and (2) pointing me at where I can get answers to technical questions so that I get the research part of my work done more quickly.

Talking with other knowledgeable humans works just as well for the first thing, but suitable other humans are not as readily available all the time as an LLM, and suitably-chosen LLMs do a pretty good job of engaging whatever part of my brain or personality it is that is stimulated through conversation to think inventively.

For the second thing, LLMs can just answer most of the questions I ask, but I don't trust their answers for reasons that we all know very well, so instead I ask them to point me at technical sources as well, and that often gets me information more quickly than I would have by just starting from a relatively uninformed google search (though Google is getting better at doing the same job, too).


It's not that complicated. 4o was RLHF'd to be sycophantic as hell, which was fine until someone had a psychotic episode fueled by it, so they changed it with the next model.

Not just someone, many, many people, going by the feedback on Reddit. People are mourning the damn thing.

Grossly irresponsible to ever release this IMO.

