I was wondering how they'd casually veer into social media and leverage their intelligence in a way that connects with the user. Like everyone else ITT, it seems like an incredibly sticky idea that leaves me feeling highly unsettled about individuals building any sense of deep emotions around ChatGPT.
My question is: given that Apple is one of the most valuable companies on the planet, they can (and surely have) hired some of the best designers in the world. Articles like this one and many others are saying what we all think, and every time a new beta comes out, it's strange to see some of the decisions that were made. The first beta made the lock screen _very hard_ to read if you had notifications. How was that missed? Or: keep Liquid Glass, but don't make the text bright blue so that it's hard to see. Or: trigger the frosted effect depending on what the background is.

I sincerely do find designers to be in a hard position (especially having worked directly with so many of them in the past), but a lot of these seem like novice mistakes. Maybe it's not even the design, it's the QA? I'm not even sure here. I'm by no means a designer, but I have to believe they've been testing this internally as much as we are, and for a long time now... I'd like to believe they aren't just changing UI elements on the fly based on what X / Twitter feels is good or bad.
Two theories. One is that Apple had to put something together quickly as a headliner because Apple Intelligence was clearly going to be a dud, so this is basically a hacked-together panic project.
Or someone high up has a Vision™, and they're so set on that Vision™ they're not listening to what underlings and users are saying.
Consider a parallel reality in which Apple did the next round of updates as a maintenance release and added some minor new features and UI tweaks. Would that have been a more positive outcome for the company?
My guess is there would have been some grumbling about not having anything new to offer, but also relief that bugs were being fixed. It would have been a bit of a non-event.
This seems more like a seismic negative event, with a lot of criticism from all quarters. (And some stanning, but less than usual.)
> My guess is there would have been some grumbling about not having anything new to offer, but also relief that bugs were being fixed. It would have been a bit of a non-event.
Depending on what Google has to say about Pixel & Gemini in August, I think it would have been much more than grumbling. Apple is in a damned-if-they-do, damned-if-they-don't situation. Under the surface of Liquid Glass, there really isn't anything new coming unless they have some hardware-limited features planned for the iPhone 17 launch.
It's clear this "redesign" was, as you said, a panic project to cover for not delivering on AI for a second year and having nothing to show at WWDC. Just coming out with "we fixed some bugs" would cause a PR shitstorm. Even more so if Google gets any further ahead integrating Gemini into Pixel with personal context, like what Apple wanted to achieve with Siri/AI, plus their own redesign (Material 3 Expressive, which is actually looking really nice IMO).
> This seems more like a seismic negative event, with a lot of criticism from all quarters.
Except from normal users/non-enthusiasts. My kid and her friends all installed the dev beta and are absolutely enamored with Liquid Glass and think it's the coolest thing ever. Mind you, these are generations of folks who weren't around for Vista/7 Aero, etc., and are now obsessed with that era from a fashion and design POV. "Frutiger Aero aesthetic" and all that. These are also people who would never switch platforms no matter what Apple does, because of iMessage and social status/social pressure, so Apple is in no danger of losing any marketshare over this unless Google/Android somehow becomes "cool" again and can generate enough social pressure amongst the youth.
My wife is emphatically not a tech enthusiast. She hates what she's seen on the screenshots and demos so far, and is dreading the moment when it's out and she'll have to update.
> It's clear this "redesign" was as you said, a panic project to cover for not delivering on AI, again for a second year and having nothing to show for WWDC.
Hm... So is their current system universally regarded as absolute shit, or what? Or does everyone[1] think it's pretty great now, but will switch to "it's shit!" immediately as of the WWDC?
Like, WTF is wrong with "We have a great system, it's still just as great, and even better now that we've worked mostly on stability and bugfixes."?
Are corporations nowadays all freaking Cinderella, or what?
___
[1]: Well, everyone who would consider buying into the Apple ecosystem.
Has to be. It has that Musky smell of banning yellow safety paint i.e. too stupid to be a team effort.
Legibility issues with translucency are such a basic thing, and I expect Apple designers have gone deep on the topic, e.g. mathematical models using human colour perception to determine hard limits for different type weights. I don't think the heavy frosting in past versions was an accident.
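One public example of that kind of model is the WCAG 2.x relative-luminance and contrast-ratio formula, sketched below in Python. (This is an illustration of the general idea; whatever internal models Apple may use are not public.)

```python
# WCAG 2.x contrast ratio between a foreground and background colour.

def _channel(c: int) -> float:
    """Convert one sRGB channel (0-255) to linear light."""
    c = c / 255.0
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4

def luminance(rgb) -> float:
    """Relative luminance of an sRGB colour, 0.0 (black) to 1.0 (white)."""
    r, g, b = (_channel(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b

def contrast_ratio(fg, bg) -> float:
    """Contrast ratio from 1:1 (identical) up to 21:1 (black on white)."""
    l1, l2 = sorted((luminance(fg), luminance(bg)), reverse=True)
    return (l1 + 0.05) / (l2 + 0.05)

# WCAG AA asks for at least 4.5:1 for body text; black on white is 21:1.
print(round(contrast_ratio((0, 0, 0), (255, 255, 255)), 1))  # 21.0
```

A translucent material effectively makes the background colour unpredictable, which is why a ratio like this has to be checked against the worst-case backdrop, not just one wallpaper.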
But form over function is the core of why Apple is such a hugely successful company with just a few products. Focus on emotions rather than technical aspects; design over usability; fewer choices for users (just compare how much you can tweak in Android vs iOS). Removal of buttons, the 3.5mm jack, the SIM card, removable batteries, and so on and on, just in the phone area.
You may not like it (I certainly don't), but it's extremely well-received behavior. Humans are mostly emotional beings; just look at politics if you think otherwise.
I understand betas very well, but something as critical as that seems more fitting for an alpha. Liquid glass notifications on top of a bright wallpaper, bleeding together so you couldn't read or see anything shouldn't be in a beta.
The initial beta design had so many obvious issues that it's wild that it made it as far as it did. Hell, the readability of many UI elements was obviously terrible in the initial reveal, where you'd expect everything to be shown in the best possible light.
Obviously Apple can improve things for the final release (and it seems like they're taking some steps in that direction). But these issues should have been identified long before the beta was released, and the fact that they weren't does not inspire confidence.
The first beta often ships with core features missing or broken. It exists to get as many new features in front of third party developers as soon as possible, because Apple has very little time to accept feedback before they are locked in for shipping.
At the same time, there seems to be precious little time between when Apple decides a feature is going to ship in the next release, and when WWDC happens.
Even if there was common knowledge inside the company that a new UI was coming, it may not have been merged into mainline until closer to WWDC. At that point, individual teams would need to alter their code to build and be usable on top of the new UI as part of continuing their own development - but they were likely still focused on the death march for their own WWDC-launched features.
So are we not supposed to criticize a beta at all? How are they to know what to fix unless someone actually looks at it and makes clear what's wrong? Obviously they missed a pretty critical readability issue here.
You apparently have. Beta releases are supposed to be "we believe this to be ready to ship, but need to sort out bugs." What you describe has traditionally been alpha or even pre-alpha releases.
For those of us who have moved the vast majority of our Google searches to ChatGPT / only use Google periodically for one-off questions, is there still a reason to switch to Kagi?
What kind of search does Kagi excel at compared to Perplexity? I've been using Perplexity as a google replacement for about a year now, so I haven't tried Kagi, but seeing several people mention they use both has piqued my interest.
To me, personally, it's about the use case: searching for a page on the internet (Kagi) or researching a particular question or topic (Perplexity).
If I know what info I want (say, that particular blog post that mentioned topic XYZ, or the web page for a car dealership, or docs for something where the site search is worse than a web search), using Kagi is quicker and easier.
Edit to add: I just noticed I always use Kagi to search YouTube instead of YT's search directly (!yt <whatever>). I do the same for Wikipedia, Yahoo Finance, GoodReads, the Roger Ebert movie review site, and probably a few other sites I can't recall right now. I also have some sites boosted and others blocked, but I haven't tweaked that in a long time now...
If I'm interested in a topic but don't know exactly what or where, or want a longer explanation aggregated over multiple sources, then I use Perplexity. I usually fire off my question, let it work in the background, and come back a bit later.
That's just my use case, I don't presume that everyone else behaves the same. Also I just recently got access to Kagi's assistant on my plan, which may cannibalize my Perplexity use (we'll see).
For me ChatGPT is great when I don’t really know what I don’t know. I still end up having to do a google search after to verify that the AI result isn’t insane. So for me ChatGPT often is just adding an extra step.
The biggest complaint I (and several others) have is that we continuously hit the limit via the UI after even just a few intensive queries. Of course, we can use the console API, but then we lose ability to have things like Projects, etc.
Do you foresee these limitations increasing anytime soon?
Quick Edit: Just wanted to also say thank you for all your hard work, Claude has been phenomenal.
I'm sure many of us would gladly pay more to get 3-5x the limit.
And I'm also sure that you're working on it, but some kind of auto-summarization of facts to reduce the context in order to avoid penalizing long threads would be sweet.
I don't know if your internal users are dogfooding the product that has user limits, so you may not have had this feedback - it makes me irritable/stressed to know that I'm running up close to the limit without having gotten to the bottom of a bug. I don't think stress response in your users is a desirable thing :).
It takes time to grow capacity to meet growing revenue/usage. As parent is saying, if you are in a growth market at time T with capacity X, you would rather have more people using it even if that means they can each use less.
The problem with the API is that, as the documentation says, it could cost $100/hr.
I would pay $50/mo or something to be able to have reasonable use of Claude Code in a limited (but not as limited) way as through the web UI, but all of these coding tools seem to work only with the API and are therefore either too expensive or too limited.
> The problem with the API is that it, as it says in the documentation, could cost $100/hr.
I've used https://github.com/cline/cline to get a similar workflow to their Claude Code demo, and yes it's amazing how quickly the token counts add up. Claude seems to have capacity issues so I'm guessing they decided to charge a premium for what they can serve up.
+1 on the too expensive or too limited sentiment. I subscribed to Claude for quite a while but got frustrated the few times I would use it heavily I'd get stuck due to the rate limits.
I could stomach a $20-$50 subscription for something like 3.7 that I could use a lot when coding, and not worry about hitting limits (or I suspect being pushed on to a quantized/smaller model when used too much).
Claude Code does caching well, FWIW. Looking at my costs after a few coding sessions (totaling $6 or so), the vast majority is cache read, which is great to see. Without caching it'd be wildly more expensive.
Like $5+ was cache read ($0.05 vs $3 per million tokens), so it would have cost $300+.
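The arithmetic can be sketched in a few lines, assuming the quoted $0.05 vs $3 figures are per million tokens (real Anthropic pricing differs; check their docs before relying on these numbers):

```python
# Hypothetical per-million-token rates, taken from the figures quoted above.
INPUT_RATE = 3.00        # $/MTok for regular input tokens
CACHE_READ_RATE = 0.05   # $/MTok for cache reads

def session_cost(cached_mtok: float, fresh_mtok: float) -> float:
    """Cost of a session given millions of cached vs fresh input tokens."""
    return cached_mtok * CACHE_READ_RATE + fresh_mtok * INPUT_RATE

# ~100 MTok read from cache, vs the same tokens sent fresh every time:
with_cache = session_cost(100, 0)      # about $5
without_cache = session_cost(0, 100)   # about $300
```

Since agentic coding tools resend the whole conversation on every turn, almost all input tokens are cache hits, which is where the ~60x difference comes from at these rates.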
I paid for it for a while, but I kept running out of usage limits right in the middle of work every day. I'd end up pasting the context into ChatGPT to continue. It was so frustrating, especially because I really liked it and used it a lot.
It became such an anti-pattern that I stopped paying. Now, when people ask me which one to use, I always say I like Claude more than others, but I don’t recommend using it in a professional setting.
The gateway is integrated directly into our chat (https://glama.ai/chat). So you can use most of the things that you are used to having with Claude. And if anything is missing, just let me know and I will prioritize it. If you check our Discord, I have a decent track record of being receptive to feedback and quickly turning around features.
Long term, Glama's focus is predominantly on MCPs, but chat, gateway and LLM routing is integral to the greater vision.
I would love feedback if you're going to give it a try: frank@glama.ai
The issue isn't API limits, but web UI limits. We can always get around the web interface's limits by using the claude API directly but then you need to have some other interface...
The API still has limits. Even if you are on the highest tier, you will quickly run into those limits when using coding assistants.
The value proposition of Glama is that it combines UI and API.
While everyone focuses on either one or the other, I've been splitting my time equally working on both.
Glama UI would not win against Anthropic if we were to compare them by the number of features. However, the components that I developed were created with craft and love.
You have access to:
* Model switching between OpenAI, Anthropic, etc.
* Side-by-side conversations
* Full-text search of all your conversations
* Integration of LaTeX, Mermaid, rich-text editing
Ok, but that's not the issue the parent was mentioning. I've never hit API limits but, like the original comment mentioned, I too constantly hit the web interface limits particularly when discussing relatively large modules.
Your chat idea is a little similar to Abacus AI. I wish you had a similarly affordable monthly plan for chat only, but your UI seems much better. I may give it a try!
Who is glama.ai though? Could not find company info on the site, the Frank name writing the blog posts seems to be an alias for Popeye the sailor. Am I missing something there? How can a user vet the company?
As another commenter in this thread said, we are just a 'frontend wrapper' around other people's services. Therefore, it is not particularly difficult to add models that are already supported by other providers.
The benefit of using our wrapper is that you get a single API key and one bill for all your AI usage, and you don't need to hack together your own logic for routing requests between different providers, failovers, keeping track of costs, worrying about what happens if a provider goes down, etc.
The market at the moment is hugely fragmented, with many providers unstable, constantly shifting prices, etc. The benefit of a router is that you don't need to worry about those things.
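For what it's worth, the failover-and-billing logic a router like this takes off your hands can be sketched in a few lines of Python. (This is a hypothetical illustration of the pattern, not Glama's actual implementation; the provider names and `complete_fn` callables are made up.)

```python
# Minimal provider-failover router: try providers in order, skip any that
# error out, and keep a single cost ledger across all of them.

class Router:
    def __init__(self, providers):
        # providers: list of (name, complete_fn, dollars_per_mtok) tuples,
        # where complete_fn(prompt) returns (text, tokens_used).
        self.providers = providers
        self.costs = {}  # one ledger instead of one bill per provider

    def complete(self, prompt: str) -> str:
        for name, complete_fn, rate in self.providers:
            try:
                text, tokens = complete_fn(prompt)
            except Exception:
                continue  # provider down or erroring: fail over to the next
            self.costs[name] = self.costs.get(name, 0.0) + tokens / 1e6 * rate
            return text
        raise RuntimeError("all providers failed")
```

A production gateway layers retries, latency-based ordering, and per-model price tables on top, but the core shape is this fall-through loop.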
Scaling infrastructure to handle billions of tokens is no joke.
I believe they are approaching 1 trillion tokens per week.
Glama is way smaller. We only recently crossed 10bn tokens per day.
However, I have invested a lot more into UX/UI of that chat itself, i.e. while OpenRouter is entirely focused on API gateway (which is working for them), I am going for a hybrid approach.
The market is big enough for both projects to co-exist.
This is also my problem. I've only used the UI with the $20 subscription; can I use the same subscription for the CLI? I'm afraid it's like AWS API billing, where there's no limit to how much I can use and then I get a surprise bill.
It is API billing like AWS - you pay for what you use. Every time you exit a session we print the cost, and in the middle of a session you can do /cost to see your cost so far that session!
What I really want (as a current Pro subscriber) is a subscription tier ("Ultimate" at ~$120/month ?) that gives me priority access to the usual chat interface, but _also_ a bunch of API credits that would ensure Claude and I can code together for most of the average working month (reasonable estimate would be 4 hours a day, 15 days a month).
i.e I'd like my chat and API usage to be all included under a flat-rate subscription.
Currently, Pro doesn't give me any API credits to use with coding assistants (Claude Code included?), which is completely disjointed. And I need to be a business to use the API still?
Honestly, Claude is so good, just please take my money and make it easy to do the above !
I don’t think you need to be a business to use the API? At least I’m fairly certain I’m using it in a personal capacity. You are never going to hit $120/month even with full-time usage (no guarantees of course, but I get to like $40/month).
$1500 is 100 million output tokens, or 500 million input tokens for Claude 3.7.
The entire LOTR trilogy is ~0.55 million tokens (1,200 pages, published).
If you are sending and receiving the text equivalent of several hundred copies of the LOTR trilogy every week, I don't think you are actually using AI for anything useful, or you are providing far too much context.
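A quick back-of-envelope check of those numbers, using the $3/MTok input and $15/MTok output rates that the $1500 figures above imply:

```python
# Working out what $1500/month buys at the quoted Claude 3.7 rates.
budget = 1500.0
output_rate = 15.0 / 1_000_000  # $ per output token
input_rate = 3.0 / 1_000_000    # $ per input token

output_tokens = budget / output_rate  # 100 million output tokens
input_tokens = budget / input_rate    # 500 million input tokens

lotr_tokens = 0.55e6  # rough token count of the LOTR trilogy
trilogies_as_input = input_tokens / lotr_tokens  # roughly 900 copies
```

So the $1500 figure really does correspond to sending on the order of 900 LOTR trilogies as input, which supports the point that a flat subscription at that scale would be hard to justify.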
You can do this yourself. Anyone can buy API credits. I literally just did this with my personal credit card using my gmail based account earlier today.
1. Subscribe to Claude Pro for $20 month
2. Separately, Buy $100 worth of API credits.
Now you have a Claude "ultimate" subscription where the credits roll over as an added bonus.
As someone who only uses the APIs, and not the subscription services for AI, I can tell you that $100 is A LOT of usage. Quite frankly, I've never used anywhere close to $20 in a month, which is why I don't subscribe. I mostly just use text, though, so if you do a lot of image generation, that can add up quickly.
I don't think you can generate images with Claude. I just asked it for a pink elephant: "I can't generate images directly, but I can create an SVG representation of a pink elephant for you." And it did it :)
But I still hit limits. I use Claudemind with JetBrains stuff and there is a max of input tokens (I believe). I am 'tier 2', but it doesn't look like I can go past this without an enterprise agreement.
Can't wait to try this. What's amazing to me is that when this was revealed just one short month ago, the AI landscape looked very different than it does today with more AI companies jumping into the fray with very compelling models. I wonder how the AI shift has affected this release internally, future releases and their mindset moving forward... How does the efficiency change, the scope of their models, etc.
If they were the same, I would have expected explicit references to o3 in the system card and how o3-mini is distilled or built from o3 - https://cdn.openai.com/o3-mini-system-card.pdf - but there are no references.
Excited at the pace all the same. Excited to dig in. The model naming all around is so confusing. Very difficult to tell what breakthrough innovations occurred.
Yeah - the naming is confusing. We're seeing o3-mini. o3 yields marginally better performance given exponentially more compute. Unlike OpenAI, customers will not have an option to throw an endless amount of money at specific tasks/prompts.
I really don't think this is true. OpenAI has no moat because they have nothing unique; they're using mostly other people's (like Transformers) architectures and other companies hardware.
Their value prop (moat) is that they've burnt more money than everybody else. That moat is trivially circumvented by lighting a larger pile of money, and less trivially by lighting the pile more efficiently.
OpenAI isn't the only company. The tech companies being massively outspent by Microsoft on H100 purchases are the ones with a moat. Google and Amazon, with their custom AI chips, are going to have better performance per cost than others, and that will be a moat. If you want the same performance per cost, you need to spend years of effort making your own chips (= moat).
That's a shame on Google, Apple, Samsung, etc. Voice and other activation methods should be open to any app that claims to be an assistant. It's an ugly form of gatekeeping.
When you want to use AI in business, you need some guarantee that the integration will not break because the AI company goes down or because of some breaking change in a year. There is a reason MSFT is in business. Similarly, you will not buy from Google because of their habit of killing products, and you will not buy some unknown product just because it is 5% cheaper. OpenAI has a strong brand at the moment, and this is their thing, until companies go to MSFT or AMZ to use their services with the ability to choose any model.
Capex was the theoretical moat, same as TSMC and similar businesses. DeepSeek poked a hole in this theory. OpenAI will need to deliver massive improvements to justify a 1 billion dollar training cost relative to 5 million dollars.
I don't know if you are, but a lot of people are still comparing one DeepSeek training run to the entire costs of OpenAI.
The DeepSeek paper states that the $5 million number doesn't include development costs, only the final training run. And it doesn't include the estimated $1.4 billion cost of the infrastructure/chips DeepSeek owns.
Most of OpenAI's billion dollar costs is in inference, not training. It takes a lot of compute to serve so many users.
Dario said recently that Claude was in the tens of millions (and that it was a year earlier, so some cost decline is expected), do we have some reason to think OpenAI was so vastly different?
Anthropic’s CEO was predicting billion-dollar training runs for 2025. Current training runs were likely in the tens or hundreds of millions of dollars USD.
Inference capex costs are not a defensive moat as I can rent gpus and sell inference with linear scaling costs. A hypothetical 10 billion dollar training run on proprietary data was a massive moat.
It is still curious though as far as what is actually being automated?
I find huge value in these models as an augmentation of my intelligence and as a kind of cybernetic partner.
I can't think of anything that can actually be automated though in terms of white collar jobs.
The white-collar model test case I have in mind is a bank analyst under a bank operations manager. I have done both jobs in the past, but something is really lacking in the idea of the operations manager replacing the analyst with a reasoning model, even though DeepSeek right now annihilates the reasoning of every bank analyst I ever worked with.
If you can't even arbitrage away the average bank analyst, there might be these really non-intuitive no-AI-arbitrage conditions in white-collar work.
I don’t want to pretend I know how bank analysts work, but at the very least I would assume that 4 bank analysts with reasoning models would outperform 5 bank analysts without.
AICrete | Richmond, CA | Hybrid / Remote (North America) | Full-time | Frontend / Backend / Fullstack Engineer
At AICrete, we are committed to revolutionizing the global concrete and construction industry. Leveraging AI, machine learning, computer vision, and sophisticated automation, AICreteOS is our innovative solution designed to enhance sustainability, profitability, efficiency, and productivity in real-time. Proudly standing as the first company in the world to introduce AI to the concrete materials industry, we are at the forefront of technological advancement, setting new standards and pioneering changes that drive real impact.
We're a small team of 10-15 and we're looking for great engineers to help us build, grow, and maintain products used by some of the largest concrete customers across the US (and soon abroad), as well as engineers interested in working at the intersection of sustainability, green tech, and AI.
There's a really great epidemiologist who has been publishing data-driven articles on Substack on the subject of COVID, from wastewater samples to control groups across the world. Her insights are often super valuable; she only speaks to what she knows, and the rest goes off data she comes across. I'd really recommend her thoughts on anything COVID-related.
As someone who has moved to the Middle East (from the Bay Area), a large portion of this society heavily relies on Facebook, particularly for FB groups and FB Marketplace. Aside from that, this entire society basically functions on Instagram for any news / personal connections. But as it relates to Facebook, groups are a great way to connect with people and ask questions, and since we don't really have a good alternative to Craigslist here, FB Marketplace fills that void.
That said, I'm sure people here use Facebook for other reasons, but anecdotally, this is what I'm seeing. Instagram and Facebook aside, the most used app I've found here is WhatsApp, so the society here is deeply integrated into the Facebook web of services
Can someone enlighten me on why manufacturers who have experience with software, such as Microsoft, don’t create their own OS for their mobile phones? I recognize that it’s easier and faster to iterate to just use Android with some sprinkles on top, but even if it meant spending 4-5 years developing it, the potential market share is absolutely massive. I can see the first year or two would not be great since app developers would need to build their apps, but after that initial hurdle, then I’d imagine it wouldn’t be as bad. After that, it comes down to sales, marketing and mindshare adoption.
To me regarding LG, I was never a fan of their phones, but less competition is always bad in my book.
Edit: Yes, I remember the Windows Phone and its failures; I was thinking more of starting a newer OS these days rather than several years ago.
And there was also webOS, which I know was used in at least one tablet (maybe some phones too) and which is currently owned by LG.
Just think about how manufacturers are struggling to find success with their own smartphones. Imagine trying to do that with the added burden of developing a custom operating system.
And before that, Bada. Samsung has a bad history of promising things to users of Bada and Tizen that never shipped. This is one of the reasons I would never buy a Samsung phone again.
Microsoft, of course, famously did create their own phone OS. It wasn’t bad either, but among other problems they were a little late to market and there were no apps for it. A real chicken and egg problem. Not worth the engineering effort to build an app for a tiny marketshare and no one wants a smartphone that can’t even hail an Uber.
Having developed for Windows Phone 8, 8.1, and 10, I can say with certainty that it was bad in a plethora of ways, and Microsoft's constant over-promising and under-delivering made it worse.
Compared to competing OSes at the time? I don’t remember any being particularly fun. (But also I didn’t spend much time in the Windows Phone ecosystem)
But I think you’re right that MS lost focus on developers on mobile. Ironic given "developers, developers, developers" was the literal mantra.
Just to expand on Windows Mobile: for the problems it had (probably the biggest was trying to enter a duopoly), I have yet to hear of anyone who had one who didn't like it. I never used one, but those who did seem to have liked it better than iOS and Android.
Microsoft tried; they weren't able to capture that market share. Much of that probably comes down to it being tough to catch up to the 3rd party app ecosystem Apple and Android each have.
I believe Huawei is being forced to develop their own OS due to the threat of US sanctions. However, I think it'd be a losing battle to try to steal enough market share from iOS/Android. They'd simply copy (or one-up) your competitive advantage, leaving you little room to compete.
I mean, "fake it till you make it" is a good strategy (although you probably shouldn't lie). It is a sound engineering decision to rewrite Android piece by piece, like replacing Linux with LiteOS.