Hacker Newsnew | past | comments | ask | show | jobs | submit | MuffinFlavored's commentslogin

> Imagine if Siri could genuinely file your taxes

I do not like reading things like this. It makes me feel very disconnected from the AI community. I defensively do not believe there exist people who would let AI do their taxes.


> You can shrink the model to a fraction of its "full" size and get 92-95% same performance, with less VRAM use.

Are there a lot of options how "how far" do you quantize? How much VRAM does it take to get the 92-95% you are speaking of?


> Are there a lot of options how "how far" do you quantize?

So many: https://www.reddit.com/r/LocalLLaMA/comments/1ba55rj/overvie...

> How much VRAM does it take to get the 92-95% you are speaking of?

For inference, it's heavily dependent on the size of the weights (plus context). Quantizing an f32 or f16 model to q4/mxfp4 won't necessarily use 92-95% less VRAM, but it's pretty close for smaller contexts.


Thank you. Could you give a tl;dr on "the full model needs ____ this much VRAM and if you do _____ the most common quantization method it will run in ____ this much VRAM" rough estimate please?

It’s a trivial calculation to make (+/- 10%).

Number of params == “variables” in memory

VRAM footprint ~= number of params * size of a param

A 4B model at 8 bits will result in 4GB vram give or take, same as params. At 4 bits ~= 2GB and so on. Kimi is about 512GB at 4 bits.


Did you eventually move to a $20/mo Claude plan, $100/mo Claude plan, $200/mo, or API based? if API based, how much are you averaging a month?

The $20 one, but it's hobby use for me, would probably need the $200 one if I was full time. Ran into the 5 hour limit in like 30 minutes the other day.

I've also been testing OpenClaw. It burned 8M tokens during my half hour of testing, which would have been like $50 with Opus on the API. (Which is why everyone was using it with the sub, until Anthropic apparently banned that.)

I was using GLM on Cerebras instead, so it was only $10 per half hour ;) Tried to get their Coding plan ("unlimited" for $50/mo) but sold out...

(My fallback is I got a whole year of GLM from ZAI for $20 for the year, it's just a bit too slow for interactive use.)


Try Codex. It's better (subjectively, but objectively they are in the same ballpark), and its $20 plan is way more generous. I can use gpt-5.2 on high (prefer overall smarter models to -codex coding ones) almost nonstop, sometimes a few in parallel before I hit any limits (if ever).

I now have 3 x 100 plans. Only then I an able to full time use it. Otherwise I hit the limits. I am q heavy user. Often work on 5 apps at the same time.

Shouldn't the 200 plan give you 4x?? Why 3 x 100 then?

Good point. Need to look into that one. Pricing is also changing constantly with Claude

and it has proven to not be a great hedge against inflation (lately, short term)

the digital gold narrative falls apart when non-digital gold outperforms it and nobody wants it for its digital properties (payments, blockchain, etc.)


Why does the news matter?

Sincerely, Somebody on "Hacker" News


Worth noting that the implied volatility extracted here is largely a function of how far OTM the strike is relative to current spot, not some market-specific view on $100. If NVDA were trading at $250 today, the options chain would reprice and you'd extract similar vol for whatever strike was ~45% below. The analysis answers "what's the probability of a near-halving from here" more than "what's special about $100." Still useful for the prediction contest, but the framing makes it sound like the market is specifically opining on that price level.


this is gpt, right?


I had a conversation (prompts) with Claude about this article because I didn't feel I could as succinctly describe my point alone.


There are grammatical mistakes and abbreviations, big tells that it's NOT ChatGPT.


Trivial to build something that is opinionated and custom to you?


> the point is validating and advertising Polymarket on national news outlets will have ugly consequences.

Coinbase

Robinhood

DraftKings

FanDuel


Yes, all of those are equally problematic.

Polymarket here is one example among the companies causing the problem. Legislation that addresses the problem should affect all the listed companies and more, not just Polymarket.


Giving credence and advertisement to all of those has caused huge societal problems. Sports Betting is a huge problem.


How did Rhianna look him in the eyes and say "yes babe, good album, release it, this is what the people wanted after 7 years, it is pleasing to listen to and enjoyable"?


I prefer when artists make music they intrinsically want to make — not what others want them to make.


the real question is how much of the art is their own and how much is outside expectations and their reactions to it.

And it's not always giving in to those voices, sometimes it's going in the opposite direction specifically to subvert those voices and expectations even if that ends up going against your initial instincts as an artist.

With someone like A$AP Rocky, there is a lot of money on the line wrt the record execs but even small indie artists playing to only a hundred people a night have to contend with audience expectation and how that can exert an influence on their creativity.


It seems the numerous leaks and trials took their toll.

I don’t disagree with you—I felt “Tailor Swif,” “DMB,” and “Both Eyes Closed” were all stronger than the tracks that made it onto this album.

But sometimes you’ve gotta ship the project in the state it’s in and move on with your life.

Maybe now he can move forward and start working on something new. And perhaps that project will be stronger.


Im sure it was more like, “hey babe, can I get a few millions to go in the studio and experiment/make some art?” And then she was like, “yeah go for it! Make some weird shit.”

If I was in his position I’d probably be doing the same. Why bother with another top hit that pleases the masses.


Because it was awesome. But also, leaks probably.


Is this the answer for citizens of countries other than America? I don't think it is.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: