3836293648's comments

There is discussion about this in the Rust world, though no attempts at implementation yet (and stabilisation is even further off)

Maybe if you owned the tree, not if someone a few houses down does

This is in response to all the pushback they got from that

But this also means tiny context windows. You can't fit gpt-oss:20b + more than a tiny file + instructions into 24GB

Gpt-oss is natively 4-bit, so you kinda can

You can fit the weights plus a tiny context window into 24GB, absolutely. But you can't fit a context of any reasonable size. Or maybe Ollama's implementation is broken, but when I last tried it, the context had to be restricted beyond usability to keep it from freezing up the entire machine.
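
For concreteness, here's the napkin math behind this. Heavy caveat: the layer/head numbers below are placeholder assumptions in a GQA layout, not confirmed gpt-oss:20b internals.

    # Rough VRAM budget for a ~20B-param model on a 24 GB card.
    # All architecture numbers here are illustrative assumptions.
    GiB = 1024**3
    weights = 20e9 * 0.5 / GiB                  # 4-bit ~ 0.5 bytes/param -> ~9.3 GiB

    n_layers, n_kv_heads, head_dim = 24, 8, 64  # assumed GQA layout
    kv_per_token = 2 * n_layers * n_kv_heads * head_dim * 2  # fp16 K+V, in bytes

    budget = (24 - 2) * GiB - weights * GiB     # reserve ~2 GiB for activations
    print(f"weights ~ {weights:.1f} GiB, KV cache ~ {kv_per_token / 1024:.0f} KiB/token")
    print(f"context that fits ~ {budget / kv_per_token:,.0f} tokens")

Under these assumed numbers the weights leave roughly 12 GiB of headroom, which would fit a long context in principle, so the freezing is at least consistent with the runtime's memory handling, rather than raw capacity, being what falls over.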

LLMs do typically encode a confidence level in their embeddings; they just never surface it when asked. There were multiple papers on this a few years back that got reasonable results out of it. I think that was in the GPT-3.5 era, though
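
The papers I'm thinking of mostly probed the hidden states directly. A much cruder stand-in that's easy to run at home is reading per-token probabilities off the logits: the signal is there even though the model never volunteers it in its answer. (gpt2 below is just an arbitrary small stand-in, not a model from those papers.)

    # Read a built-in confidence signal instead of asking the model for one.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    inputs = tok("The capital of France is", return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]   # next-token logits

    probs = torch.softmax(logits, dim=-1)
    top = torch.topk(probs, 3)
    for p, i in zip(top.values, top.indices):
        print(f"{tok.decode(int(i))!r}: {p:.2f}")  # p is the model's own probability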


It's mostly AI slop, but they did exist before AI (and they were miserable back then too)


Are you seriously comparing discrimination based on factors no one can control to a group literally defined by a choice they made? And you think that's a good faith argument?


People hate C because it's hard; people hate C++ because it truly is rubbish. Rubbish that deserved to be tried, but that we've now learned was a mistake and should move on from.


Because LLM tokens don't map cleanly to what the compiler sees as a token. If coding is all LLMs turn out to be good for, this will surely change
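
Concretely, the mismatch looks like this. A sketch using tiktoken's cl100k_base encoding; the exact splits vary by tokeniser:

    # A BPE tokeniser splits code on statistical subword boundaries,
    # not on the lexical tokens a compiler's lexer produces.
    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")
    src = "uint64_t frobnicate_count = 0;"
    print([enc.decode([t]) for t in enc.encode(src)])
    # Prints something like:
    #   ['uint', '64', '_t', ' fro', 'bn', 'icate', '_count', ' =', ' ', '0', ';']
    # whereas a C lexer sees: 'uint64_t', 'frobnicate_count', '=', '0', ';'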


What on Earth have you used to get reasonable results out of a local model?

I've tried at every new model release (that can run on my 24GB card), and everything is still entirely useless.

I'm not writing web stuff though.

