
Apple made M3 models with less than 16GB? Man, I can't wait till the cheapest model has at least 128GB.

Yeah, M4 was the generation when the minimum got bumped up to 16GB.

Another European chiming in: I enjoyed the OP's article.

Do you also write your bytecode by hand? At which abstraction layer do we draw the line?

> it has, but python being single threaded (until recently) didn't make it an attractive choice for CLI tools.

You probably mean the GIL, as Python has supported multithreading for like 20 years; the GIL just prevents threads from running Python bytecode in parallel.

Idk if ranger is slow because it is written in Python; it is probably the specific implementation.
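
A toy illustration of the distinction (my own sketch, not from the thread): threads have worked for ages, but on a GIL build, CPU-bound threads don't actually run in parallel.

    import threading, time

    def busy():
        # CPU-bound work: holds the GIL for the duration of each bytecode step
        n = 0
        for _ in range(10_000_000):
            n += 1

    start = time.perf_counter()
    threads = [threading.Thread(target=busy) for _ in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # With the GIL this takes roughly 4x the single-thread time despite 4 threads;
    # on a free-threaded (no-GIL) build the threads can run in parallel.
    print(f"{time.perf_counter() - start:.2f}s")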


> You probably mean the GIL

They also probably mean TUIs, as CLIs don't do the whole "draw every X" thing (and usually aren't interactive); that's basically what sets TUIs apart from CLIs.


Even my CC status line script enjoyed a 20x speed improvement when I rewrote it from Python to Rust.

It’s surprising how quickly the bottleneck starts to become Python itself in any nontrivial application, unless you’re very careful to write a thin layer that mostly shells out to C modules.
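
A minimal sketch of what "shelling out to C" buys you; the numbers are mine, not measured for any particular app, but easy to reproduce:

    import timeit

    data = list(range(1_000_000))

    def py_sum(xs):
        # Pure-Python loop: every iteration pays interpreter overhead
        total = 0
        for x in xs:
            total += x
        return total

    print(timeit.timeit(lambda: py_sum(data), number=10))  # slow: bytecode loop
    print(timeit.timeit(lambda: sum(data), number=10))     # fast: loop runs in C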

It is only 4 years old

The technical preview was in June 2021. I was using it for a bit before that as an internal employee, so they may have rounded up slightly, or were also in an internal beta test.

Side note: I’ve been trying to remember when it launched internally, if anybody knows. I feel like it was pre-COVID, but that would be a long timeline from internal use to public preview.


Yes, the technical preview of GitHub Copilot. I rounded up.

Fair enough! The jump from that to ChatGPT’s launch (which I didn’t find that interesting), to GPT-4, to Claude Code/Codex CLI, to Gemini 3/Opus 4.5/GPT 5.2 has been insane in such a short time. I’m excited (especially since the release of the Codex CLI: https://dkdc.dev/posts/modern-agentic-software-engineering/)

Keyboard autocomplete?

I wonder: what if we just crammed more into the "tokens"? I am running an experiment replacing discrete tokens with embeddings + a small byte encoder/decoder. That way you can use the embedding space much more efficiently and have it contain much more nuance.

Experiments I want to build on top of it:

1. Adding LSP context to the embeddings - that way the model could _see_ the syntax better, closer to how we use IDEs, and would not need to read/grep 25k lines just to find where something is used.

2. Experiments with different "compression" ratios. Each embedding could encode a different number of bytes, and we would not rely on a huge static token dictionary.

I'm aware that papers exist that explore these ideas, but so far no popular/good open source models employ this. Unless someone can prove me wrong.
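
A rough PyTorch sketch of the byte-encoder half of the idea (all names and sizes here are made up for illustration; this is not the actual experiment): replace the token-embedding lookup with a small module that folds a fixed-size window of raw bytes into one input embedding.

    import torch
    import torch.nn as nn

    class ByteChunkEncoder(nn.Module):
        def __init__(self, chunk_bytes: int = 8, d_model: int = 512):
            super().__init__()
            self.chunk_bytes = chunk_bytes
            self.byte_emb = nn.Embedding(256, 64)             # one vector per byte value
            self.proj = nn.Linear(chunk_bytes * 64, d_model)  # fold a chunk into one "token"

        def forward(self, byte_ids: torch.Tensor) -> torch.Tensor:
            # byte_ids: (batch, n_bytes), with n_bytes divisible by chunk_bytes
            b, n = byte_ids.shape
            x = self.byte_emb(byte_ids)                       # (b, n, 64)
            x = x.view(b, n // self.chunk_bytes, -1)          # group bytes into chunks
            return self.proj(x)                               # (b, n / chunk_bytes, d_model)

    enc = ByteChunkEncoder()
    text = b"def main() -> i32 {"[:16]                        # 16 bytes -> 2 "tokens"
    ids = torch.tensor(list(text)).unsqueeze(0)
    print(enc(ids).shape)                                     # torch.Size([1, 2, 512])

The LSP idea would then just be extra features (symbol kind, definition vs. reference, and so on) concatenated to the byte embeddings before the projection.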


I found a few papers in this direction with Perplexity, like this one: https://ceur-ws.org/Vol-4005/paper1.pdf. It doesn't seem to be that relevant for now.

The progress of a handful of models seems to be so much better (because of limited compute we have only a handful of big ones, I presume) that these fine-tunings are just not yet relevant.

I'm also curious what an English + Java + HTML + CSS + JavaScript-only model would look like in size and speed, for example.

Unfortunately, whenever I ask myself the question of fine-tuning tokens (just a few days ago this question came up again), the deep dive takes too much time.

Claude only got LSP support in November, I think, and it's not even clear to me to what extent. So despite the feeling that we are moving fast, tons of basic ideas haven't even made it in yet.


If you have a corpus of code snippets to train the manifold (Laplacian) on (and a good embedding model), it is definitely possible to try something like this.

There are many examples of noisily encoding a large embedding vocabulary. This sounds a bit like T-free or H-net? Or BLT?

One of the main issues with this line of work is that you end up trading embedding parameters for active parameters, which is rarely a good trade-off in terms of compute.


Isn't this just an awkward way of adding an extra layer to the NN, except without end-to-end training?

Models like Stable Diffusion sort of do a similar thing using CLIP embeddings. It works, and it's an easy way to benefit from the pre-training CLIP has. But for a language model it would seemingly make more sense to just add the extra layer.


I mean, this is exactly what it is: just a wrapper to replace the tokenizer. That is exactly how LLMs can read images.

I'm just focusing on different parts


Not an expert in the space, but I’m not sure you need to modify tokens to get the model to see syntax; you basically get that exact association from attention.

You only get the association that is relevant to your project if you can cram in the whole codebase. Otherwise it is making rough estimates, and some of the time that seems to be where the models fail.

It can only be fully resolved with either infinite context length or by doing it similarly to how humans do it: adding some LSP "color" to the code tokens.

You can get a feel for what LLMs deal with if you open 3000 lines of code in a simple text editor and try to do something. That may work for simple fixes, but not for whole-codebase refactors. Only ultra-skilled humans can be productive that way (using my subjective definition of "productive").


Well, using the Claude Pro/Max Claude Code API without Claude Code, instead of the actual API they monetize, goes against their ToS.

I don't like it either, but it is what it is.

If I gave free water refills when you used my brand-XYZ water bottle, you should not cry that you don't get free refills for your ABC-branded bottle.

It may be scummy, but it does make sense.


You should never use GIF anymore; it is super inefficient. Just use video, which is 5x to 10x more efficient.

https://web.dev/articles/replace-gifs-with-videos
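
For what it's worth, here is a tiny wrapper along the lines of the conversion that article describes (the exact ffmpeg flags here are my choice, not the article's; it assumes ffmpeg is installed):

    import subprocess

    def gif_to_mp4(src: str, dst: str) -> None:
        subprocess.run([
            "ffmpeg", "-i", src,
            "-movflags", "faststart",  # metadata up front so playback starts early
            "-pix_fmt", "yuv420p",     # widest decoder compatibility
            "-vf", "scale=trunc(iw/2)*2:trunc(ih/2)*2",  # h264 needs even dimensions
            dst,
        ], check=True)

    gif_to_mp4("demo.gif", "demo.mp4")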


There are odd cases where it still has uses. When I was a teacher, some of the gamifying tools didn't allow video embeds without a subscription, but I wanted to make some "what 3D operation is shown here" questions with various tools in Blender. GIF sizes were pretty comparable to video for largely static, less-than-a-second loops, and likely had slightly higher quality with some care taken to reduce color-palette usage.

But I fully realize there are vanishingly few cases with similar constraints.


For those you can often use animated WebP, or even APNG. They all have close to universal support and are usually much smaller.


If you need animated images in emails or text messages, GIF is the only supported format that will play the animation. Because of the size restrictions of these messaging systems, the inefficient compression of GIFs is a major issue.


I am not sure "need" is the right word here.


AVIF works here also. Discord started supporting it for custom emoji.


Videos and images are treated very differently by browsers and OSes. I'm guessing the better suggestion would be to use APNG or animated AVIF if you are looking for a proper GIF alternative.


Do browsers support progressive enhancement from GIF to animated AVIF without JavaScript? They royally messed that up for animated WebP.


Yes, by using the <picture> element with <source> elements declaring the individual formats, with the last one being a regular <img> with the GIF.

Or you could use content negotiation to only send AVIF when it's supported, but IMO the HTML way with <picture> is perhaps clearer for the client and end user.
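
A minimal content-negotiation sketch, assuming a Flask server and files named anim.avif/anim.gif (both hypothetical):

    from flask import Flask, request, send_file

    app = Flask(__name__)

    @app.route("/anim")
    def anim():
        # Send AVIF only when the Accept header advertises support for it
        avif_ok = "image/avif" in request.headers.get("Accept", "")
        resp = send_file("anim.avif" if avif_ok else "anim.gif",
                         mimetype="image/avif" if avif_ok else "image/gif")
        resp.headers["Vary"] = "Accept"  # caches must key on the Accept header
        return resp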

I think the WebP problem was due to browsers supporting WebP but not supporting animation, transparency, or other features, so content negotiation based on MIME types (either via <picture> or HTTP content negotiation) did not work properly. Safari 16.1-16.3 has the same problem with AVIF, but that is a smaller problem than it was with WebP.


So I guess that's a no: AVIF support does not necessarily mean animated-AVIF support.


I covered this in my comment:

> Safari 16.1-16.3 has the same problem with AVIF, but that is a smaller problem than it was with WebP.


Unfortunately, browser vendors didn't want to support silent looping videos in <img> tags, so GIF stays relevant.


Only if the looping information is stored inside the container.


First, so best in this?

