Back when I was young, which wasn't that long ago, one tried to put as much pre-computed stuff into memory as possible, since memory was so much faster than the CPU. Lookup tables left, right and center.
These days you can do thousands of calculations in the time it takes to fetch a few bytes from memory. And not only is the speed gap getting worse, but memory sizes aren't keeping up either.
Guess we're not far from the point where compressing stuff before putting it in memory is something you'd want to do most of the time. LZ4 decompression[1] is already within a small factor of memcpy speed.
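To put rough numbers on that, here's a minimal sketch, assuming liblz4 is installed and linked with -llz4; the 64 MiB buffer and its synthetic contents are just placeholders. It times a plain memcpy against LZ4_decompress_safe on the same data:

```c
#define _POSIX_C_SOURCE 199309L
#include <lz4.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

static double now_sec(void) {
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec + ts.tv_nsec * 1e-9;
}

int main(void) {
    const int n = 64 * 1024 * 1024;               /* 64 MiB of mildly compressible data */
    char *src = malloc(n), *dst = malloc(n);
    char *comp = malloc(LZ4_compressBound(n));
    for (int i = 0; i < n; i++)
        src[i] = (i % 251 < 200) ? 'A' : (char)i; /* arbitrary, somewhat repetitive filler */

    int csize = LZ4_compress_default(src, comp, n, LZ4_compressBound(n));

    double t0 = now_sec();
    memcpy(dst, src, n);
    double t1 = now_sec();
    LZ4_decompress_safe(comp, dst, csize, n);
    double t2 = now_sec();

    printf("memcpy: %.1f ms, LZ4 decompress: %.1f ms, ratio: %.2fx\n",
           (t1 - t0) * 1e3, (t2 - t1) * 1e3, (double)n / csize);
    free(src); free(dst); free(comp);
    return 0;
}
```

The exact gap depends heavily on how compressible the data is, but on typical hardware the decompression loop stays within a small factor of the raw copy.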
The RAM was so terrible that you essentially tried to keep the processors running in cache for as long as possible; RAM access was painful.
There is a performance profiling tool built into F3DEX3 which shows that, while running Zelda OoT, the system is idle roughly 70% of the time, just waiting for memory transfers. The folks at SGI/RAMBUS cut corners a little too hard building that system.
But it turns out this kind of performance profile was just prep for where we are heading, apparently.
I was reminded of this again when watching a performance analysis video that may or may not have been posted here (sometimes I get things from here or reddit, but sometimes the real story is in the related videos). It doesn't take a very big lookup table before it's faster to just rerun the calculations.
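To make that concrete, here's a toy sketch; the table size, the hash-style "calculation", and the iteration count are all made up for illustration. It compares random hits into a 64 MiB lookup table against simply recomputing the value:

```c
#define _POSIX_C_SOURCE 199309L
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define ENTRIES (1u << 24)            /* 16M entries x 4 bytes = 64 MiB: well past L3 cache */
#define ITERS   (10 * 1000 * 1000)

static double now_sec(void) {
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec + ts.tv_nsec * 1e-9;
}

/* the "expensive" function being cached: really just a handful of ALU ops */
static inline uint32_t compute(uint32_t x) {
    x ^= x >> 17; x *= 0xed5ad4bbu;
    x ^= x >> 11; x *= 0xac4c1b51u;
    return x;
}

int main(void) {
    uint32_t *table = malloc(ENTRIES * sizeof *table);
    for (uint32_t i = 0; i < ENTRIES; i++) table[i] = compute(i);

    uint32_t idx, sum_lut = 0, sum_alu = 0;

    double t0 = now_sec();
    idx = 12345;
    for (int i = 0; i < ITERS; i++) {             /* random indices defeat the prefetcher */
        idx = idx * 1664525u + 1013904223u;
        sum_lut += table[idx % ENTRIES];          /* one likely-cache-missing load */
    }
    double t1 = now_sec();
    idx = 12345;
    for (int i = 0; i < ITERS; i++) {
        idx = idx * 1664525u + 1013904223u;
        sum_alu += compute(idx % ENTRIES);        /* recompute instead: no memory traffic */
    }
    double t2 = now_sec();

    printf("lookup: %.0f ms, recompute: %.0f ms (checksums %u %u)\n",
           (t1 - t0) * 1e3, (t2 - t1) * 1e3, sum_lut, sum_alu);
    free(table);
    return 0;
}
```

Once the table no longer fits in cache, each lookup pays a trip to DRAM while the recomputation stays entirely on-core.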
Especially when you throw multiprocessing into the mix. We need better benchmarking tools that load up competing workloads in the background, so you can tell how your optimization really behaves in production instead of in the little toy universe of your benchmark.
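Something like that is easy to prototype: a minimal harness sketch, assuming POSIX threads (compile with -pthread), where the thread count and buffer size are arbitrary. It spawns memory-bandwidth hogs in the background while the code under test runs:

```c
#define _POSIX_C_SOURCE 200809L
#include <pthread.h>
#include <stdatomic.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define HOG_BYTES (256u * 1024 * 1024)   /* each hog streams through 256 MiB */

static atomic_int stop;

/* "noisy neighbour": keeps the memory bus busy until told to stop */
static void *bandwidth_hog(void *arg) {
    char *buf = arg;
    while (!atomic_load(&stop))
        memset(buf, 0x5a, HOG_BYTES);
    return NULL;
}

int main(void) {
    enum { NHOGS = 3 };                  /* arbitrary: leave a core or two for the benchmark */
    pthread_t tid[NHOGS];
    char *bufs[NHOGS];

    for (int i = 0; i < NHOGS; i++) {
        bufs[i] = malloc(HOG_BYTES);
        pthread_create(&tid[i], NULL, bandwidth_hog, bufs[i]);
    }

    /* --- run the code under test here, timed as usual --- */
    printf("benchmark body goes here, now competing for memory bandwidth\n");

    atomic_store(&stop, 1);
    for (int i = 0; i < NHOGS; i++) {
        pthread_join(tid[i], NULL);
        free(bufs[i]);
    }
    return 0;
}
```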
On the latter point, macOS has had compressed memory for a long time now, and some Linux distributions also enable it out of the box (I don't know anything about Windows).
One of the time series databases streams compressed blocks through memory when doing searches, with each core handling its own distinct blocks. For some scenarios it's faster to do a table scan than to keep extra indexes hot.
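The pattern is roughly this (a toy sketch assuming liblz4 and POSIX threads; the block layout and the predicate are invented and not any particular database's format): each worker decompresses only its own blocks into a small scratch buffer and scans them, so only the compressed data stays resident.

```c
#include <lz4.h>
#include <pthread.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

enum { NBLOCKS = 8, NTHREADS = 4, RAW = 1 << 16 };   /* toy sizes */

static char *cdata[NBLOCKS];     /* compressed blocks: the only copy kept in memory */
static int   csize[NBLOCKS];

struct worker { int first, count; long hits; };

static void *scan_worker(void *arg) {
    struct worker *w = arg;
    int32_t *scratch = malloc(RAW);                  /* per-thread decompression buffer */
    for (int b = w->first; b < w->first + w->count; b++) {
        LZ4_decompress_safe(cdata[b], (char *)scratch, csize[b], RAW);
        for (size_t i = 0; i < RAW / sizeof *scratch; i++)
            if (scratch[i] > 900)                    /* placeholder predicate */
                w->hits++;
    }
    free(scratch);
    return NULL;
}

int main(void) {
    /* build some compressible toy blocks */
    int32_t *raw = malloc(RAW);
    for (int b = 0; b < NBLOCKS; b++) {
        for (size_t i = 0; i < RAW / sizeof *raw; i++) raw[i] = (int32_t)(i % 1000);
        cdata[b] = malloc(LZ4_compressBound(RAW));
        csize[b] = LZ4_compress_default((char *)raw, cdata[b], RAW, LZ4_compressBound(RAW));
    }
    free(raw);

    pthread_t tid[NTHREADS];
    struct worker w[NTHREADS];
    long total = 0;
    for (int t = 0; t < NTHREADS; t++) {
        w[t] = (struct worker){ .first = t * (NBLOCKS / NTHREADS),
                                .count = NBLOCKS / NTHREADS, .hits = 0 };
        pthread_create(&tid[t], NULL, scan_worker, &w[t]);
    }
    for (int t = 0; t < NTHREADS; t++) { pthread_join(tid[t], NULL); total += w[t].hits; }
    printf("matching rows: %ld\n", total);
    return 0;
}
```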
Even in the 90s I recall decompressing ZIP files on a 486 being limited by HDD speed. It felt like, even then, we were headed towards compressed memory systems once data could be compressed quickly enough.
I think quantization for large language models already does something like this: the parameters are compressed in memory and decompressed when performing the forward pass.
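Roughly what that looks like, as a toy sketch using simple per-row int8 quantization with a float scale (real LLM schemes such as 4-bit group quantization are more elaborate): the weights sit in memory at one byte each and are expanded to float only inside the matrix-vector product.

```c
#include <math.h>
#include <stdint.h>
#include <stdio.h>

/* y = W x, where W is stored as int8 with one float scale per row */
static void matvec_q8(const int8_t *wq, const float *row_scale,
                      const float *x, float *y, int rows, int cols) {
    for (int r = 0; r < rows; r++) {
        float acc = 0.0f;
        for (int c = 0; c < cols; c++)
            acc += (float)wq[r * cols + c] * x[c];   /* "decompress" on the fly */
        y[r] = acc * row_scale[r];                   /* apply the scale once per row */
    }
}

int main(void) {
    enum { ROWS = 4, COLS = 8 };
    float w[ROWS][COLS], x[COLS], y[ROWS], scale[ROWS];
    int8_t wq[ROWS][COLS];

    /* toy weights, then quantize each row: scale = max|w| / 127 */
    for (int r = 0; r < ROWS; r++) {
        float m = 0.0f;
        for (int c = 0; c < COLS; c++) {
            w[r][c] = sinf((float)(r * COLS + c));
            if (fabsf(w[r][c]) > m) m = fabsf(w[r][c]);
        }
        scale[r] = m / 127.0f;
        for (int c = 0; c < COLS; c++)
            wq[r][c] = (int8_t)lrintf(w[r][c] / scale[r]);
    }
    for (int c = 0; c < COLS; c++) x[c] = 1.0f;

    matvec_q8(&wq[0][0], scale, x, y, ROWS, COLS);
    for (int r = 0; r < ROWS; r++) printf("y[%d] = %f\n", r, y[r]);
    return 0;
}
```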
[1]: https://github.com/lz4/lz4