More

merb · 2026-02-24T06:49:25 1771915765

Swift and fil-c are only pseudo safe. Once you deal with the actual world and need to pass around data from memory things are always unsafe since there is no safe way of sharing memory. At least not in our current operating systems. Swift and fil-c can at least guard to some extent the api.

merb · 2026-02-23T21:52:00 1771883520

Your take is also a bad one. No what asml builds is not American technology. Why asml succeeded is because they got tons of company’s and people to help them advance the technology of the chip industry. Yes it wouldn’t be possible without the Americans. But it would also not be possible without the Europeans, the Koreans, etc… what asml did was basically ask the technology leaders in each field to build their best product so that they can take their parts and assembly this awesome piece of technology.

merb · 2026-02-19T06:33:55 1771482835

So it’s forbidden to use the Claude Mac app. I would say the ToS as it is, can’t be enforced

merb · 2026-02-18T18:33:00 1771439580

I more and more see a bug in my mouth that tries to encourage my boss to cancel Microsoft 365. I did not find the root cause yet

merb · 2026-02-13T15:25:14 1770996314

If that is the case why did minio start with the open source version? If there were only downsides? Sounds like stupid business plan

throwaway894345 · 2026-02-13T16:31:52 1771000312

They wanted adoption and a funnel into their paid offering. They were looking out for their own self-interest, which is perfectly fine; however, it’s very different from the framing many are giving in this thread of a saintly company doing thankless charity work for evil homelab users.

Ensorceled · 2026-02-13T17:31:41 1771003901

Where did I say there were only downsides? There are definitely upsides to this business model, I'm just refuting the idea that because there are for profit motives the downsides go away.

I hate when people mistreat the people that provide services to them: doesn't matter if it's a volunteer, underpaid waitress or well paid computer programmer. The mistreatment doesn't become "ok" because the person being mistreated is paid.

merb · 2026-02-14T18:01:16 1771092076

I doubt that minio pulled the open source version because they were mistreated. Really yeah there are some projects where this is a problem, but it’s mostly because the project only has a single maintainer.

People are angry about minio , but that’s because of their rugpull.

yencabulator · 2026-02-15T23:50:09 1771199409

The minio people did a lot of questionable things even before the rugpull. They tried to claim AGPL infects software over the network, on a previous version of https://min.io/compliance

> Combining MinIO software as part of a larger software stack triggers your GNU AGPL v3 obligations. The method of combining does not matter. When MinIO is linked to a larger software stack in any form, including statically, dynamically, pipes, or containerized and invoked remotely, the AGPL v3 applies to your use. What triggers the AGPL v3 obligations is the exchanging data between the larger stack and MinIO.

merb · 2025-12-28T19:07:56 1766948876

I had that problem as well. Especially when connected to two external monitors. I did not love the machine and the M1 Max was such a big upgrade because of that and I could upgrade to the m3 max later and give my m1 to somebody else. Both apple silicon machines are going strong for a long time I guess.

merb · 2025-12-10T19:53:07 1765396387

You can only use .net 4.8 when you create an outlook add-in.

I mean yes you can build it with native interop and aot. But then you would loose the .net benefits as well.

merb · 2025-12-08T19:34:47 1765222487

> 3.4 Lazy fsync by Default

Why? Why do some databases do that? To have better performance in benchmarks? It’s not like that it’s ok to do that if you have a better default or at least write a lot about it. But especially when you run stuff in a small cluster you get bitten by stuff like that.

aaronbwebber · 2025-12-08T20:25:00 1765225500

It's not just better performance on latency benchmarks, it likely improves throughput as well because the writes will be batched together.

Many applications do not require true durability and it is likely that many applications benefit from lazy fsync. Whether it should be the default is a lot more questionable though.

johncolanduoni · 2025-12-08T21:33:24 1765229604

It’s like using a non-cryptographically secure RNG: if you don’t know enough to look for the fsync flag off yourself, it’s unlikely you know enough to evaluate the impact of durability on your application.

traceroute66 · 2025-12-08T22:59:29 1765234769

> if you don’t know enough to look for the fsync flag off yourself,

Yeah, it should use safe-defaults.

Then you can always go read the corners of the docs for the "go faster" mode.

Just like Postgres's infamous "non-durable settings" page... https://www.postgresql.org/docs/18/non-durability.html

semiquaver · 2025-12-09T01:16:58 1765243018

You can batch writes while at the same time not acknowledging them to clients until they are flushed, it just takes more bookkeeping.

tybit · 2025-12-09T11:12:38 1765278758

I also think fsync before acking writes is a better default. That aside, if you were to choose async for batching writes, their default value surprises me. 2 minutes seems like an eternity. Would you not get very good batching for throughout even at something like 2 seconds too? Still not safe, but safer.

senderista · 2025-12-08T22:47:09 1765234029

For transactional durability, the writes will definitely be batched ("group commit"), because otherwise throughput would collapse.

otabdeveloper4 · 2025-12-09T12:39:14 1765283954

> Many applications do not require true durability

Pretty much no application requires true durability.

staticassertion · 2025-12-09T18:01:55 1765303315

Maybe what's confusing here is "true durability" but most people want to know that when data is committed that they can reason about the durability of that data using something like a basic MTBF formula - that is, your durability is "X computers of Y total have to fail at the same time, at which point N data loss occurs". They expect that as the number Y goes up, X goes up too.

When your system doesn't do things like fsync, you can't do that at all. X is 1. That is not what people expect.

Most people probably don't require X == Y, but they may have requirements that X > 1.

otabdeveloper4 · 2025-12-10T18:42:56 1765392176

For the vast majority of applications a rare event of data loss is no big deal and even expected.

staticassertion · 2025-12-12T04:27:29 1765513649

I think you're still not getting my point. Yes, a rare event of data loss may not be a big deal. What is a big deal is being able to reason about how rare that event is. When you have durable raft you can reason by using straightforward MTBF calculations. When you don't, you can keep adding nodes but you can't use MTBF anymore because a single failure is actually sufficient to cause data loss.

millipede · 2025-12-08T20:12:39 1765224759

I always wondered why the fsync has to be lazy. It seems like the fsync's can be bundled up together, and the notification messages held for a few millis while the write completes. Similar to TCP corking. There doesn't need to be one fsync per consensus.

aphyr · 2025-12-08T21:20:02 1765228802

Yes, good call! You can batch up multiple operations into a single call to fsync. You can also tune the number of milliseconds or bytes you're willing to buffer before calling `fsync` to balance latency and throughput. This is how databases like Postgres work by default--see the `commit_delay` option here: https://www.postgresql.org/docs/8.1/runtime-config-wal.html

to11mtm · 2025-12-08T22:03:44 1765231424

> This is how databases like Postgres work by default--see the `commit_delay` option here: https://www.postgresql.org/docs/8.1/runtime-config-wal.html

I must note that the default for Postgres is that there is NO delay, which is a sane default.

> You can batch up multiple operations into a single call to fsync.

Ive done this in various messaging implementations for throughput, and it's actually fairly easy to do in most languages;

Basically, set up 1-N writers (depends on how you are storing data really) that takes a set of items containing the data to be written alongside a TaskCompletionSource (Promise in Java terms), when your stuff wants to write it shoots it to that local queue, the worker(s) on the queue will write out messages in batches based on whatever else (i.e. tuned for write size, number of records, etc for both throughput and guaranteeing forward progress,) and then when the write completes you either complete or fail the TCS/Promise.

If you've got the right 'glue' with your language/libraries it's not that hard; this example [0] from Akka.NET's SQL persistence layer shows how simple the actual write processor's logic can be... Yeah you have to think about queueing a little bit however I've found this basic pattern very adaptable (i.e. queueing op can just send a bunch of ready-to-go-bytes and you work off that for threshold instead, add framing if needed, etc.)

[0] https://github.com/akkadotnet/Akka.Persistence.Sql/blob/7bab...

aphyr · 2025-12-08T22:08:56 1765231736

Ah, pardon me, spoke too quickly! I remembered that it fsynced by default, and offered batching, and forgot that the batch size is 0 by default. My bad!

to11mtm · 2025-12-08T22:31:58 1765233118

Well the write is still tunable so you are still correct.

Just wanted to clarify that the default is still at least safe in case people perusing this for things to worry about, well, were thinking about worrying.

Love all of your work and writings, thank you for all you do!

loeg · 2025-12-09T00:32:16 1765240336

In some contexts (interrupts) we would call this "coalescing." (I don't work in databases, can't comment about terminology there.)

kbenson · 2025-12-08T23:27:24 1765236444

That was my immediate thought as well, under the assumption the lazy fsync is for performance. I imagine in some situations, delaying the write until the write confirmation actually happens is okay (depending on delay), but it also occurred to me that if you delay enough, and you have a busy enough system, and your time to send the message is small enough, the number of open connections you need to keep open can be some small or large multiple of the amount you would need without delaying the confirmation message to actual write time.

senderista · 2025-12-08T22:53:15 1765234395

In practice, there must be a delay (from batching) if you fsync every transaction before acknowledging commit. The database would be unusably slow otherwise.

millipede · 2025-12-09T19:48:11 1765309691

Right, I think the lazy thing implies that it would happen post "commit" being returned to the client, but it doesn't need to be. The commit just needs to be wait for "an" fsync call, not its own.

mrkeen · 2025-12-08T21:19:06 1765228746

One of the perks of being distributed, I guess.

The kind of failure that a system can tolerate with strict fsync but can't tolerate with lazy fsync (i.e. the software 'confirms' a write to its caller but then crashes) is probably not the kind of failure you'd expect to encounter on a majority of your nodes all at the same time.

johncolanduoni · 2025-12-08T22:23:43 1765232623

It is if they’re in the same physical datacenter. Usually the way this is done is to wait for at least M replicas to fsync, but only require the data to be in memory for the rest. It smooths out the tail latencies, which are quite high for SSDs.

loeg · 2025-12-09T00:35:21 1765240521

> It smooths out the tail latencies, which are quite high for SSDs.

I'm sorry, tail latencies are high for SSDs? In my experience, the tail latencies are much higher for traditional rotating media (tens of seconds, vs 10s of milliseconds for SSDs).

johncolanduoni · 2025-12-09T01:18:34 1765243114

They’re higher relative to median latencies for each. A high end SSD’s P99/median is higher than a high end HDD. That’s the relevant metric for request hedging.

loeg · 2025-12-09T02:43:19 1765248199

It's approximately a factor of 1000x for both.

senderista · 2025-12-08T22:51:00 1765234260

You can push the safety envelope a bit further and wait for your data to only be in memory in N separate fault domains. Yes, your favorite ultra-reliable cloud service may be doing this.

thinkharderdev · 2025-12-08T19:43:53 1765223033

> To have better performance in benchmarks

Yes, exactly.

dilyevsky · 2025-12-08T20:54:22 1765227262

Massively improves benchmark performance. Like 5-10x

speedgoose · 2025-12-08T21:01:25 1765227685

/dev/null is even faster.

formerly_proven · 2025-12-08T21:15:03 1765228503

/dev/null tends to lose a lot more data.

onionisafruit · 2025-12-08T22:00:01 1765231201

Just wait until the jepsen report on /dev/null. It's going to be brutal.

orthoxerox · 2025-12-08T22:41:19 1765233679

/dev/null works according to spec, can't accuse it of not doing something it has never promised

cnlwsu · 2025-12-08T23:00:13 1765234813

durability through replication and distribution and better throughput to build up more within the window on a lazy fsync

merb · 2025-12-04T19:02:04 1764874924

sadly I hoped they add:

> Off-site replication of guests for manual recovery in case of datacenter failure.

which would've been an actual killer feature

jefurii · 2025-12-04T20:26:58 1764880018

You can use ZFS to replicate your VMs. IIRC each VM has its own ZFS dataset. There's probably a config file somewhere that you also need to replicate.

g4cg54g54 · 2025-12-07T23:49:56 1765151396

https://pve.proxmox.com/wiki/PVE-zsync is just that, also PBS (https://pbs.proxmox.com/wiki/Main_Page) has "live restore" what can start a kvm after just the first few megs are restored

merb · 2025-11-28T20:04:06 1764360246

I don’t think that http/3 is easier to implement than http/1.1 especially since h3 is stateful where http/1.1 is not. Especially not when everything should be working correctly and securely because the spec does not always tell about these things. Oh and multiplexing is quite a hard thing to do especially when you are also dealing with a state machine and each of your clients can be malicious.

codys · 2025-11-28T23:28:44 1764372524

I can't speak to http/3 (I haven't tried to impl it), but I can say that a bare-bones http/2 is very easy to implement because it doesn't try to pretend to be prose.