scv119's comments

scv119 · on Aug 11, 2023

The tweet refers to a paper that fine-tunes an English base model on a Chinese dataset. I'm not surprised by LoRA's poor results in this setup.

scv119 · on June 20, 2023

I believe you could slightly modify the FlashAttention kernel to implement the same kernel as PagedAttention, since both of them work on the key/value cache at the block level.

scv119 · on June 20, 2023

Pretty cool stuff and the results are amazing. Hoping we will see virtual memory get standardized in PyTorch or CUDA.

scv119 · on Nov 24, 2021

We wanted to ensure that everyone on the team had explored the full design space and agreed that the chosen solution was the best we could think of, given the information and constraints at the time.

Once we have done that, everyone is aligned and it doesn't matter who implements it; it will be implemented the same way.

scv119 · on Nov 24, 2021

Co-creator of Delos here. Let me give a concrete example for [18].

We have a "StorageEngine" interface with an implementation, "RocksDbStorageEngine", that wraps RocksDB to provide a key-value interface. However, we were worried that someone would leak implementation details through the "StorageEngine" interface (RocksDB is so powerful).

To solve the problem, we wrote another, fully in-memory storage engine implementation called "MemoryStorageEngine", which acts as a specification of the "StorageEngine" interface.

We run all storage engine tests against both implementations (RocksDB/memory), except for the durability part; we also configured the system to run on either storage engine.

By doing this:

- we can use the memory-based one to enforce the behavior of the RocksDB-based one by running unit tests against both implementations.

- no one can easily leak implementation details of RocksDB through the StorageEngine interface; to do that, you first need to add those advanced features to the memory-based StorageEngine as well!

This is just one example, and we have a list of such abstractions, such as MetadataStore, Logs, etc. We created multiple implementations of each of these interfaces and ensured the system could run on any combination of those implementations.
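
To make the StorageEngine example concrete, here is a minimal Python sketch of the idea, not the actual Delos code. The method names (`put`, `get`, `delete`), the `check_contract` helper, and `SqliteStorageEngine` are all assumptions for illustration; sqlite3 merely stands in for the RocksDB-backed engine so the sketch runs with the standard library alone.

```python
import sqlite3
from abc import ABC, abstractmethod
from typing import Optional


class StorageEngine(ABC):
    """Narrow key-value interface; callers see nothing beyond these methods."""

    @abstractmethod
    def put(self, key: bytes, value: bytes) -> None: ...

    @abstractmethod
    def get(self, key: bytes) -> Optional[bytes]: ...

    @abstractmethod
    def delete(self, key: bytes) -> None: ...


class MemoryStorageEngine(StorageEngine):
    """Plain dict; doubles as an executable specification of the interface."""

    def __init__(self) -> None:
        self._data = {}

    def put(self, key: bytes, value: bytes) -> None:
        self._data[key] = value

    def get(self, key: bytes) -> Optional[bytes]:
        return self._data.get(key)

    def delete(self, key: bytes) -> None:
        self._data.pop(key, None)


class SqliteStorageEngine(StorageEngine):
    """Hypothetical stand-in for the RocksDB-backed engine (sqlite3 here)."""

    def __init__(self, path: str = ":memory:") -> None:
        self._db = sqlite3.connect(path)
        self._db.execute(
            "CREATE TABLE IF NOT EXISTS kv (k BLOB PRIMARY KEY, v BLOB NOT NULL)"
        )

    def put(self, key: bytes, value: bytes) -> None:
        self._db.execute(
            "INSERT OR REPLACE INTO kv (k, v) VALUES (?, ?)", (key, value)
        )
        self._db.commit()

    def get(self, key: bytes) -> Optional[bytes]:
        row = self._db.execute("SELECT v FROM kv WHERE k = ?", (key,)).fetchone()
        return None if row is None else bytes(row[0])

    def delete(self, key: bytes) -> None:
        self._db.execute("DELETE FROM kv WHERE k = ?", (key,))
        self._db.commit()


def check_contract(engine: StorageEngine) -> None:
    """Shared test: every implementation must pass exactly the same assertions."""
    assert engine.get(b"k") is None
    engine.put(b"k", b"v1")
    assert engine.get(b"k") == b"v1"
    engine.put(b"k", b"v2")  # overwriting an existing key replaces the value
    assert engine.get(b"k") == b"v2"
    engine.delete(b"k")
    assert engine.get(b"k") is None
    engine.delete(b"k")  # deleting a missing key must be a no-op


# Run the same contract against both implementations.
for make_engine in (MemoryStorageEngine, SqliteStorageEngine):
    check_contract(make_engine())
```

In this sketch, exposing a backend-specific feature through `StorageEngine` would require adding it to the abstract class, which in turn forces `MemoryStorageEngine` to implement it before the shared tests pass.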

kcartlidge · on Nov 24, 2021

Great explanation :)

scv119 · on Nov 24, 2021

The first engineer on the Delos team here. This design decision gave us an edge: we delivered the first production cluster in 8 months by layering over existing systems, and we smoothly migrated all underlying implementations over the following 2 years without our customers being aware of it (0 downtime). This design decision is well captured by the OSDI '20 paper "Virtual Consensus in Delos".

throwthrow564 · on Nov 24, 2021

Very interesting. How did you deal with the issue where the old API had some feature X that wasn't supported in the new API but some customers depended on feature X?

scv119 · on Nov 24, 2021

I guess that's a hard question for a user-facing API: you need to support both the old and the new features during the migration.

In our case, this design principle mostly applied to our internal APIs. In particular, we designed our internal subcomponents (especially the consensus protocol) so that we could swap them without any downtime.
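
As a generic illustration of swapping an internal component behind a stable interface, here is a small Python sketch. It is not the mechanism Delos uses; the names `ConsensusEngine`, `OldEngine`, `NewEngine`, and `SwappableConsensus` are hypothetical, and a real consensus swap also has to hand over state, which this sketch ignores.

```python
import threading
from typing import Protocol


class ConsensusEngine(Protocol):
    """Hypothetical internal interface that every engine must satisfy."""

    def propose(self, entry: str) -> str: ...


class OldEngine:
    def propose(self, entry: str) -> str:
        return f"old:{entry}"


class NewEngine:
    def propose(self, entry: str) -> str:
        return f"new:{entry}"


class SwappableConsensus:
    """Stable facade: callers hold this object, never a concrete engine."""

    def __init__(self, impl: ConsensusEngine) -> None:
        self._impl = impl
        self._lock = threading.Lock()

    def propose(self, entry: str) -> str:
        # The lock ensures a proposal never straddles a swap.
        with self._lock:
            return self._impl.propose(entry)

    def swap(self, new_impl: ConsensusEngine) -> None:
        # Later proposals go to the new engine; callers are not restarted.
        with self._lock:
            self._impl = new_impl


consensus = SwappableConsensus(OldEngine())
before = consensus.propose("x")
consensus.swap(NewEngine())
after = consensus.propose("x")
```

Because callers depend only on the facade, the swap is invisible to them as long as both engines honor the same interface.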

haimez · on Nov 24, 2021

Right. So it seems you did a lift and shift over two years without changing the basic API surface. That’s ok, but stability of requirements throughout a rearchitecting project is a luxury and not the norm.

scv119 · on April 20, 2018

Agreed, technology-wise FDB and TiDB look almost the same.

scv119 · on April 7, 2014

Sorry, there is no DECENT internet connection in China

seanmcdirmid · on April 7, 2014

And yet, I still am able to work here. Barely, but still (helps that we have our own line from the office).