alucard055's comments

alucard055 · 2024-04-25T16:26:53

from sec 3.2 in the paper

"Due to our choice of using passive replication, mutations are executed on a primary node before being committed into the trans- action log. If a commit fails, for example due to network isolation, the change must not be acknowledged and must not become visible. Other database engines use isolation mechanisms like Multi-Version Concurrency Control (MVCC) to achieve this, but Redis data struc- tures do not support this functionality, and it cannot be readily decoupled from the database engine itself. Instead, MemoryDB adds a layer of client blocking. After a client sends a mutation, the reply from the mutation operation is stored in a tracker until the transaction log acknowledges persistence and only then sent to the client. Meanwhile, the Redis workloop can process other operations. Non-mutating operations can be executed immediately but must consult the tracker to determine if their results must also be delayed until a particular log write completes. Hazards are detected at the key level. If the value or data-structure in a key has been modified by an operation which is not yet persisted, the responses to read operations on that key are delayed until all data in that response is persisted. Replica nodes do not require blocking as mutations are only visible once committed to three AZs"

panarky · 2024-04-25T16:58:31

> the reply from the mutation operation is stored in a tracker until the transaction log acknowledges persistence

What are the consistency and durability properties of the tracker datastore?

Are replies from tracker mutations stored in a tracker-tracker until the tracker-transaction-log acknowledges persistence?

Is it trackers all the way down?

Onavo · 2024-04-26T00:12:17

Strong consistency at the expense of availability.

Bjartr · 2024-04-25T23:35:10

Maybe it could be done round-robin?

swasheck · 2024-04-25T18:53:47

this sounds mostly like how MS implemented synchronous availability groups in sql server

convolvatron · 2024-04-25T16:38:59

this is kind of a strange design. in order to support this we build an external model of the database for all potentially conflicting state. wouldn't it be easier to make a redis lookalike that supported high isolation levels?

jitl · 2024-04-25T16:58:43

The external model is much simpler to do for Redis than a database with complex cross-key queries like an SQL database. Redis has scans but no queries or “real” transactions with rollback. To me it sounds like more work to implement MemoryDB and then additionally re-implement the Redis interior than to implement MemoryDB and use the off-the-shelf Redis.

I also think that the decoupled design is kind of elegant, it allows the logical implementation to be developed independently of the durability bits. It’s open-core but someone else is building the core.

zadokshi · 2024-04-25T22:12:32

Yes, it does seem like all that effort to wrap redis is wasted, and that it’d be easier to just create their own. It isn’t clear to me why this is better than creating their own. Does anyone know why they would go the route of trying to wrap something around redis rather than just replace it with something that has a redis compatible API?

ec109685 · 2024-04-25T22:12:19

They also likely have technology / libraries / expertise that make building the tracker straightforward.

jchrisa · 2024-04-25T22:57:42

It's smart because you have deep control of bugwards compatibility, and can swap parts of the stack later.