One thing you can do is use :ets.slot/2 to do random probing of the cache and ev...

gnomeduck · on July 12, 2022

How does Redis do it now in 6.0 and later?

KMag · on July 12, 2022

Why not use a min heap to track expiry times? (Or a doubly-linked list, if lifetimes are uniform?)

bluesnowmonkey · on July 12, 2022

The min heap would need to live in the process heap of a particular process. It would only be accessible by that process, so other processes would need to interact with it by sending messages to the "min heap" process and waiting for responses. The min heap process would became a massive throughput bottleneck at much lower QPS than ETS, just due to the overhead of processing messages. This would be exacerbated by GC pauses as the min heap grows larger than around 100k or 1M entries, whereas an ETS table can handle 100M entries or more (never hit a limit actually).

Large, mutable, shared state like a cache works much better if it lives in ETS. Any process can read/write to a properly configured table in a few microseconds. Scales very well to large data sizes and high QPS.

The random probing approach has the disadvantage that it stores expired entries longer than necessary. This either wastes memory or reduces cache hit rate. Either of these seem preferable to limiting throughput and maximum entry count, as the min heap would.

sethammons · on July 12, 2022

that was my immediate first question