Is that looking at the symbols table of the binary, they have embedded the entirety of the Clickhouse database as their processing engine.
I hope they do open source it at some point. The more I think about it, the more I like the idea of having a transaction database do analytic predicate pushdown by just transparently querying an actual OLAP database.
We will have more detail explained recently.
It will be open sourced in a year or two. For us we need to make the code open-source ready instead of just turn on github settings.
What is emerging in HTAP is two patterns: scale-up like HANA and scale-out like TiDB 4.0. The engine/system in both cases transparently handles the merge between the OLTP delta row store and the OLAP column store (AutoETL) and there is a transparent federated query that is aware of both store types.
Does Presto or another scale-out solution transparently perform these two HTAP functions?
Almost all the data lake based products loss full control over storage system. It makes them very hard to build delta-main engine we need. To make HTAP storage transparent to query layer, TiFlash need a lot more control over storage engine than data lake can provide.
There are database products like MongoDB with closed source. I was curious about the author's reasoning in this particular case. E.g. apart of transparency and community contributions, etc