esmet's comments

esmet · on Oct 19, 2015

It's unrelated work. This is the "fractal tree" in question, open source: https://github.com/Tokutek/ft-index

esmet · on Feb 13, 2014

Interesting perspective, but it turns out that the opposite is true. Using better indexing is a general improvement to database performance and manageability.

nasalgoat · on Feb 13, 2014

His argument was that the indexing they do only improves a specific type of query. Personally I think he's wrong but that was his view a year or so ago.

esmet · on Feb 13, 2014

I see. To clarify, TokuMX's indexing technology doesn't try to improve specific queries or patterns - it simply makes general index maintenance significantly cheaper and less space intensive, so your application can define the indexes it needs, not just the ones it can afford.

esmet · on Sept 13, 2013

It's at the database level now.

esmet · on June 20, 2013

There have been very informal talks with the folks at 10gen. The topic of getting merged or not will ultimately come down to what users want. Commercialization will start with support, much like what 10gen does for regular MongoDB and what we do for TokuDB.

As for licensing, TokuMX embraces the spirit of open-source and we're confident our open-source licence plays well with the AGPL.

esmet · on June 20, 2013

LZ4 is on our road-map for experimentation. I could see a faster decompressor really helping when you have high-end flash storage.

esmet · on June 19, 2013

TokuMX is indeed a complete product, not an experiment. We're proud of it and ready to support it yesterday :)

I completely agree that pluggable storage engines are an interesting topic. But we went the integrated route (ie: no storage API) probably for the same reason: things get simpler and easier to implement when the stack is shorter.

esmet · on June 19, 2013

Most users should expect a big reduction in disk space. Depending on your data and level of fragmentation with regular MongoDB, you could see ~4x-20x reduced disk usage.

esmet · on June 5, 2013

"Fractal Tree" is indeed a marketing trademark. But the algorithms used are not necessarily secret. The implementation is similar to the cache oblivious streaming btree: http://supertech.csail.mit.edu/papers/sbtree.pdf‎

Of course what's not described in the paper include: transactions, mvcc, logging/recovery, parallelism, etc. Those are really interesting topics as well.