YCSB Benchmark Results for YugaByte and Apache Cassandra (yugabyte.com)
24 points by tech_tool on Nov 11, 2017 | 20 comments


Just as a quick comment, the GC configuration on those Cassandra nodes is kinda crazy. I don't think anyone recommends running a 30G max heap with CMS. Somewhere around 16G you really should switch to G1GC.

Also, I highly doubt you want a 30G heap when you have 60G RAM available. A larger heap most likely doesn't help you at all; it just eats away memory you could use for mmapping or off-heap things, and it increases GC pause length.

(This is the bane of running any JVM apps: you really need to know what you are doing with the GC. And with apps like Cassandra there is no silver bullet, just some general guidelines to follow. Slowly adjust your values and monitor the changes, since the best possible settings vary with the load/hardware setup.)
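
To make it concrete, a saner starting point in Cassandra's jvm.options would look something like this (values are illustrative, not what the benchmark used; tune for your own load):

    # Cap the heap well below total RAM; leave the rest for the
    # page cache and off-heap structures (e.g. on a 60G box):
    -Xms16G
    -Xmx16G

    # Past ~16G heaps, prefer G1 over CMS:
    -XX:+UseG1GC
    -XX:MaxGCPauseMillis=200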


Great points, thanks @doikor! Will try these.

It would be nice if some of these were the defaults as opposed to some settings folks have to change - say by running a "./configure" or some such.

In this case, for example, it felt a little weird that the machine type with enough cores had 60GB RAM, and giving only around 10-20GB to Cassandra "felt" wrong - but it turns out the exact opposite is true! YugaByte is in C++ and can handle much larger heaps.


Java's garbage collected heap is not the full extent of memory used by Cassandra. Cassandra can use as much memory as you can throw at it.

File system cache, in-process decompressed page cache, key and row cache, off-heap memtables, etc. There are several different mechanisms for using memory that isn't on the garbage-collected heap.

^ To be clear, that's just a list of mechanisms, not a qualitative assessment of how well Cassandra uses memory.
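
For concreteness, several of those knobs live in cassandra.yaml (option names as in the 3.x config; the sizes here are only examples):

    # Keep memtable contents off the GC heap:
    memtable_allocation_type: offheap_objects

    # Key and row caches (per-node sizes, illustrative):
    key_cache_size_in_mb: 512
    row_cache_size_in_mb: 1024

    # Cache for decompressed SSTable chunks:
    file_cache_size_in_mb: 2048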


They pretty much did the opposite of what Cassandra guidance recommends for GC tuning.


G1GC would work wonders on this setup; I guess this is why it was benchmarked with CMS. I just finished a project where the biggest performance-tuning win was moving the GC to G1GC.
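
If anyone wants to verify a migration like that, Java 8's GC logging flags make the pauses visible (the log path is just an example):

    -Xloggc:/var/log/cassandra/gc.log
    -XX:+PrintGCDetails
    -XX:+PrintGCDateStamps
    # Reports total stop-the-world time, not just GC phases:
    -XX:+PrintGCApplicationStoppedTime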


Throughput improvement is nice but not game-changing. What I would be more interested in here is a comparison of the longest latency. One problem my former employer had in a large Cassandra deployment was GC pause times causing latency spikes exceeding SLAs for a critical service. They did eventually tune the cluster to avoid these issues but to be honest it kind of shook my faith in any JVM-based database (maybe unfairly).


Hi @jeremyjh - you are absolutely correct about p99 being more important than avg latency for critical services.

We have done the following p99 measurements:

- a p99 comparison of YugaByte vs Cassandra using YCSB

- ran ndbench for a week and looked at the p99 of YB

The results are looking really good for YugaByte in both cases. We are going to post that very soon to our forum, will x-post here as well.
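
For anyone who wants to reproduce that kind of measurement themselves, stock YCSB can report percentile latencies via HdrHistogram; the hosts and workload below are placeholders:

    ./bin/ycsb run cassandra-cql -P workloads/workloada \
        -p hosts=10.0.0.1,10.0.0.2 \
        -p measurementtype=hdrhistogram \
        -p hdrhistogram.percentiles=95,99,99.9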


Our ndbench results for YugaByte DB were just posted on our forum https://forum.yugabyte.com/t/ndbench-results-for-yugabyte-db.

The gist: we ran ndbench for 7 days and our p99 stayed under 6 ms with no "long tail". This is the result of deliberate architectural and implementation choices. More details are available in our post.


For those who are interested, our ndbench results post has just been moved to a new location: https://forum.yugabyte.com/t/yugabyte-db-p99-latencies-with-...


Some new updates on YCSB and p99 latencies. https://forum.yugabyte.com/t/ycsb-benchmark-results-for-yuga....

In our new benchmark run, we set up Apache Cassandra with G1 GC and a smaller heap size, as suggested. Even so, we still observed significant advantages in both throughput and p99 latencies for YugaByte over Apache Cassandra.


If they could tune the GC to enforce predictable pauses (which is common practice enough afaik), why would that shake your faith?

The JVM cannot know in advance if you prefer throughput or predictability.
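
You have to tell it which one you want (standard HotSpot flags; the pause target is illustrative):

    # Favor throughput (fewer, longer pauses):
    -XX:+UseParallelGC

    # Favor predictability (give G1 a pause budget):
    -XX:+UseG1GC -XX:MaxGCPauseMillis=50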


They managed to tune it to an acceptable level in our use case, but we'd still see 50ms variation in the long-tail responses. I think there are applications for which that wouldn't be acceptable. Like I said - it may not be fair, but the impression I walked away with is that I don't want GC in my database.


GC development is not standing still. Oracle claims sub-10ms pause times and terabyte heaps with their new ZGC. And there are good pauseless or almost-pauseless alternatives from other vendors as well.

https://www.google.pl/amp/s/www.infoworld.com/article/323539...
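
For reference, enabling it on JDK 11 is a couple of flags (ZGC was still experimental then, hence the unlock flag; MyApp is a placeholder):

    java -XX:+UnlockExperimentalVMOptions -XX:+UseZGC -Xmx16g MyApp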


thanks for the clarification, I understand better now.


http://www.dbms2.com/2013/03/18/dbms-development-marklogic-h...

> Rule 1: Developing a good DBMS requires 5-7 years and tens of millions of dollars.

> That’s if things go extremely well.

> Rule 2: You aren’t an exception to Rule 1.

> In particular:

> Concurrent workloads benchmarked in the lab are poor predictors of concurrent performance in real life.

> Mixed workload management is harder than you're assuming it is.

> Those minor edge cases in which your Version 1 product works poorly aren't minor after all.


Would love to see how this compares to ScyllaDB.


Exactly, given how ScyllaDB, Cassandra, and now Yuga are all meant to be drop-in compatible (in theory, anyway).

Having seen the benchmarks for ScyllaDB vs Cassandra, I think the crown easily goes to Scylla. That shouldn't be a big surprise; they have some amazing engineers on that team, including some of the creators of KVM. They wrote their own TCP/IP stack for Scylla, which can run in kernel-bypass mode.

They also do some fascinating things like batching similar internal operations to get better instruction-cache performance. I wouldn't have thought the I-cache would ever be a bottleneck for a database. It speaks to how highly tuned Scylla is: the other performance bottlenecks have been dealt with to such a degree that I-cache behavior became a problem worth tackling.
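
Not Scylla's actual code, but the general shape of type-batching for I-cache locality looks something like this (all names here are made up for illustration):

    #include <vector>

    enum class Kind { Read, Write };
    struct Op { Kind kind; };

    // Imagine each handler is a large, distinct code path:
    void do_read(const Op&)  { /* heavyweight read path */ }
    void do_write(const Op&) { /* heavyweight write path */ }

    void process(const std::vector<Op>& ops) {
        // Interleaved dispatch (read, write, read, ...) keeps evicting
        // one handler's instructions to make room for the other's.
        // Batching by type runs one handler over many ops in a row,
        // so its instructions stay hot in the I-cache:
        std::vector<Op> reads, writes;
        for (const auto& op : ops)
            (op.kind == Kind::Read ? reads : writes).push_back(op);
        for (const auto& op : reads)  do_read(op);
        for (const auto& op : writes) do_write(op);
    }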


Scylla is far from being a drop-in replacement for Cassandra. Too many features are missing, and many basic features, like counters, were only added very recently.


We replicated the 10x claim in benchmarks we did on our deployment. ScyllaDB has some amazing optimizations under the hood, such as disk I/O QoS, which keeps compactions from significantly impacting reads and writes. This is a killer feature in update-heavy workloads.


YugaByte (https://github.com/YugaByte/yugabyte-db) is an open-source database built for cloud-native operations in mission-critical applications.



