Build Highly Available MySQL Applications Using Amazon Aurora Multi-Master

ianamartin · on Aug 9, 2019

People need to stop writing stuff about distributed systems if they won’t explicitly talk about the P in CAP theorem.

What happens here in the event of a network partition? There are several failure modes possible if I’m reading the diagram right. Network failure between the Aurora node and the shared storage layer.

I would expect that to result in a failed transaction and rollback, right? That’s not really all that available though, is it?

What about a network partition between availability zones? Maybe your local thinks it has a quorum because it can no longer see a different AZ? What happens when the partition is healed? How do the nodes reconcile? What if you’re working with multiple AZs and you have application nodes in each AZ writing to a cluster of nodes that all think they have a quorum because they can’t see each other?

What happens to those writes after the partition? Is every transaction terminated rolled back because us-east-1 can’t see us-west-3?

If they aren’t, how do the nodes achieve consensus? Are we doing a mongoDB thing here and just tossing everything after the last shared state? And even if that does work out,

What’s going on? And how is it achieved.

Look, if you need transactions at all, you most likely need isolation level: serializable. And that’s incredibly hard to get right.

Many systems really don’t need any of that at all, but when you need it, it has to be right.

This doesn’t address any of the really hardest parts of distributed systems. It doesn’t address why I would take any application at all that stores data in a RDBMS and use this.

I’m sure everyone there on that team is working hard. But people have to stop writing about distributed systems like this.

redis_mlc · on Aug 9, 2019

The above comment is, well, not very informed.

CAP is a simple, generic academic model. Researchers and implementers moved on from that a long time ago.

In the case of Google, they have partition-free networks for newer databases, so CAP rarely applies.

Galera/Percona Cluster has solutions for all the above also. Generally multi-master writes continue, and the behind server either catches up or gets automatically rebuilt.

MySQL 8 has something equivalent to Galera.

I would recommend testing (or waiting) a year for multi-master Aurora to bake. In the meantime, you can use Percona Cluster if you can't wait.

You can learn more here about the state-of-the-art in multi-region, multi-master RDBMS' here:

http://www.jebriggs.com/blog/2017/11/internet-latency-and-mu...

Source: DBA who's operated Percona Cluster for 5+ years. Contact me if you need extreme database performance or availability.

zzzcpan · on Aug 9, 2019

> In the case of Google, they have partition-free networks for newer databases, so CAP rarely applies.

No, asynchronous networks cannot physically be partition-free. They are actually always partitioned.

> Galera/Percona Cluster has solutions for all the above also

They don't have solutions to all of the above.

SahAssar · on Aug 11, 2019

What exactly is a "partition-free network"? In my understanding no such thing exists, a partition can happen even if all the servers run on the same physical hardware (because of networking stacks, bugs, misconfiguration and so on), and even if they think that no such thing can happen if runs for eternity it will eventually happen. The question that CAP asks is what happens when a partition happens? Does the DB

C: Stay consistent but not available

A: Stay available but not consistent

There is no other option.

kjeetgill · on Aug 9, 2019

I don't strictly disagree with you, but the parent has a point. It's not a technical outline of the difficult tradeoffs distributed systems must make[0], rather an ad copy where they can promise you the world and it's hard to tell what's exactly true.

Even igoring Amazon, MySQL/Percona's mha story has had a history of buggy, weak, imprecise promises.

[0] I guess Google level special sauce gets a pass since they have the most published about specially clocked hardware, and how it changes DB design.

ianamartin · on Aug 11, 2019

This is an incredible comment.

You have some real chutzpah, sir. I respect that. You are as wrong as can possibly be, but this is a bold move.

zzzcpan · on Aug 9, 2019

Amazon is likely the only one who can do this well today and they explain consistency in the post.

ianamartin · on Aug 11, 2019

I don't think Consistency was explained in the post. But that may be a weakness on my part.

PeterZaitsev · on Aug 9, 2019

Amazon Aurora magic is in storage is the case at large extent because RDS was done on the EBS storage which is super inefficient for database workloads. If you use NVMe storage you often get much better performance... and when you can use Database Level replication for high availability https://www.percona.com/blog/2017/11/28/best-practices-perco... But even if you use EBS the Magic of RocksDB/MyRocks may be better than magic of optimized storage as this blog post shows https://www.percona.com/blog/2019/07/17/assessing-mysql-perf...

joecot · on Aug 9, 2019

The important line is at the very end:

> At launch, Aurora Multi-Master supports two node clusters in a single Region. Support for more writer nodes and placement of writers in multiple Regions is planned for future releases.

Great to see more high availability options, but yet again it's just in the same region. When we see the East region go down entirely more often than single availability zones. At least this time they mention that limitation in the announcement and an intention to fix it later. In the mean time I'll stick to our multi-region Galera setup.

etaioinshrdlu · on Aug 9, 2019

How cool would it be if we could use multi-master in combination with Parallel Query, global replication, and the serverless option???

ReD_CoDE · on Aug 9, 2019

Hi friends, can I ask another question related to MySQL? And maybe AWS!

I have a lot of spreadsheets and PDFs as a silo, and I want be able to build a searchable MySQL database from those documents. Does it possible? How?

SahAssar · on Aug 11, 2019

Sounds like you are looking more for a search engine (like Elastic for example), not a classic RDBMS.

ReD_CoDE · on Aug 12, 2019

I think search engine is not a bad idea, but a researchable database (for example MySQL) whould be better

Here is an example for spreadsheets, but I think spreadsheets is easy, because there are some solutions like MySQL workbench.

https://search.equityfront.com/

But for PDFs which has unstructured and semi-structured data and vectors, it's hard to find a good solution to extrac/convert them into MySQL

vijaybritto · on Aug 9, 2019

This master slave wording has to change in tech. It's 2019

parliament32 · on Aug 9, 2019

Master/slave is a perfectly good description of the roles. If we were titling employees master/slave that might be a problem, but this is software ffs.

This faux-indignation puffery has to change. It's 2019.

rat9988 · on Aug 9, 2019

I have never found it offensive.

vijaybritto · on Aug 9, 2019

Good for you. But it's not appropriate

rat9988 · on Aug 9, 2019

It is in my opinion.

polskibus · on Aug 9, 2019

I wish it was open source like MySQL.

jinqueeny · on Aug 9, 2019

Shameless self-plug: If you are looking for an open-source scale-out MySQL that supports both multiple writes and multiple reads, feel free to give TiDB (https://github.com/pingcap/tidb) a try.

xellisx · on Aug 9, 2019

Galera is another multi master solution that sits on top of MySQL or MariaDB.

aloknnikhil · on Aug 9, 2019

Wouldn't matter. Aurora's value-add comes from its storage nodes. They would have to open source that too.