Docker Storage: An Introduction

rburhum · on July 22, 2016

Glad to see more articles like this. I find tons of writings about stateless containers, but hardly any about best practices for stateful containers. Just yesterday I was going over the Django tutorial in the official Docker docs. Everything there made sense except that it completely ignores how to handle "media" folders (i.e. the Django folder that, among other things, contains user uploads). Yes, the database is in a volume so I am glad that survives creating/destroying the postgres container, but I kind of need the other files, too.

objectivefs · on July 22, 2016

For stateful containers that need access to a shared folder you can also use a shared file system such as GlusterFS (http://www.gluster.com), EFS (https://aws.amazon.com), or ObjectiveFS (https://objectivefs.com). This keeps your state in one location and all your containers can access it through the regular file system interface, i.e. you can start using it before you have rewritten everything to be cloud native.

rburhum · on July 23, 2016

Thank you. We have been using glusterfs since we found those tutorials sometime back. I wish the default "getting started with docker" tutorial included this :-/

atmosx · on July 22, 2016

Well in today's world you'll handle the media folder by hosting files in the cloud (e.g. an S3 bucket).

Most of the times if you use volumes, it's because of poor design[1] than anything else.

[1] modern web-app design guidelines: http://12factor.net/

cyphar · on July 22, 2016

You're just punting on the problem. Now storage is someone else's problem, who can't use containers because you decided to dump your state on them. Storage with containers is something that we need to solve properly (and Docker isn't doing a great job right now).

Annatar · on July 23, 2016

Storage with containers is something that we need to solve properly

...Or you could just use SmartOS zones, and then all these artificial problems go away, as zones natively reside on the ZFS filesystem inside of the zpool underneath them. Why would one want to waste one's time on technology which is clearly not finished and doesn't offer any advantages over zones? As a logical being, I'm genuinely perplexed by the overall insistence on Docker. What is the cause of it?

cyphar · on July 24, 2016

Here we go again.

As we've discussed before, there are problems that are inherent to designing distributed systems that are not solved by SmartOS's magic sauce. Please stop pretending that every possible problem that is faced by GNU/Linux technologies has already been solved by SmartOS, it's getting quite tiresome.

"Storage with containers" refers to correctly decoupling the state from your application and storing it a way that allows for horizontal scaling. It's not as simple as proclaiming "we have a magical filesystem, therefore all problems are solved and you should use SmartOS kthxbai". There are harder problems here, and no amount of SmartOS shilling will get around that.

Annatar · on July 24, 2016

You completely ignored my question: what is the cause of insistence on Docker?

Docker storage does not solve the scaling problem either, indeed, no known technology in existence solves it: this is one of the unsolved problems in computer science, still largely terra incognita. The only way to design for horizontal scalability is in the application, where the application keeps track of the global state across all nodes it runs on. Even if one were to use distributed storage engine like Oracle RAC with the automatic storage manager and synchronous multimaster replication, one's application would still have to contain logic about ejecting/re-integrating failed RAC nodes, as well as re-trying the transaction on a different node, and that is in addition to internal loadbalancing it would have to perform, especially if the goal is high availability with exactly zero downtime or zero loss of service at all times.

As far as my "pretending", show me one problem which GNU/Linux has solved that SmartOS already hasn't. Just one.

In closing, I mentioned SmartOS on purpose: perhaps someone reading this will look it up, and try it out. And perhaps they'll like it (Linux became popular the same way). What I get out of it eventually is a job market with opportunities, but most importantly, I get to sleep through the night without having to deal with idiotic problems in GNU/Linux solved anywhere from 11 to 25 years ago in SmartOS (depending on the problem). I really do not want to waste any more time on Linux, and certainly not on Docker. SmartOS can do Docker, just so you know, although it doesn't need it to provide containerization, at all, but one does have that choice.

So if you really want to run Docker at all costs, which advantages does GNU/Linux offer you over SmartOS? Let's talk technology.

cyphar · on July 26, 2016

> what is the cause of insistence on Docker?

Because it targets developers and is based on technology usable on a very popular platform. SmartOS zones have neither of these properties.

> show me one problem which GNU/Linux has solved that SmartOS already hasn't

Packaging software and software updates. SmartOS uses NetBSD's pkgsrc, which is an interesting choice given the fact that BSDs have a very bad track record for packaging. FreeBSD still doesn't package their base system, and the recommended way of dealing with software on BSD systems is to compile it from source. GNU/Linux has solved this problem such a long time ago that I'm honestly surprised that SmartOS decided to use pkgsrc over something much more powerful like zypper+rpm, dnf+rpm or apt+dpkg.

So there's one problem that SmartOS didn't solve first, and I'd argue hasn't solved yet either. To be clear, I have no problem with SmartOS -- it has a lot of very deep technology. But claiming that it's magic pixie dust that has solved every problem that GNU/Linux has solved (and is working on solving) is being facetous and dishonest.

Also, as far as I'm aware there isn't nearly as much auditing of SmartOS going on as there is of GNU/Linux. Does SmartOS have support for all the features of grsecurity+PaX? What about support for UEFI? Or even support for many different architectures and hardware drivers? GNU/Linux may have many problems, but it beats every other free software operating system in quite a few areas (and beats a few proprietary operating systems in quite a few other areas too).

> SmartOS can do Docker, just so you know, although it doesn't need it to provide containerization, at all, but one does have that choice.

Yes, I know that. I currently am working as part of the OCI on container standardisation and am happy to see people working on that from the Solaris camp (we need to work together on standardising the workflows we want to use). But the one thing that people I work with don't do when working on container standards and canonical implementations of those standards is start screaming about how everyone should abandon a very popular platform because "I hate supporting GNU/Linux because it's not the operating system I like". Because that's just childish.

Annatar · on July 27, 2016

Packaging software and software updates. SmartOS uses NetBSD's pkgsrc, which is an interesting choice given the fact that BSDs have a very bad track record for packaging. FreeBSD still doesn't package their base system, and the recommended way of dealing with software on BSD systems is to compile it from source.

Compiling from source? pkgsrc, and by extension SmartOS, fully supports installing binary packages. The command to do this is called pkg_add. pkg_rm uninstalls a binary package. SmartOS even went a step further and uses pkgin, which works exactly the same way as apt-get. Please read pkg_add's and pkgin's manual pages before arguing further on this point.

Also here is a short document which clearly illustrates how a binary package is created and installed with pkgsrc:

http://www.perkin.org.uk/posts/creating-local-smartos-packag...

and here is another document pointing out how SmartOs has a nearly 14,000 package library of the latest version of software which normally runs on Linux; versions of which are usually newer than on Linux, and it is built fresh every day into binary packages, completely automatically:

https://www.perkin.org.uk/posts/building-packages-at-scale.h...

So there's one problem that SmartOS didn't solve first, and I'd argue hasn't solved yet either.

And based on your response, I'd argue that you haven't seen or used anything but Linux.

I'm far from being dishonest, and in fact I communicated exactly what my motivation is, and I'm sorry but it's really not my fault you haven't read SmartOS and Solaris documentation. That would be akin to me claiming that Linux is better than FreeBSD but without knowing enough about FreeBSD.

For example, your question about grsecurity+pax is nonsensical in the context of SmartOS: being Solaris based, it doesn't need Linux specific technology like grsecurity. That was precisely my point, if you use a well designed system, all of this Linux hacked-up nonsense goes away, because it was nonsense to begin with. SmartOS has a secure kernel by design, and further delineation can be achieved with role based access control.

It also doesn't need a lightweight virtual machine consisting of a single application because it has zones, and also because there would be nothing to reap your process at the end, and you'd end up with a zombie, an unreaped orphan process. I'm surprised that you don't know that Docker had to re-invent a kludgy copy of init precisely because of this problem, or else you would not have asked me that.

As for insisting on Docker because it's popular, it's flawed picking something based on popularity; if you pick a solution, it should be because it's technically sound and therefore robust, so that you can sleep through the nights when you're on-call without incidents, and read newspapers and drink your espresso during the day because the damn thing just runs and runs without needing any babysitting, like Linux does all the time.

To claim that SmartOS has no development tools, when it readily offers all the popular frameworks, languages, and has the most advanced linkers and compilers in existense us beyond obscene, I'm afraid. Please read the manual pages on the link editor, ld, and the Sun Studio compilers to get an inkling of what I'm writing about. It will tremendously help the quality of our discussion.

cyphar · on Aug 1, 2016

> your question about grsecurity+pax is nonsensical in the context of SmartOS

... no? Because many of the grsecurity+pax improvements apply to any kernel that runs on a CPU (it provides active protections against certain forms of kernel vulnerabilities caused by bugs). This includes illumos, thus the question is valid. Unless you're claiming the illumos cannot ever have a security bug.

> pkgsrc, and by extension SmartOS, fully supports installing binary packages.

Apologies, I was thinking of a different packaging system in FreeBSD. However, from my reading of the blog you linked pkgsrc only had support for signatures of packages in 2014. GNU/Linux has had this for a very long time.

My impression about package management being a shit-show on other operating systems is that every single podcast or blog post I read about those operating systems is celebrating that "package management is easy now with pkg" -- while it's actually not IMO as good as certain GNU/Linux package managers.

> and here is another document pointing out how SmartOs has a nearly 14,000 package library of the latest version of software which normally runs on Linux; versions of which are usually newer than on Linux,

Only 14000? Also, what distribution of GNU/Linux, what version of the distribution, how much automated QA happens before releases, etc?

> I'd argue that you haven't seen or used anything but Linux.

Untrue.

> but it's really not my fault you haven't read SmartOS and Solaris documentation.

It's not my fault that you haven't read the source code of runC without stating an ignorant opinon about how it works, based on a mix of outdated information and pure fabrication.

> That would be akin to me claiming that [...]

SmartOS is better than GNU/Linux without knowing about the Linux technology you're arguing about?

> I'm surprised that you don't know that Docker had to re-invent a kludgy copy of init precisely because of this problem, or else you would not have asked me that.

This is all not true, and you should know better. Docker/runC doesn't have an init process in containers. You _can_ run an init process, but it isn't necessary. In addition, the zombie problem doesn't exist because of sub-reapers which are a Linux kernel feature. You might not accept the existence of such features, but that's your perogative.

I asked you because I guessed that you didn't know about the GNU/Linux side of things. Thanks for not letting me down on that one.

> As for insisting on Docker because it's popular, it's flawed picking something based on popularity;

I answered your question of why Docker was popular, now you're complaining that I am talking about Docker because it's popular? What. In addition, I am a maintainer of runC and actually care much more about the OCI than Docker. There are some cool things coming from the Solaris and illumos folks, too bad that you're stuck in your ways and won't even consider the possibility that any GNU/Linux technology is good. That's just insane*.

> To claim that SmartOS has no development tools,

... when did I claim that? I claimed that it doesn't have anything like Linux when it comes to security frameworks like grsecurity+pax, UEFI support, hardware and driver support. You haven't addressed those arguments (claiming that grsecurity+pax isn't useful for SmartOS is showing that you don't know what it is or how it works).

Annatar · on Aug 1, 2016

.. no? Because many of the grsecurity+pax improvements apply to any kernel that runs on a CPU (it provides active protections against certain forms of kernel vulnerabilities caused by bugs). This includes illumos, thus the question is valid.

grsecurity is a set of patches for the GNU/Linux kernel. illumos is based on Solaris, so the entire argument about grsecurity is nonsense. illumos uses red zones to prevent buffer overflows, which grsecurity is attempting to address. grsecurity is a commercial product by the way. A single license costs $19,000 USD.

Enhanced auditing and process control which grsecurity provides have been part of Solaris, and therefore illumos, since Solaris 10, some even earlier. GNU/Linux is still playing catch-up, and as long as people like myself, Bryan Cantrill, Adam Leventhal and the rest of the former Sun kernel engineers live, it will be playing catch-up forever.

My impression about package management being a shit-show on other operating systems is that every single podcast or blog post I read about those operating systems is celebrating that "package management is easy now with pkg" -- while it's actually not IMO as good as certain GNU/Linux package managers.

In order for that opinion of yours to actually mean anything, the question is: how many packaging formats do you know to produce packages for? Only then will you be in a position where you would actually be competent to make such a statement, and where your opinion would actually make a difference. I myself have packaged for HP-UX, IRIX, Solaris, GNU/Linux (RPM), GNU/Linux (DPKG), and SmartOS (pkgsrc), so I'm in a position to make such statements, and yet I didn't. You, on the other hand, apparently have no such inhibitions.

Only 14000?

"Only". If you had packaged, you would have known that the same body of software could be delivered by an arbitrary number of packages, since that depends on the packager(s), respectively on the architecture.

Also, what distribution of GNU/Linux, what version of the distribution, how much automated QA happens before releases, etc?

The same thing which happens on GNU/Linux; if you think that packages on GNU/Linux get tested like a classic UNIX vendor would test it, you're naive.

This is all not true, and you should know better. Docker/runC doesn't have an init process in containers. You _can_ run an init process, but it isn't necessary.

Is that right? https://blog.phusion.nl/2015/01/20/docker-and-the-pid-1-zomb...

In addition, the zombie problem doesn't exist because of sub-reapers which are a Linux kernel feature.

As it stands, I've worked with Linux extensively and I've never heard of such a thing. Please show me that code, for if you actually manage to show me that, I will have learned something new.

In addition, I am a maintainer of runC and actually care much more about the OCI than Docker.

Aaahhh, so that's why you keep on blindly lobbying for Docker. Now we finally get to the bottom of the thing. Couldn't you just be honest from the onset like I was on what your motivation is, so everybody knows where everybody stands?

So basically, you're trying to re-invent project Kevlar, Solaris zones, in a completely generic way so as to be everything and not be anything in particular to anyone. Did you not study UNIX, and the old Henry Spencer's saying

those who do not understand UNIX are condemned to re-invent it -- badly?

You could just bite the bullet and benefit from a complete, enterprise, battle tested solution with a decade of use behind it, and use Solaris zones by using vmadm(1M) in SmartOS, you know. No need to re-invent the wheel. Again.

How many more solutions does GNU/Linux need in order to be able to run lightweight virtual machines? Can't you people engineer one solution which actually works properly, like we have it in illumos? Apparently that's too much to ask. Or you could just use zones, which have been working for a decade, and actually let one run lightweight virtual servers running at the speed of bare metal in production, today:

https://smartos.org/man/1m/vmadm

https://smartos.org/man/1m/zoneadm

https://www.youtube.com/watch?v=hgN8pCMLI2U

As for "OCI", I have no idea what you're writing about. To me, an oldschool UNIX guy, "OCI" stands for "Oracle Call Interface":

http://www.oracle.com/technetwork/database/features/oci/inde...

Finally, a word on your question about setting up Manta. Apparently it's open source:

https://github.com/joyent/manta

...which means you can set it up seven ways 'till Sunday, and do with it whatever you please, however you please. I myself don't use it, because I design my availability into the application I write, and my interest is in the infrastructure layer anyway, where things like DNS are designed to be highly available from the onset (again availability in the application, and not the OS layer).

http://dtrace.org/blogs/dap/2013/07/03/fault-tolerance-in-ma...

icebraining · on July 25, 2016

If you want to argue the merits of SmartOS over Docker on any other topic, you have plenty of avenues. This thread is about storage, and while I feel sympathetic to your position, frankly you're off-topic (and off-putting).

Annatar · on July 25, 2016

Okay, then answer me why it makes sense to waste one's time on storage contortions in Docker on Linux when a different technology offers an enterprise tested solution? In SmartOS all I have to do is instantiate a docker branded zone, and the storage it uses is automatically permanent, no extra steps needed like on Linux.

cyphar · on July 26, 2016

> and the storage it uses is automatically permanent

Is it also automatically replicated to a distributed storage pool so that it is mirrored to all of your other containers? Because that's what people are talking about when they discuss "container storage" -- they're discussing creating storage for a distributed system.

You can create persistence incredibly easily with runC (the core executor for Docker). In fact, runC by default just allows you to store all of your state in your container's rootfs (you need to manually add mounts to make your state run in a tmpfs). It's just not very useful to have all of your state on one of your compute nodes.

I really don't understand why you keep pretending that we're focusing on the simple things. If I didn't know better, I would think that you're trying to distort the problems we're discussing (which are problems for every system) into very easy problems that SmartOS has solved (but also every other operating system has solved as well). As someone who thinks SmartOS has some very powerful technology, I'm surprised that an advocate of SmartOS is resorting to strawmanning the problems that we're discussing. Surely the technology should stand for itself.

Annatar · on July 26, 2016

Is it also automatically replicated to a distributed storage pool so that it is mirrored to all of your other containers?

Even better, it uses a distributed object store backed by ZFS called "Manta". Do you have any more questions?

Meanwhile, you are very persistent at stonewalling my original question: why the insistence on using Docker?

cyphar · on July 27, 2016

> why the insistence on using Docker?

I answered this in a cousin comment. https://news.ycombinator.com/item?id=12164793

> distributed object store backed by ZFS called "Manta".

a) Is this the default for SmartOS installs, and is this how containers work by default on SmartOS? Is the method of setting this up seamelss?

b) Can you self host Manta.

c) What are you even arguing about, you're just pointing at an cloud service and declaring victory. You might want to elaborate a bit more.

> Do you have any more questions?

Yes. Apart from the fact that I could just use S3 to achieve the exact same thing, and the fact that you might not be able to store things as pure objects (since you can't do partial writes, etc etc) do you not see that as dodging the original point? Do zones provide a seamless way of mounting a particular set of manta objects into a container so that you can actually store secrets inside manta without also storing your credentials everywhere?

Also, out of interest, has Joyent liberated for Manta (I know that SmartDataCenter is free software, but I haven't checked if Manta is).

There are a few other things that GNU/Linux containers have that I think SmartOS Zones don't. Can you start application containers, or is the only mode for Zones a "boot" process with an init daemon? Also out of interest, do you have the concept of rootless zones -- zones entirely created by an unprivileged user?

rburhum · on July 22, 2016

I am not debating whether you should or should not store files on S3. I can see use cases for both using S3 (or some alternate storage).

My point is that - even in the case for django configured with S3 in this docker tutorial, there is absolutely no mention of this. The "dockerized Django" example in the documentation destroys your uploaded files every time you restart the container. Most of the "Dockerizing <whatever you want>" that I have read so far miss the maintaining state part. To me, that is overlooking an important point.

ngrilly · on July 22, 2016

Well, I keep reading this, but what if you design an app that relies on local storage, for example one that uses an embedded database like SQLite, LevelDB or BoltDB? 12 factors don't address this.

aleem · on July 22, 2016

You would not get redundancy or scale and you would need to do offsite backups yourself which is why it's not recommended over cloud DBs.

Building stateless containers is good practice.

There will always be exceptions but it's good to know the tradeoffs before making those exceptions.

ngrilly · on July 22, 2016

Yes, I'm aware stateful services need redundancy, scaling and backups. That's exactly my point.

12 Factors only address stateless processes, which is the easy part. But most applications are stateful. It doesn't matter if data are stored in an embedded database (like SQLite, LevelDB, BoltDB) or in a client-server database (like PostgreSQL, MySQL, MongoDB, RabbitMQ, Redis). 12 Factors doesn't address this issue at all.

icebraining · on July 25, 2016

The point of 12 Factors is to isolate the application from the data store, making it easy to manage and upgrade the former. That's important because they recognize applications are stateful. But yes, 12 Factors doesn't purport to guide people on how to manage the stateful part.

tokenizerrr · on July 22, 2016

Right, and third party software is a thing. There are also plenty of reasons to prefer the simplicity of SQLite even if that means you have the task of managing your own backups.

vidarh · on July 22, 2016

There are plenty of reasons to prefer that simplicity if you intend to write software meant to run on a single computer. The moment you want to deploy a service, unless the data is "mostly static" (e.g. something you might update offline and ship copies of), the simplicity that makes things like Sqlite great often ends up creating complexity.

tokenizerrr · on July 22, 2016

Not everything needs to be highly available. Using docker does not mean that you are using it in a highly available environment. Plenty of things I run in docker containers can be down for a few hours every night to run backups.

andrewstuart2 · on July 22, 2016

So I was going to be snarky about the author not having their linux user as a member of the "docker" group (thus requiring sudo every command) but that sparked a line of thought; so I'll ask:

Is it better practice to leave yourself out of the docker group, thus forcing explicit use of sudo, since the daemon runs as root? Is there a better daemon auth model that's not in use so you can at least have longer-lived tokens, etc?

Also, in case you do want to skip the sudo every time (careful with the potential security risk):

    sudo usermod -aG docker $(whoami)

atmosx · on July 22, 2016

Only a developer could ask such a question!!!! I'm joking :-P

From a sysadmin standpoint sudo is the right choice. Sudo is an established, well defined, (mostly) bug-free program, designed to specifically for that task. It's tool for the job. You can create specific policies[1] keep track of who, what, when, allow this and deny that.

I reckon that this case is a bit tricky though and most people just use sudo to get 'root', so if you're going to do just that, then I guess it's the same.

[1] http://linux.die.net/man/5/sudoers

cyphar · on July 22, 2016

> I reckon that this case is a bit tricky though and most people just use sudo to get 'root', so if you're going to do just that, then I guess it's the same.

sudo is still better in that case, because sudo leaves an audit trail in your system log. Docker doesn't keep a detailed audit log for every request made by a user.

alrs · on July 22, 2016

https://github.com/a2o/snoopy

manacit · on July 22, 2016

Handing an unprivileged user access to Docker is, functionally, the same as handing them access to sudo. Once you are allowed to run arbitrary Docker containers, you have limitless access to root on the running host.

For this reason, I would vastly prefer requiring sudo to communicate with a local Docker daemon. Sudo was designed for this purpose, has the proper logging and fin(er) grained access control.

lojack · on July 22, 2016

Everyone always says this, and I know security isn't a #1 priority and that it'd be a big mistake to assume Docker is secure -- but, are there any known security issues with Docker that could give sudo access to the host machine? Anything beyond the standard: "Don't trust it because its almost certainly not secure."

manacit · on July 23, 2016

The issue, as another poster pointed out, is not that there are vulnerabilities in the Docker daemon, but that access to the socket inherently gives you access to run arbitrary containers in privileged mode. This allows you access to the full host: https://docs.docker.com/engine/reference/run/#/runtime-privi... and everything that a normal root user can do.

At present, there is no great way to mitigate this if you're tracking the official Docker releases (at least, as far as I know).

Titanous · on July 22, 2016

Yes, you can run a privileged container that bind-mounts / on the host into the container. root.

emmelaich · on July 23, 2016

I presume that the default selinux policy in redhat7 would stop this.

But that would mean using the docker version in redhat7, which tends to trail behind a little.

cyphar · on July 22, 2016

> Is there a better daemon auth model that's not in use so you can at least have longer-lived tokens, etc?

Docker does have authorization plugins[1], and there is work to get authentication working[2]. So the answer is "eventually being in the docker group will be safer", but not yet.

Personally, I think the future of containers (at least for a lot of the cases I deal with) is going to be with rootless containers[3]. But that's another story.

[1]: https://github.com/docker/docker/pull/15365 [2]: https://github.com/docker/docker/issues/14674 [3]: https://github.com/opencontainers/runc/pull/774