Enabling overcommit machine-wide is a puerile, broken approach that not only converts your server into an unreliable toy, but encourages other idiots to rely on the same broken behavior in their libraries, language implementations, and so forth, basically leading to the current plethora of collection libraries that don't even bother to monitor their own memory use or check malloc's return. It is a software engineering plague, a rot on the underbelly of allegedly-solid code. The OOM killer's unpredictability causes any number of problems in actual production environments, usually by killing the wrong process, and secretly ripping the stability out of programs whose code does check malloc's return. The answer is:
{ echo 'vm.overcommit_memory = 2' ; echo 'vm.overcommit_ratio = 100' ; } >/etc/sysctl.d/10-no-overcommit.conf
Which restores classical semantics and allows processes to identify memory allocation failures and respond to them responsibly in a number of ways (garbage collection being an obvious one; clean, safe exits after logging being another).
Now, if we could say that a specific process was allowed to overcommit because we could guarantee it would use the bogus memory allocation, then we'd have something vaguely useful.
And after following this advice, you end up with a system that can fail to fork() even when half of the computer's memory is free. This can also turn your once-working server into an unreliable toy.
(Also, see the comments in the original article that talk about vm.overcommit_memory=2 not actually doing what it claims to do...)
I was under the impression that fork() basically set up copy-on-write pages for the child process, and thus if it exec()'ed it would lose all those pages anyways and start fresh. Are those VM settings changing that behavior?
It wouldn't affect the copy-on-write optimization or exec(), it just causes fork() to return an error earlier. 'vm.overcommit_memory = 2' attempts to guarantee that after a fork, all of the copy-on-write pages can still be dirtied (i.e. causing a copy) without the system running out of RAM.
Good luck making sure that all your running programs use vfork(). And why would they? If you read the man page, it is hardly encouraging:
It is rather unfortunate that Linux revived this specter from the past. The BSD man page states: "This system call will be eliminated when proper system sharing mechanisms are implemented. Users should not depend on the memory sharing semantics of vfork() as it will, in that case, be made synonymous to fork(2)."
Some apps also deliberately use fork() and the copy-on-write behaviour to implement features. Redis uses it to create its backup file (http://redis.io/topics/faq):
>> Redis background saving schema relies on the copy-on-write semantic of fork in modern operating systems: Redis forks (creates a child process) that is an exact copy of the parent. The child process dumps the DB on disk and finally exits.
It's possible to use safely, and the benefits are worth it --- no commit charge problems and no time spent copying page tables. (Even if the memory itself is copy-on-write, you still have to set up the child's address space.)
I'm sick of people cargo-culting ideas like "vfork is bad" without really understanding the issues.
If you run a mission critical server on Linux, then you need to engage your brain and understand the requirements of your workload. That's what system administrators are paid to do.
The question here is what are suitable defaults for non-critical desktop uses of Linux, given that there will always be limits on the amount of RAM we can put into a machine and/or badly behaving processes written by CADTs.
>> Enabling overcommit machine-wide is a puerile, broken approach that not only converts your server to an unreliable toy
Why should the reliability of your server matter (beyond a certain point)? For years Erlang developers have been following the "let it crash and a supervisor will restart it" model. They seem to have the uptime numbers to back them up.
Because Erlang is also built around that concept. Often when a server fails it needs to be brought back up by hand and then wait until it can be brought back into rotation (depending on how it's used &c).
Additionally, some applications, like a database, often run on only a single host (sure, you have hot spares, but failover is often manual and recovery is definitely manual).
So while I get your point, we're not going to throw out everything we have simply because it wasn't built around "let it fail".