Nix is a better Docker image builder than Docker's image builder

kstenerud · 2024-03-16T05:59:53 1710568793

I've tried again and again to like Nix, but at this point I have to throw in the towel.

I have 2 systems running Nix, and I'm afraid to touch them. I've already broken both of them enough that I had to reinstall from scratch in the past (yes yes - it's supposed to be impossible I know), and now I've forgotten most of it. In theory, Nix is idempotent and deterministic, but the problem is "deterministic in what way?" Unless you intimately understand what every dependent part is doing, you're going to get strange results and absolutely bizarre and unhelpful errors (or far more likely: nothing at all, with no feedback). Nix feels more like alchemy than science. Like trying to get random Lisp packages to play nice together.

Documentation is just plain AWFUL (as in: complete and technically accurate, but maddeningly obtuse), and tutorials only get you part of the way. The moment you step off the 80% path, you're in for a world of hurt, because the underlying components are just not built to support anything else. Sure, you can always "build your own", but this requires years of experiential knowledge and layers upon layers of frustration that I just don't want to deal with anymore (which is also why I left Gentoo all those years ago). And woe unto you if you want to use a more modern version than the distribution supports!

The strength of Docker is the chaos itself. You can easily build pretty much anything, without needing much more than a cursory understanding of the shell and your distro's package manager. Or you can mix and match whatever the hell you want! When things break, it's MUCH easier to diagnose and fix the problems because all of the tooling has been around for decades, which makes it mature enough to handle edge cases (and breakage is almost ALWAYS about edge cases).

Nix is more like Emacs: It can do absolutely anything if you have the patience for it and the deep, arcane knowledge to keep it from exploding in a brilliant flash of octarine. You either go full-in and drink the kool aid, or you keep it at arm's length - smiling and nodding as you back slowly towards the door whenever an enthusiast speaks.

janjongboom · 2024-03-16T07:00:38 1710572438

I've gone down the same path. I love deterministic builds, and I think Docker's biggest fault is that to the average developer a Dockerfile _looks_ deterministic - and it even is for a while (build a container twice in a row on the same machine => same output), but then packages get updated in the package manager, base images get updated w/ the same tag, and when you rebuild a month later you get something completely different. Do that times 40 (the number of containers my team manages) and now fixing containers is a significant part of your job.

So in theory Nix would be perfect. But it's not, because it's so different. Get a tool from a vendor => won't work on Nix. Get an error => impossible to quickly find a solution on the web.

Anyway, out of that frustration I've funded https://www.stablebuild.com. Deterministic builds w/ Docker, but with containers built on Ubuntu, Debian or Alpine. Currently consists of an immutable Docker Hub pull-through cache, full daily copies of the Ubuntu/Debian/Alpine package registries, full daily copies of most popular PPAs, daily copies of the PyPI index (we do a lot of ML), and arbitrary immutable file/URL cache.

So far it's been the best of both worlds in my day job: easy to write, easy to debug, wide software compatibility, and we have seen 0 issues due to non-determinism in containers that we moved over to StableBuild in my day job.

txutxu · 2024-03-16T21:43:01 1710625381

I think this issue is not specific to containers.

I've work many years on bare metal. We did (by requirement) acceptance tests, so we did need deterministic builds, before such thing had even a name, or at least before it was mentioned as much as nowadays.

Redhat has a lot of tooling around versioning of mirrors, channels, releases, updates, etc. But I'm so old that even foreman and spacewalk didn't exist, redhat satellite was out of the budget, and the project was migrating from the first versions of CentOS to Debian.

What I did was simply use DNS + Vhosts (dev, stage, prod + versions) for our own package mirrors, and bash+rsync (and of course, raid+backups), with both, CentOS and Debian (and our project packages).

So we had repos like prod/v1.1.0, stage/v1.1.0, dev/v1.1.0, dev/v2.0.0, dev/2.0.1, etc Allowing us to rebuild things without praying, backport bug fixings with confidence, etc

Feels old and simple, however I think it was the same problem/issue that people gets now (re)building containers.

If you need to be able to produce the same output from the same input, you need the same input.

BTW about stablebuild: nice project!

IshKebab · 2024-03-16T08:26:30 1710577590

But also Nix solves more problems than Docker. For example if you need to use different versions of software for different projects. Nix lets you pick and choose the software that is visible in your current environment without having to build a new Docker image for every combination, which leads to a combinatorial explosion of images and is not practical.

But I also agree with all the flaws of Nix people are pointing out here.

ktosobcy · 2024-03-16T10:32:39 1710585159

I don't have any experience with Nix but regarding stable builds of Docker: we provide Java application, have all dependencies as fixed versions so when doing a release, if someone is not doing anything fishy (re-releasing particular version, which is bad-bad-bad) you will get exactly same binaries on top of the same image (again, considering you are not using `:latest` or somesuch)...

janjongboom · 2024-03-16T10:46:29 1710585989

Until someone overwrites or deletes the Docker base image (regularly happens), or when you depend on some packages installed through apt - as you'll get the latest version (impossible to pin those).

theamk · 2024-03-16T16:03:27 1710605007

I am convinced that any sort of free public service is fundamentally incomapatible with long term reproducible builds. It is simply unfair to expect free service to maintain archives forever and never clean them up, rename itself, or go out of business.

If you want reproducibility, the first step is to copy everything to a storage you control. Luckily, this is pretty cheap nowdays

ktosobcy · 2024-03-16T11:44:07 1710589447

> Until someone overwrites or deletes the Docker base image (regularly happens)

Any source of that claim?

> or when you depend on some packages installed through apt - as you'll get the latest version (impossible to pin those).

Well... please re-read my previous comment - we do Java thing so we use any JDK base image and then we slap our distribution on top of it (which are mostly fixed-version jars).

Of course if you are after perfection and require additional packages then you can install it via dpgk or somesuch but... do you really need that? What about security implications?

janjongboom · 2024-03-16T13:36:33 1710596193

> Any source of that claim?

Any tag like ubuntu:20.04 -> this tag gets overwritten every time there's a new release (which is very often)

https://hub.docker.com/r/nvidia/cuda -> these get removed (see e.g. https://stackoverflow.com/questions/73513439/on-what-conditi...)

ktosobcy · 2024-03-16T14:44:20 1710600260

You gave example of nvidia and not ubuntu itself. What's more, you are referring to devel(opment) version, i.e. "1.0-devel-ubuntu20.04" which seems like a nightly so it's expected to be overriden (akin to "-SNAPSHOT" for java/maven)?

Besides, if you really need utmost stability you can use image digest instead of tag and you will always get exactly the same image...

lolinder · 2024-03-16T13:49:53 1710596993

Do you have an example that isn't Nvidia? They're infamous for terrible Linux support, so an egregious disregard for tag etiquette is entirely unsurprising.

codethief · 2024-03-16T14:29:38 1710599378

> Anyway, out of that frustration I've funded https://www.stablebuild.com. Deterministic builds w/ Docker, but with containers built on Ubuntu, Debian or Alpine.

Very nice project!

stefanha · 2024-03-16T10:18:17 1710584297

Another option for reproducible container images is https://github.com/reproducible-containers although you may need to cache package downloads yourself, depending on the distro you choose.

janjongboom · 2024-03-16T10:48:02 1710586082

Yeah, very similar approach. We did this before, see e.g. https://www.stablebuild.com/blog/create-a-historic-ubuntu-pa... - but then figured everyone needs exactly the same packages cached, so why not set up a generic service for that.

stefanha · 2024-03-16T12:27:15 1710592035

For Debian, Ubuntu, and Arch Linux there are official snapshots available so you don't need to cache package downloads yourself. For example, https://snapshot.debian.org/.

janjongboom · 2024-03-16T13:38:09 1710596289

Yes, fantastic work. Downside is that snapshot.debian.org is extremely slow, times out / errors out regularly - very annoying. See also e.g. https://github.com/spesmilo/electrum/issues/8496 for complaints (but it's pretty apparent once you integrate this in your builds).

codethief · 2024-03-16T14:26:35 1710599195

Ubuntu now has snapshot.ubuntu.com, see https://ubuntu.com/blog/ubuntu-snapshots-on-azure-ensuring-p...

Here's a related discussion about reproducible builds by the Docker people, where they provide some more details: https://github.com/docker-library/official-images/issues/160...

TeeWEE · 2024-03-16T08:24:29 1710577469

Just pin the dependencies and your mostly fine right?

janjongboom · 2024-03-16T09:34:43 1710581683

Yeah, but it's impossible to properly pin w/o running your own mirrors. Anything you install via apt is unpinnable, as old versions get removed when a new version is released; pinning multi-arch Docker base images is impossible because you can only pin on a tag which is not immutable (pinning on hashes is architecture dependent); Docker base images might get deleted (e.g. nvidia-cuda base images); pinning Python dependencies, even with a tool like Poetry is impossible, because people delete packages / versions from PyPI (e.g. jaxlib 0.4.1 this week); GitHub repos get deleted; the list goes on. So you need to mirror every dependency.

codethief · 2024-03-16T14:53:41 1710600821

> Anything you install via apt is unpinnable, as old versions get removed when a new version is released

Huh, I have never had this issue with apt (Debian/Ubuntu) but frequently with apk/Alpine: The package's latest version this week gets deleted next week.

pl4nty · 2024-03-17T00:44:29 1710636269

> apt is unpinnable, as old versions get removed

not necessarily, eg snapshot.debian.org

> pinning on hashes is architecture dependent

can't you pin the multi-arch manifest instead?

I still like StableBuild for protection against package deletion, and mirroring non-pinnable deps

keybits · 2024-03-16T08:43:21 1710578601

The pricing page for StableBuild says

Free …

Number of Users 1

Number of Users 15GB

Is that a mistake or if not can you explain please?

https://www.stablebuild.com/pricing

janjongboom · 2024-03-16T09:26:01 1710581161

Ah, yes, on mobile it shows the wrong pricing table... Copying here while I get it fixed:

Free => Access to all functionality, 1 user, 15GB traffic/month, 1GB of storage for files/URLs. $0

Pro => Unlimited users, 500GB traffic included (overage fees apply), 1TB of storage included. $199/mo

Enterprise => Unlimited users, 2,000GB traffic included (overage fees apply), 3TB of storage included, SAML/SSO. $499/mo

ethanwillis · 2024-03-16T15:59:27 1710604767

Are you associated with the project?

janjongboom · 2024-03-16T16:56:11 1710608171

I’m an investor in StableBuild.

korijn · 2024-03-16T15:19:30 1710602370

What is an efficient process to avoid using versions with known vulnerabilities for long times when using a tool like stablebuild?

orbital-decay · 2024-03-16T07:20:24 1710573624

>Documentation is just plain AWFUL (as in: complete and technically accurate, but maddeningly obtuse)

Documentation is often just plain erroneous, especially for the new CLI and flakes, not even edge cases. I remember spending some time trying to understand why nix develop doesn't work like described and how to make it work like it should. I feel like nobody ever actually used it for its intended purpose. Turns out that by default it doesn't just drop you into the build-time environment like the docs claim (hermetically sealed with stdenv scripts available), it's not sealed by default and the commandline options have confusing naming, you need to fish out the knowledge from the sources to make it work. Plenty of little things like this.

>In theory, Nix is idempotent and deterministic

I surely wish they talked more about edge cases that break reproducibility. Things like floating point code being sensitive to the order of operations with state potentially leaking from OS preemption, and all that. Which might be obvious, but not saying obvious things explicitly is how you get people shoot themselves in the foot.

mananaysiempre · 2024-03-16T10:20:37 1710584437

> Things like floating point code being sensitive to the order of operations with state potentially leaking from OS preemption, and all that.

That’s profoundly cursed and also something that doesn’t happen, to my knowledge. Unless the kernel programmer screwed up, an x86-64 FPU is perfectly virtualizable (and I expect an AArch64 FPU too, I just haven’t tried). So it doesn’t matter where preemtion happens.

(What did happen with x87 is that it likes to compute things in more precision than you requested, depending on how it’s configured—normally determined by the OS ABI. Yet variable spills usually happened in the declared precision, so you got different results depending on the particulars of the compiler’s register allocator. But that’s still a far cry from depending on preemption of all things, and anyway don’t use x87.

Floating-point computation does depend on associativity, in that nearestfp(nearestfp(a+b)+c) is not the same as nearestfp(a+nearestfp(b+c)), but the sane default state is that the compiler will reproduce the source code as written, without reassociating things behind your back.)

orbital-decay · 2024-03-16T12:32:16 1710592336

That's doesn't happen in a single thread, but e.g. asynchronous multithreaded code can spit values in arbitrary order, and depending on what you do you can end up with a different result (floating point is just an example). Generally, you can't guarantee 100% reproducibility for uncooperative code because there's too much hardware state that can't be isolated even in a VM. Sure, 99% software doesn't depend on it or do cursed stuff like microarchitecture probing during building, and you won't care until you try to package some automated tests for a game physics engine or something like that. What can happen, inevitably happens.

We don't need to be looking for such contrived examples actually, nixpkgs track the packages that fail to reproduce for much more trivial reasons. There aren't many of them, but they exist:

https://github.com/NixOS/nixpkgs/issues?q=is%3Aopen+is%3Aiss...

Foxboron · 2024-03-16T17:08:01 1710608881

> We don't need to be looking for such contrived examples actually, nixpkgs track the packages that fail to reproduce for much more trivial reasons. There aren't many of them, but they exist

Less than a couple of thousand packages are reproduced. Nobody has even attempted to rebuild the entirety of the nixpkgs repository and I'd make a decent wager on it being close to impossible.

fuzzy2 · 2024-03-16T08:20:08 1710577208

It’s really not that bad. However, with a standard NixOS setup, you still have a tremendous amount of non-reproducible state, both inside user accounts and in the system. I’m running a “Erase your darlings” setup, it mostly gets rid of non-reproducible state outside my user account. It’s a bit of a pain, but then what isn’t on NixOS.

https://grahamc.com/blog/erase-your-darlings/

Inside my user account, I don’t bother. I don’t like Home Manager.

SkyMarshal · 2024-03-16T09:17:32 1710580652

A nice upgrade to that is to put root in a tmpfs RAM filesystem instead of ZFS:

https://elis.nu/blog/2020/05/nixos-tmpfs-as-root/

That way it doesn't even need to bother with resetting to ZFS snapshots, instead it just wipes root on shutdown and reconstructs it in RAM on reboot.

Then, optionally, with some extra work you can put /home in tmpfs too:

https://elis.nu/blog/2020/06/nixos-tmpfs-as-home/

That setup uses Home Manager, so maybe it's not for you, but worth mentioning if we're talking about making all state declarative and reproducible. You have to use the Impermanence module and set up some soft links to permanent home folders on different drive or partition. But for making all state on the system reproducible and declarative, this is the best way afaik.

fuzzy2 · 2024-03-16T14:58:26 1710601106

Thanks, that's interesting. It allows one to stick to "regular Linux filesystems", which is probably a good thing.

SkyMarshal · 2024-03-16T15:38:22 1710603502

True, I think it's more a more elegant setup than the ZFS version. Why actively rollback to a snapshot when ephemeral memory will do that automatically on reboot.

That said I'll just mention that ZFS support on NixOS is like nothing else I've seen in Linux. ZFS is like a first-class citizen on NixOS, painless to configure and usually just works like any other filesystem.

https://old.reddit.com/r/NixOS/comments/ops0n0/big_shoutout_...

weatherlight · 2024-03-16T10:00:18 1710583218

I use both Docker and NixOs at work. I've never had any of the problems you seemed to have above. Docker is fine, performance wise it's not great on Macs. I love nix because it's trivial to get something to install and behave the same across different machines.

Nix Doc are horrible but I've found that ChatGPT4 is awesome at troubleshooting Nix issues.

I feel like 90% of the time I run into Nix issues, it's because I decided to do something "Not the Nix way."

laerus · 2024-03-16T08:41:22 1710578482

Give a try to Fedora Atomic (immutable). At this point I have pretty much played around and used every distro package maneger there is and I have broken all of them in one way or another even without doing something exotic (pacman I am looking at you). My Fedora Kinoite is still going strong even with adding/removing different layers, daily updates, and a rebase from Silverblue. Imho rpm-ostree will obsolete Nix.

plagiarist · 2024-03-16T11:56:58 1710590218

How do you alter layering without a restart? Just have an immutable base and do other rpm-ostrees in containers? Is that what flatpak is up to?

laerus · 2024-03-16T19:28:51 1710617331

You have to restart to boot into a new image. You use containers for stuff you don't need into your base distro, like cli tools, and flatpak for any desktop applications.

aredox · 2024-03-16T09:43:36 1710582216

Maybe it won't be your cup of tea given your reference to Emacs, but there's guix if you want to try a saner alternative to nix.

fransje26 · 2024-03-16T11:39:50 1710589190

> Documentation is just plain AWFUL (as in: complete and technically accurate, but maddeningly obtuse)

That has been the case for as long as I can remember. I gave up on Nix about 5 years ago because of it, and apparently not much has changed on that front since then..

namaria · 2024-03-16T20:17:29 1710620249

I never tried going all in on Nix, but I don't think it's an all or nothing proposition. In my case, I use Ubuntu for my personal notebook and I wanted to prototype something with Elixir. The distro package is versions behind latest so I can't use Phoenix 1.7 with it. The solution was simple: there's a Nix package for the latest version, so I simply used nix-shell. Bonus points for having VSCode so I didn't have to install it on my personal machine. So for the price of running <nix-shell -p vscode erlang elixir> I got all I needed with very minimal fuss.

jokethrowaway · 2024-03-16T20:23:39 1710620619

Ubuntu is so out of date it's barely useable.

I've been a nixos user for years and I generally had the opposite problem: the latest of the package you want is not available but hey here's a version from months ago - or just build it yourself (which is not hard, oftentimes updates work fine with no build change, you just point at a different version).

Also rebuilding everything at every update take forever (I had a few nix-shells with ai dependencies that would take hours to upgrade).

I love the concept of nix but I'm back to Arch, binary bleeding edge packages and AUR for less supported stuff.

namaria · 2024-03-17T14:01:16 1710684076

Yes I think I will move on from Ubuntu on the next opportunity. It was a great starter Linux tho.

mtmk · 2024-03-16T13:06:27 1710594387

I recently faced a similar hurdle with Nix, particularly when trying to run a .NET 8 AOT application. What initially seemed like it would be a simple setup spiraled into a plethora of issues, ultimately forcing me to back down. I found myself having to abandon the AOT method in favor of a more straightforward solution. To give credit where it's due, .NET AOT is relatively new and, as far as I know, still has several kinks that need ironing out. Nonetheless, I agree that, at least based on my experience, you need a solid understanding of the ins and outs before you can be reasonably productive using Nix.

Smaug123 · 2024-03-16T13:44:36 1710596676

.NET AOT really is not designed for deployment, in my experience - for example, the compilation is very hard to do in Nix-land, because a critical part of the compilation is to download a compiler from NuGet at build-time. It's archetypical of the thousand ways that .NET drives me nuts in general.

mtmk · 2024-03-20T00:10:02 1710893402

It's intended for 'cloud-native' deployments, as I understand it, so I concur that it's quite disappointing. The concept of downloading compilers via NuGet doesn't sit well with me either. However, I've observed performance enhancements in applications compiled AOT, and I remain optimistic that future versions of .NET will bring further improvements.

paulddraper · 2024-03-16T07:53:09 1710575589

> The strength of Docker is the chaos itself.

That depends whether you are okay with chaos.

It appears that you are, so it is suitable tool for you. Choose the right tool for the right job.

---

Docker is a poor choice for people who are interested in deterministic/reproducible builds.

devjab · 2024-03-16T08:39:34 1710578374

I’m not sure exactly why this is being downvoted. It seems pretty fair to want your container builds to not fail because of the “chaos” with docker images and how they change quite a lot. This isn’t about the freedom to build how you want, it’s about securing your build pipelines so that they don’t break at 4am because docker only builds 99% of the time.

I’ll use docker, I like docker, but I can see the point of how it’s not necessarily advantageous if stability is your main goal.

ownagefool · 2024-03-17T16:35:32 1710693332

It's more complicated than that. Reproducible builds help build confidence that your build process isn't compromised.

Sure, your compiler, your hardware, or your distro might be compromised, but if you follow the chain all the way through you does indeed validate version X does result in SHA y, there's now less things were blindly trusting.

It also helps with things like rolling back to earlier versions when you don't still have the binary kicking around without having to revalidate the binary.

If you're not getting the same SHA on different hardware, weeks apart, even if it's good enough for you, it's not reproducible

underdeserver · 2024-03-16T16:26:03 1710606363

I'm just here to give you points for the Discworld reference.

7speter · 2024-03-16T13:30:21 1710595821

You complain about the documentation, and the first thing I wonder is if you’ve tried using one of the prominent chatbots like chatgpt or claude to help fill in the gaps of said documentation? Maybe an obvious thing to do around here, but I’ve found they help fill in documentation gaps really well. At the same time Nix is so niche there might not have been enough information out there to feed into even chatgpt’s model…

tripdout · 2024-03-16T19:09:59 1710616199

>I've already broken both of them enough that I had to reinstall from scratch in the past (yes yes - it's supposed to be impossible I know)

Could you mention a bit about how they broke? I'm curious to see how that state looks, as from my perspective switching to a previous configuration seems to cover everything.

amelius · 2024-03-16T16:18:31 1710605911

Yes at this point I hope someone builds a friendlier version on top of Nix, so we can cleanly migrate completely away from it.

intelVISA · 2024-03-16T15:57:54 1710604674

It has a bit of a learning curve that is worth it - it's an incredible tool.

RGamma · 2024-03-16T17:25:49 1710609949

Just out of curiosity. What were you trying to do that didn't work?

benreesman · 2024-03-16T09:31:49 1710581509

Nix and NixOS are in something like the state git was in before GitHub: the fundamental idea is based on more serious computer science than the status quo (SVN, Docker), the plumbing still has some issues but isn’t worse, and the porcelain and docs are just not there for mainstream adoption.

I think that might have changed with the release of flox: https://flox.dev, it’s basically seamless (and that’s not surprising coming from DE Shaw).

Nix doesn’t really make sense without flakes and nix-command, those things are documented as experimental and defaulted off. The documentation story is getting better, but it’s not there. nixlang is pretty cool once you learn it well, but it’s never going to be an acceptable barrier to entry at the mainstream. It’s not really the package manager it’s advertised as, nix-env -iA foo is basically never what you want. It’s completely unsurprising that it’s still a secret weapon of companies with an appetite for bleeding-edge shit that requires in-house expertise.

flox addresses all of this for the “try it out and immediately have a better time” barrier.

Nix/NixOS or something like it is going to send Docker to the same dustbin Subversion is in now, but that’s not going to happen until it has the GitHub moment, and then it’ll happen all at once.

Most of the complaints with Nix in this thread are technically false, but eminently understandable and more importantly (repeat after me Nix folks): it’s never the users fault.

I’m aware that I’m part of a shrinking cohort who ever knew a world without git/GitHub, so I know this probably sounds crazy to a large part of the readership, but listen to Linus explaining to a room full of people who passed the early Google hiring bar why they should care about a tool they feel is too complicated for them:

https://youtu.be/MjIPv8a0hU8?si=QC0UnHXRdMpp2tI4

ronef · 2024-03-16T18:22:35 1710613355

Ron from flox.dev here, the note brought a lot of smiles across the team. We've been working on this for a while now and would love to hear if there is anything we can prioritize or do to keep making it better.

benreesman · 2024-03-17T00:24:57 1710635097

I’m glad to hear it! I’ve been grappling with how to package something I’m calling “HYPER // MODERN” (which I can talk about if you’re curious) and we’re pretty locked-in on flox at this point, it had been a combination of flakes and Homebrew and flox is just a better time.

If you drop me an email at b7r6@b7r6.net (I also just joined your slack) I’d love to give my feedback on this or that nitpick.

But overall, well done friends, very very nice stuff.

rfoo · 2024-03-16T10:09:22 1710583762

I believe for a developer tool to success, for the most common thing to do there has to be at least three ways an engineer may misuse your tool and still get it "done" (by leaving non-obvious tech debts behind).

This is true for git, but not so true yet for Nix, so I'm not sure a GitHub-like moment helps.

benreesman · 2024-03-16T11:12:49 1710587569

In a full-metal-jacket NixOS setting it’s bloody hard to bash (no pun intended) your way through to the next screen by leaving behind tech debt (Python comes to mind, I made the mistake of trying to use Nix to manage Python once, never again).

But anywhere else you just brew/apt/rpm install whatever and nix develop —impure, which is easier than most intermediate git stuff and plenty of beginner stuff. git and Nix are almost the same data structure if you start poking around in .git or /nix/store, I might not understand what you mean without examples.

But all my guesses about what you might mean are addressed well by flox.

whazor · 2024-03-15T22:46:16 1710542776

This blog post is missing the reasoning on why shared docker layers are useful. It is because of caching. The more images are sharing the same layers the better, as it allows you to cache more stuff. Better caching means faster startup of containers.

Why is docker bad at this? In order to enjoy the caching benefit, each time you build a docker image you want it to output as much existing layers as possible. So running apt-get install python3 today should result in the exact same layer as yesterday, if there are no new updates. But this requires the all the files to be exactly the same, including the metadata like creation time. As docker layers are cached by hashing the files.

Now, Nix already does storing dependencies by hash. So the layers will always be the same with the same version and same configuration.

hamandcheese · 2024-03-16T02:18:13 1710555493

I would rephrase this as:

The Dockerfile format imposes a hierarchical relationship between layers. This quickly becomes very annoying, since dependencies usually form dependency graphs, not dependency trees.

Alternative tools, like nix (probably bazel too), are not bound in the same way. They can achieve fine grained caching by mapping their dependency graph to docker layers, which is something that can not be expressed with a Dockerfile.

runeks · 2024-03-17T15:54:49 1710690889

> The Dockerfile format imposes a hierarchical relationship between layers. This quickly becomes very annoying, since dependencies usually form dependency graphs, not dependency trees.

Isn't a Dockerfile just a sequence of dependencies, rather than a tree?

hamandcheese · 2024-03-17T19:17:28 1710703048

You're right, though it becomes hierarchical once you have multiple Dockerfiles inheriting from some base image (which I did not articulate in my original comment).

cpuguy83 · 2024-03-16T05:21:49 1710566509

Steps in a stage are hierarchical.

The final result need not be. You can build a bunch of things then merge the results in a final stage without any hierarchy (this is "COPY --link" in a Dockerfile).

georgyo · 2024-03-16T09:05:36 1710579936

That requires some very explicit and non-obvious effort to do. It's quite painful to do this properly in Docker.

cpuguy83 · 2024-03-16T16:05:16 1710605116

And the consensus seems to be nix is not straight forward?

georgyo · 2024-03-16T16:29:54 1710606594

And the snake eats it tail.

With nix, re-usability is very high. It's a function that is very baked in at very low levels of it's design. This comes with up front complexity but getting to these reusable layers is basically forced.

Docker is very simple and often touts reusable layers, but in practice is not. Unless you tackle that complexity.

Making reproducible and reusable content takes effort. Other tools are not designed for that. As a result the getting to the same state requires a similar amount of complexity. Worse, with docker you can never be sure that you actually succeeded in your goal of reproducibility.

An analogy could be rust. Rust has up front complexity, but tackling that complexity gives confidence that memory safety and concurrency primitives are done correctly. It's not that C _can't_ achieve the same runtime safety, it's just requires a lot more skill to do correctly; and even then memory exploits are reported on a near daily basses for very popular and widely used libraries.

Complex problems are complex. And sooner or later you'll need to face that complexity.

cpuguy83 · 2024-03-16T17:04:22 1710608662

This is not how docker works. Docker, exactly like nix, is based on a graph of content addressable dependencies.

What you are describing is chaining a bunch of commands together. Yes, this forms a dependency chain stored in separate layers and is part of the cache chain.

Nix suffers the exact same problems with reproducibility. The thing it provides is the toolchain of dependencies that are reproducible. Docker does not provide your dependencies.

If the inputs change then so does the output. If the output itself is not reproducible (like, say an artifact with a build-time embedded in it) then you have something that is inherently not reproducible and two people trying to build the same exact nix package will have different results.

EDIT: Fixed a sentence I apparently got distracted while writing and didn't complete (about layer caching).

Foxboron · 2024-03-16T17:20:47 1710609647

Nix is not content addressable though, the hashes is based off on the derivation files which are equal to the lock files you would find in other package managers.

> The thing it provides is the toolchain of dependencies that are reproducible. [...] If the inputs change then so does the output. If the output itself is not reproducible (like, say an artifact with a build-time embedded in it) then you have something that is inherently not reproducible and two people trying to build the same exact nix package will have different results.

There are no guarantees they are reproducible. The only guarantees Nix gives you is that the build environment is the same which allows you to make some claims about the system behaving the same way. But they are certainly no guarantees about artifacts being bit-by-bit identical.

cpuguy83 · 2024-03-16T17:27:20 1710610040

It's content addressable, just what are you addressing?

The content address of a docker image is a json blob (referencing other objects).

The content address of a Dockerfile "RUN" command is the content address of what came before it and the command being run.

Foxboron · 2024-03-16T17:29:58 1710610198

>It's content addressable, just what are you addressing?

In the case of Nix it's addressed by the input. Not the content of the build. It's an important distinction and one Nix also makes.

https://nixos.org/manual/nix/stable/command-ref/new-cli/nix3...

But doing this is going to give you a slight headace as most of the package repository in Nix is not checked for reproducible builds and there are no way to guarantee the hashes are actually static.

cpuguy83 · 2024-03-16T17:52:19 1710611539

Right, all builds are dependent on their inputs. Your inputs determine your outputs. If your input(s) change, then so does your output.

We are saying the same thing here, I'm just trying to point out this is exactly how docker build works, but rather it is more about what you are willing to put into your docker build.

Foxboron · 2024-03-16T18:24:32 1710613472

I think we are talking past each other. I'm just trying to clear up a misconception on how nix works, not anything about the docker portion of what you have written.

georgyo · 2024-03-16T18:48:58 1710614938

It would seem you don't understand how either work. They are basically opposites in how they actually work.

Docker layers are completely independent from each other. A docker layer is a sha256 sum of that layer. Separately there is an image manifest, which is also fetched with a sha256 of that manifest, states the order of the layers and at runtime those layers are stacked up on each other.

With docker, there is no explicit dependency chain. A layer is just a tarball or some JSON. Some tooling can take advantage of this fact. How nix builds docker images takes advantage of this.

Nix on the other hand, the output hash is not tied to the output hash, because the output hash is irrelevant to how it was produced. You also cannot know in advance the output hash of something. IE. If I do say "echo foo > bar.txt", I cannot know the sha256 sum of bar.txt until the code runs. But before the code runs I can know the hash all the inputs that will create bar.txt.

This fundamental difference means two builds, executing the same code can share the outputs. Provided that the build environment is trust worthy.

cpuguy83 · 2024-03-16T20:05:09 1710619509

You are describing the makeup of an OCI image, which is the _output_ of a typical docker build (and also the output of the nix image builder).

While docker build can/does output OCI images, that is only an output. How that output comes to be is not the output itself, same as the nix side the article is talking about.

georgyo · 2024-03-17T01:09:04 1710637744

> How that output comes to be is not the output itself, same as the nix side the article is talking about.

I see the confusion now. The nix image builders OCI layer's contents are _only_ nix store paths which _do_ include the input.

Nix store paths are are guaranteed to never overlap each other, and such the order of layering them in the docker manifest does not matter. But the docker layers are just tar balls of nix store paths. Each layer in the image has no dependence on previous or future layers at all. It is just one or more nix store paths.

cpuguy83 · 2024-03-19T16:31:51 1710865911

I'm not talking about OCI images (again, that's the _output_). I'm talking about how they are built. OCI images are OCI images, they get extracted the same way no matter if there's conflicting paths or not.

What I'm saying here through multiple different threads is, buildkit and nix build things the same way. `docker build` is not just a Dockerfile builder, its actually a grpc service (with services running on both the docker CLI and in the daemon). This service is actually very generic. It includes builtin support for Dockerfiles, which just converts the Dockerfile format into what buildkit calls "LLB", which is analogous to LLVM IR.

What I'm also saying is, people are comparing "docker build" with a Dockerfile that's using a package manager that's not even provided by docker to nix. This is not an apples to apples comparison, and in fact you can implement nix packaging using buildkit (https://github.com/reproducible-containers/buildkit-nix).

I'm also saying that `Dockerfile` does actually support merging dependencies without being order dependent (this is `COPY --link`). But also, you can drive buildkit operations without going through Dockerfile. You can also plug in your own format with the `syntax=<some/image>` at the top of your file. This isn't "convert to dockerfile", its "convert to LLB", which is all the Dockerfile frontend does.

Finally, I'm saying nix isn't in and of itself some magic tool to have a reproducible build. You still have to account for all the same things. What it does do, at a package management level, is make it easier to not have dependencies that change automatically over time (which has its own plusses and minuses).

m1keil · 2024-03-15T23:51:38 1710546698

In Docker, if the layers are cached, the layer with apt-get won't be automatically invalidated (unless --no-cache or any changes to the upper layers).

whazor · 2024-03-16T05:49:12 1710568152

I am thinking more about pipelines that run daily. Also, this ‘docker cache’ effectively means not running the step. So you might miss important security updates. Via Nix you can ensure that your dependencies are updated. And no updates means same hash.

When said caching, I meant on the nodes that run the containers. With Nix you can also update only one layer, while keeping the other layers the same.

raffraffraff · 2024-03-16T00:25:52 1710548752

But that's what I would expect to happen. I don't see a problem.

eichin · 2024-03-16T00:45:46 1710549946

Won't get invalidated even if what "apt-get install python3" does changes - the cache is only based on the syntax of the RUN string plus the previous layer hash, IIRC. (COPY actually invalidates if the file being copied changes, so maybe there's a way to fetch a hash of the repo and stash it where copy will notice, or something, but then it seems you need external tooling to do that bit?)

m1keil · 2024-03-16T01:07:27 1710551247

I didn't claim there is a problem. The original comment made it sound as if docker will expire a cached layer because the (potential) result of apt-get is different, which isn't the case.

xlii · 2024-03-15T22:41:33 1710542493

I spent last 2-3 days trying to get Docker images built on Darwin and I feel that this article is a universe making fun of me.

Nix is absolutely the best tool for what I want to achieve but it has those dark forsaken corners that just suck your soul out dry.

I love it but sometimes it feels like being a Morty on Rick’s adventure to the compilerland.

takeda · 2024-03-16T04:02:43 1710561763

The big problem is how docker was designed. It is essentially a jail that is supposed to contain a Linux binary.

Things are straight forward on Linux. You build your binary, place in a docker container and you are done. The nix code will also be straight forward. If you can build your code, then creating a container is just one more operation away.

Unfortunately docker requires Linux binary and you are on Mac. So the docker desktop actually runs a Linux VM and performs all operations on it, abstracting this away from you.

Nix doesn't do that and you have two options:

1. Do cross compilation, the problem is that for this to work you need to be able to cross compile down to glibc, the problem is that while this will work for most community used dependencies you might get some package where the author didn't put effort making sure it cross compile. To make things worse the Hydra that populates standard caches that nix uses, doesn't do cross compile builds, so you will run into lengthy processes that might potentially end with a failure.

2. You can have a Linux builder, that you add to your Mac and configure to send build jobs for x86_64-linux to that builder. Now you could have a physical box, create a VM, or even have a NixOS docker container (after all docker ion Mac runs inside of the VM).

The #1 seems like the proper way, while #2 is more of a practical way.

I think you are running into issues, because you're likely trying #1, and that requires a lot of experience not only with Nix, but also with cross compiling. I wish Nix's Hydra would also build Darwin to Linux cross compilation as that would not only provide caches, but also help making sure the cross compilation doesn't break, but that would also increase costs for them.

I think you should try the #2 solution.

Edit: looks like there might have been an official solution to this problem: https://ryantm.github.io/nixpkgs/builders/special/darwin-bui... I haven't used it yet.

indiv0 · 2024-03-16T09:38:45 1710581925

Hydra not populating with cross compile builds is the bane of my existence.

I'm using `clang` from `pkgs.pkgsCross.musl64.llvmPackages_latest.stdenv` to cross-compile Rust binaries from ARM macos to `x86_64-unknown-linux-musl`. It _works_, but every time I update my `flake.nix` it rebuilds *the entire LLVM toolchain*. On an M2 air, that takes something like 4 hours. It's incredibly frustrating and makes me wary of updating my dependencies or my flake file.

The alternative is to switch to dockerized builds but:

1) That adds a fairly heavyweight requirement to the build process

2) All the headache of writing dockerfiles with careful cache layering

3) Most importantly, feels like admitting defeat.

cloudripper · 2024-03-16T14:12:42 1710598362

Not sure if this applies to your situation, but I believe you can avoid a full rebuild by modularizing the flake.nix derivations into stages (calling a separate *.nix for each stage in my case). That is how it appears to be working for me on a project (I am building a cc toolchain without pkgscross).

I pass the output of each stage of a toolchain as a dependency to the next stage. By chaining the stages, changes made to a single stage only require a rebuild of each succeeding stage. The final stage is the default of the flake, so you can easy get the complete package.

In addition, I can debug along the toolchain by entering a single stage env with nix develop <stage>

Not sure if this is the most optimal way, but it appears to work in modularizing the rebuild.(using 23.11)

renewiltord · 2024-03-16T00:44:27 1710549867

I use Orbstack and it works flawlessly to do this. Really good tool. I use Docker to cross-compile for {aarch64,amd64} x {linux,darwin} since not all the cross-compiling is super robust across our stacks (I'm using a specific glibc for one Linux part, etc.). Just a bunch of docker on my Darwin aarch64 and it compiles everything. Good experience.

e40 · 2024-03-16T00:52:13 1710550333

I installed Orbstack and found that I didn't really need it, so I removed the directory in /Applications. Wow, for weeks and weeks I found remnants of it in a lot of places. Very disappointing that it left so much cruft around. They should have an uninstaller. It left a really bad taste and I'm unlikely to try it again.

Before someone asks. I've been using macOS for a long time. I've never seen remnants like this from a program. Sure, there are often directories left in ~/Library/Application Support/, but this was more than that. Unfortunately, I didn't write down the details, but I ran across the bits in at least 3-4 places.

kdrag0n · 2024-03-16T08:31:26 1710577886

Dev here — I've been meaning to update the Homebrew cask to be more complete on zap, but there's a good reason that all of these are needed:

- ~/.orbstack

- Docker context that points to OrbStack (for CLI)

- "source ~/.orbstack/shell/init.zsh" in .zprofile/bash_profile (to add CLI tools to PATH)

- ~/.ssh/config (for convenient SSH to OrbStack's Linux machines)

- Symlinks to CLI tools in ~/.local/bin, ~/bin, or /usr/local/bin depending on what's available (to add CLI tools to existing shells on first install — only one of these is used, not all)

- Standard macOS paths (~/Library/{Application Support, Preferences, Caches, HTTPStorages, Saved Application State, WebKit})

- Keychain items (for secure storage)

- ~/OrbStack (empty dir for mounting shared files)

- /Library/PrivilegedHelperTools (to create symlinks for compatibility)

Not sure what the best solution is for people who don't use Homebrew to uninstall it. I've never liked separate uninstaller apps, and it's not possible to detect removal from /Applications when the app isn't running.

xlii · 2024-03-16T09:29:07 1710581347

IMO documenting this (and uninstall section in GUI with link) would be enough for me. Used that and never felt neglected by devs.

And cough since we’re at with - did you consider Nixpkgs distribution?

I’m slowly moving deeper and deeper into ecosystem and use Home Manager for utilities that I use often (and use nix shell/nix run for one offs). Some packages are strictly GUI and while they aren’t handled flawlessly (self-updaters) it’s nice to have them on a single list.

Yet based on your list it’s a definitely a nixventure…

mdaniel · 2024-03-16T18:02:32 1710612152

> I've never liked separate uninstaller apps

And yet, you are the only(?) one with that knowledge, so the alternative seems to be replying to HN threads with a curated list of things that a user must now open iTerm2 and handle by themselves. Something, unless I'm mistaken, that computers are really good at doing (Gatekeeper and privilege elevation nonsense aside)

Even just linking to the zap portion of your brew cask could go a long way since it would be the most succinct manifest if I correctly understand what it does

e40 · 2024-03-17T21:14:16 1710710056

Thanks. I was able to clean up more items.

I agree this should be documented, but I still appreciate uninstallers.

Also, I'm a little confused about your statement:

> Not sure what the best solution is for people who don't use Homebrew to uninstall it.

You said at the start you've "been meaning to update the Homebrew cask to be more complete on zap" ... does that mean Homebrew uninstall will not do a complete job?

cqqxo4zV46cp · 2024-03-16T01:03:50 1710551030

I’ve found this to be the norm for ‘Docker Desktop alternatives’. Not to say that Orbstack isn’t uniquely messy.

xlii · 2024-03-16T01:13:42 1710551622

I’m also on Orbstack mostly for performance.

But unfortunately cross compiling quickly broke when I started doing mild customization (and one reasons I’m doing this is a complex setup that’s very sensitive to version changes).

In the end solution was to “simply” get darwin.linux-builder up but that pulled a lot of weight behind it.

It works, but it’s not the first time I spent my time on nix-ventures.

bsder · 2024-03-16T01:19:53 1710551993

In Docker, the dark corners have dust. In Nix, the dark corners have a grue.

xedrac · 2024-03-16T01:36:43 1710553003

100% this. Nix may seem better, until something goes wrong and you have to waste your weekend digging into it's depths.

MadnessASAP · 2024-03-16T04:11:56 1710562316

On the flip side, once you have fixed the problem it has a very strong tendency to stay fixed. More importantly, the fix does not typically require me to remember that fix months later.

If something does break, rollbacks are free and an integral part of Nix.

endgame · 2024-03-16T01:26:35 1710552395

https://github.com/gytis-ivaskevicius/high-quality-nix-conte...

This sort of twenty-minute adventure?

deathanatos · 2024-03-16T03:38:01 1710560281

macOS is a definitely rougher. I use colima there, and it does alright. There are one or two bugs with it, but I think those are primarily around volumes. But it does alright with building Docker images.

The rougher part is the speed of it; it's a one-two punch between the hardware & the fact that Docker has to emulate a Linux VM.

miduil · 2024-03-16T15:35:44 1710603344

The way I've set this up for our macos devs at work was a script that runs nix builds inside docker-for-desktop using the official nixos upstream docker image (and some tricks to get ssh forwarding, filesystem mounts, ...) working. Works quite alright. Benefit is you don't need some weird Linux remote builder vm with ssh running.

amouat · 2024-03-20T12:15:35 1710936935

I'm using this quote:

"I love it but sometimes it feels like being a Morty on Rick’s adventure to the compilerland."

jossephus01 · 2024-03-16T03:35:20 1710560120

My experience with building Docker images for Java applications using Nix wasn't very pleasant though. After the deprecation of gradle2nix, there doesn't seem to be a clear alternative method for building Docker images for Gradle-based Java applications. I challenged a friend to create the smallest possible Docker image for a simple Spring Boot application some time ago. While I was using Nix, the resulting image was twice the size of the image built without Nix. You can check out the code for yourself here: https://github.com/jossephus/Docker_challenge/blob/main/flak... .

tadfisher · 2024-03-16T05:13:32 1710566012

That's because you're including two JDKs, zulu and the one that gradle includes via its jdk argument. Look for gradleGen in nixpkgs to see what I mean.

And sorry for gradle2nix, I'm working on an improvement that's less of a hack.

jossephus01 · 2024-03-16T05:31:55 1710567115

Thanks tadfisher, I will check it out. This is by no means meant to be a dunk on gradle2nix. Love your work on android-nixpkgs and I will be looking for the alternative. Thanks.

rapnie · 2024-03-16T06:55:38 1710572138

> And sorry for gradle2nix, I'm working on an improvement that's less of a hack.

Don't be. Thanks for your work. Excited to learn about the improvement. Can you tell more about what you have in mind?

tripdout · 2024-03-16T19:24:26 1710617066

Hey, also wanted to thank you for android-nixpkgs - it's great

takeda · 2024-03-16T05:14:19 1710566059

I haven't used java in over a decade so won't be able to help much with that, but for example I was able to get my application to fit in just 70MB container including python and all dependencies + busybox and tini

It looked something like this: https://gist.github.com/takeda/17b6b645ad4758d5aaf472b84447b...

So what I did was:

- link everything with musl

- compile python and disable all packages that I didn't use in my application

- trim boto3/botocore, to remove all stuff I did not use, that sucker on it's own is over 100MB

The thing is what you need to understand is that the packages are primarily targeting the NixOS operating system, where in normal situation you have plenty of disk space, and you rather want all features to be available (because why not?). So you end up with bunch of dependencies, that you don't need. Alpine image for example was designed to be for docker, so the goal with all packages is to disable extra bells and whistles.

This is why your result is bigger.

To build a small image you will need to use override and disable all that unnecessary shit. Look at zulu for example:

https://github.com/NixOS/nixpkgs/blob/master/pkgs/developmen...

you add alsa, fontconfig (probably comes with entire X11), freetype, xorg (oh, nvm fontconfig, it's added explicitly), cups, gtk, cairo and ffmpeg)

Notice how your friend carefully extracts and places only needed files in the container, while you just bundle the entire zulu package with all of its dependencies in your project.

Edit: tadfisher seems to be more familiar with it than me, so I would start with that advice and modify code so it only includes a single jdk. Then things that I mentioned could cut the size of jdk further.

Edit2: noticed another comment from tadfisher about openjdk_headless, so things might be even simpler than I thought.

chrisandchris · 2024-03-16T10:32:23 1710585143

I've never used Nix, but this looks like hell'a of an unreadable config file (compared to docker)? How do you manage these files?

takeda · 2024-03-16T22:09:35 1710626975

This does far more than Dockerfile though.

- it contains information how to actually build the application

- how to set up a dev environment

- how to build application with musl

- how to build application with glibc

- how to build python with only with expat, libffi, openssl, zlib packages

- how to take botocore and patch it up to only have cloudformation, dynamodb, ec2, elbv2, ssm, sso, sts clients

Try to get all of that into a single Dockerfile and see how complicated mess you end up with.

The actual docker configuration is here:

https://gist.github.com/takeda/17b6b645ad4758d5aaf472b84447b...

It might be still confusing to you at first, as you're used to list of incremental steps how to get to the final result, while this description instead is declarative (you're describing not the steps to do, but what the final image should be).

It's basically comparing bash script with bunch of "aws" CLI invocations to a terraform or cloudformation file.

whateveracct · 2024-03-16T13:26:19 1710595579

It's not actually unreadable - you just have to learn convention on top of the Nix language. For instance, what mkDerivation does. Actually, the Nix language usage here is somewhat minimal. Mostly let bindings (aka lambda calculus).

I wouldn't expect a layman to be able to grok that file. That's fine though - it's not for laymen.

guitarbill · 2024-03-16T14:21:52 1710598912

> It's not actually unreadable - you just have to learn convention on top of the Nix language. For instance, what mkDerivation does. Actually, the Nix language usage here is somewhat minimal. Mostly let bindings (aka lambda calculus).

> I wouldn't expect a layman to be able to grok that file. That's fine though - it's not for laymen.

This is the kind of comment that makes me want to stay far, far away from Nix and the Nix "community".

whateveracct · 2024-03-16T16:56:57 1710608217

Why? Saying that Nix is complicated and isn't trivial to use or read without learning prerequisite knowledge is bad now?

I actually pointed out that mkDerivation is something helpful to learn - that's one thing I wish someone made me sit and learn when I first got exposed to Nix. It unlocks a lot.

chrisandchris · 2024-03-17T06:51:50 1710658310

I wouldn't state it's _bad_. It just adds another layer of complexity (by, for sure, also giving something back) and as someone not working in Fortune 500 (but rather in a SME with <20 people), another layer of conplexity & another language is sonetimes just not feasable.

takeda · 2024-03-16T22:24:27 1710627867

I think whateveracct was referring to is this link:

https://github.com/NixOS/nixpkgs/blob/master/pkgs/developmen...

What that file is doing, is building a package, and it essentially is a combination of what Makefile and what RPM spec file does.

I don't know if you're familiar with those tools, but if you aren't it takes some time to know them enough to understand what is happening. So why would be different here?

elbear · 2024-03-16T10:49:39 1710586179

What do you mean by manage?

I agree with your assertion regarding the language though. I think nix-lang makes it harder to get into Nix.

jossephus01 · 2024-03-16T05:23:07 1710566587

You are correct. I havent done any trimming. Thanks for the suggestions and the gist. Thanks

takeda · 2024-03-16T05:35:39 1710567339

I found this discussion and contains code fragments and links that might help.

https://discourse.nixos.org/t/how-to-create-a-docker-image-w...

markelliot · 2024-03-16T08:05:05 1710576305

Hard to beat jib (https://github.com/GoogleContainerTools/jib/tree/master/jib-...) for minimal Java OCI containers.

tadfisher · 2024-03-16T05:25:02 1710566702

Oh, and openjdk_headless skips the GTK and X dependencies that you won't need for Spring.

okr · 2024-03-16T05:56:16 1710568576

That's interesting. We have some applications, that produce PDFs, which use fonts, which usually requires a non-headless (headfull?) jdk. At AWS i wonder, what the default alpine jdk contains. And how much space could be saved, if people were more aware, that they can use a headless one.

hawk_ · 2024-03-16T08:36:30 1710578190

> headfull?

Wonder if there is a good term for this. I have been jokingly referring to this as 'headed' and headless as 'beheaded'.

okr · 2024-03-16T11:45:28 1710589528

Axed! :)

max-privatevoid · 2024-03-16T13:10:31 1710594631

I decided to participate in your challenge and cleaned up your Nix code a little bit. It seems like the main task of the challenge is building a really small JRE.

I've switched to using a headless OpenJDK build from Nixpkgs as a baseline instead of Zulu, to remove all the unnecessary dependencies on GUI libraries. Then I've used pkgs.jre_minimal to produce a custom minimal JRE with jlink.

The image size now comes out to 161MB, which is slightly larger than the demo_jlink image. This is because it actually includes all the modules required to run the application, resulting in a ~90MB JRE. The jdeps invocation in Dockerfile_jlink fails to detect all the modules, so that JRE is only built with java.base. Building my minimal JRE with only java.base brings the JRE size down to about 50MB, the resulting (broken) container image is 117MB according to Podman.

I've also removed the erroneous copyToRoot from your call to dockerTools.buildImage, which resulted in copying the app into the image a second time while the use of string context in config.Cmd would have already sufficed.

I've also switched to dockerTools.buildLayeredImage, which puts each individual store path into its own image layer, which is great for space scalability due to dependency sharing between multiple container images, but won't have an impact for this single-image experiment.

This is mostly a JRE size optimization challenge. The full list of dependencies and their respective size is as follows:

  /nix/store/v27dxnsw0cb7f4l1i3s44knc7y9sw688-zlib-1.3                            125.6K
  /nix/store/j6n6ky7pidajcc3aaisd5qpni1w1rmya-xgcc-12.3.0-libgcc                  139.1K
  /nix/store/l0ydz31lwa97zickpsxj2vmprcigh1m4-gcc-12.3.0-libgcc                   139.1K
  /nix/store/a3n1vq6fxkpk5jv4wmqa1kpd3jzqhml9-libidn2-2.3.4                       350.4K
  /nix/store/s5ka5vdlp4izan3nfny194yzqw3y4d1z-lcms2-2.15                          445.3K
  /nix/store/a5l3w6hiprvsz7c46jv938iij41v57k6-libjpeg-turbo-2.1.5.1                 1.6M
  /nix/store/r9h133c9m8f6jnlsqzwf89zg9w0w78s8-bash-5.2-p15                          1.6M
  /nix/store/3dfyf6lyg6rvlslvik5116pnjbv57sn0-libunistring-1.1                      1.8M
  /nix/store/a3zlvnswi1p8cg7i9w4lpnvaankc7dxx-gcc-12.3.0-lib                        7.5M
  /nix/store/657b81mfpbdz09m4sk4r9i1c86pm0i8f-app-1.0.0                            19.0M
  /nix/store/1zy01hjzwvvia6h9dq5xar88v77fgh9x-glibc-2.38-44                        28.8M
  /nix/store/b1fhkmscb0vff63xl8ypp4nsc7sd96np-openjdk-headless-minimal-jre-21+35   91.4M

There's not much else that can be done here. glibc is the next largest dependency at ~30MB. This large size seems to be because Nixpkgs configures glibc to be built with support for many locales and character encodings. I don't know if it would be possible or practical to split these files out into separate derivations or outputs and make them optional that way. If you're using multiple images built by dockerTools.buildLayeredImage, glibc (and everything else) will be shared across all of them anyway (given you're using roughly the same Nixpkgs commit).

https://github.com/max-privatevoid/hackernews-docker-challen...

jossephus01 · 2024-03-16T14:45:50 1710600350

These changes are all great. Learnt a lot from the optimizations. Thanks.

yjftsjthsd-h · 2024-03-16T04:58:22 1710565102

> While I was using Nix, the resulting image was twice the size of the image built without Nix.

I would be very interested to know where the difference is; is nix including things it doesn't need to? Is the non-nix build not including things it should?

jossephus01 · 2024-03-16T05:24:27 1710566667

I have included the result of running dive on the resulting image. You can check it out on https://github.com/jossephus/Docker_challenge/wiki.

As stated above, I havent done any trimming on the resulting image, so There's too many stuff in the image.

wbl · 2024-03-16T05:18:37 1710566317

Don't you just stick the JAR in?

operator-name · 2024-03-15T21:12:26 1710537146

This is great if you've already adopted Nix, and I'd love for nothing than more declarative package management solutions like Nix or Guix to take off.

If you're already using Docker but want to gradually adopt Nix, there is an alternative approach outlined by this talk: https://youtu.be/l17oRkhgqHE. Instead of migrating both the configuration AND container building to Nix straight away, you can keep the Dockerfile to build the nix configuration.

The biggest downside is that you don't take advantage of layers at all, but the upside is that you can gradually adapt your Dockerfiles, and reuse any Docker infrastructure or automation you already use.

FiberBundle · 2024-03-16T00:27:42 1710548862

[1] also uses this approach.

[1] https://mitchellh.com/writing/nix-with-dockerfiles

AtlasBarfed · 2024-03-16T04:14:21 1710562461

So one of the pillars of the article is that docker builds aren't reproducible, but Nix is.

But... is a lot of that irreproducibiity (apologies for that word) because there's no guarantee one of the docker layers will be available?

And... does Nix have some guarantee to the end of the universe that package versions will stay in the repository?

operator-name · 2024-03-16T08:37:46 1710578266

I'd give this article a read, as it can explain it more clearly than I can: https://serokell.io/blog/what-is-nix

But to briefly answer your specific questions: Docker files are commonly not reproducible because they contain arbitary stateful commands like `apt-get update`, `curl`, etc. For a layer with these kinds of commands to be reproducible you would need a mechanism to version and verify the result.

Nix provides such a mechanism, and a community package repository with versioned dependancies between packages. These are defined in a domain specific language called Nix (text files) and kept into a git repository. This should be familiar if you've used a package manager with lock files before.

You can guarentee the package version will stay in the repository by pinning your build to an exact commit hash in the repository.

msteffen · 2024-03-29T05:13:21 1711689201

Really random, but the illustrator who made the slide art for this (Annie Ruygt) also made logos and brand art for two YC startups, RethinkDB (rethinkdb.com) and Pachyderm (pachyderm.io) (where I worked, and which was founded by former RethinkDB engineers). She does great work!

jrockway · 2024-03-16T01:44:26 1710553466

I spent a half a day or so relatively recently trying to build our CI base image with Nix (at the recommendation of our infra team), but it was huge, and some stuff didn't work because of linking issues.

One issue that really bugged me was to build multi-arch images, it actually wants to execute stuff as the other architecture, and only supports using qemu with hardware virtualization for that. My build machine (and workstation) is a VM, so I don't have that. I do have binfmt-misc, though, so if you just happened to fork and exec the arm64 "mkdir" to "mkdir /tmp", it would have worked. Of course, this implementation is a travesty when docker layers are just tar files, and you can make the directory like this:

    echo "tmp uid=0 gid=0 time=0 mode=0755 type=dir" | bsdtar -cf - @-

(As an aside, I'm sure this exact layer already exists somewhere. So users probably don't even have to download it.)

Every time I try nix, I feel like it's just a few months away from being something I'd use regularly. nixpkgs has a lot of packages, everything you could ever want. They all install OK onto my workstation. But "I need bash, python, build-essential, and Bazel" doesn't seem like something they're targeting the docker image builder at. I guess people just want to put their go binary in a docker image and ... you don't need nix for that. Pull distroless, stick your application in a tar file, and there's your container. (I personally use `rules_oci` with Bazel... but that's all it does behind the scenes. It just has some smarts about knowing how to build different binaries for different architectures and assembling and image index yaml file to push to your registry.)

viraptor · 2024-03-16T02:08:51 1710554931

> to build multi-arch images, it actually wants to execute stuff as the other architecture

You should be able to cross compile binaries for other architectures without actually running them. As long as the package's build files support it of course.

> and only supports using qemu with hardware virtualization for that

That doesn't sound right. You can use qemu for architectures that be only software emulated too.

The minimal example is discussed here:

https://discourse.nixos.org/t/how-do-i-get-a-shell-nix-with-...

I don't want to say it should be as simple as using pkgCross (https://nix.dev/tutorials/cross-compilation.html), but... are some specific issues with the usual process that you're running into?

l0b0 · 2024-03-16T03:47:51 1710560871

  >  I spent a half a day or so relatively recently trying to build our CI base image with Nix (at the recommendation of our infra team), but it was huge, and some stuff didn't work because of linking issues.

You must be talking about the official Nix Docker image[1], which indeed is huge. I've been using it for years for a handful of projects, but if the size is an issue you can use the method mentioned in the article and build a very minimal image with only the stuff you specify.

[1] https://hub.docker.com/r/nixos/nix/tags

bfrog · 2024-03-16T02:23:05 1710555785

Hmm? Cross compiling to docker images is exactly what I used nix for. I even had musl being used, it was the smallest image I could build with any tool and built the images quickly and consistently in ci with caching working well.

I never saw went being used so im a bit confused where that came into play for you

operator-name · 2024-03-16T08:50:17 1710579017

What did your final Nix and Docker file look like, and did you have to use `buildFHSEnv` at all to support the odd 3rd party binaries?

I think Nix really needs some articles outlining how to play well and smoothly transition from an existing system piece by piece.

tuananh · 2024-03-16T00:47:11 1710550031

as platform engineer, i want to like nix. but it's not easy for everyone else.

and the dx is still pretty bad IMO.

for example, i prefer devbox DX just because i can add pkg like this `devbox add python@3.11`.

also, looking at 120 lines flake.nix. it's not exactly "easier"

https://github.com/Xe/douglas-adams-quotes/blob/main/flake.n...

Cyph0n · 2024-03-16T01:31:44 1710552704

That's kind of an unfair comparison. The flake you linked:

1. Defines a Go binary (i.e., how to build it)

2. Defines a Docker image that uses said Go binary as an entry point

3. Defines a NixOS module that creates a systemd service that runs the Go binary (only relevant on NixOS)

4. Defines a NixOS test for the module that ensures that the NixOS module actually creates a systemd service that runs the Go binary as expected. The NixOS test framework is actually quite impressive - tests run in a QEMU VM that also runs NixOS :)

Note that only (1) and (2) are relevant to the linked article (+ some of the surrounding boilerplate).

hamandcheese · 2024-03-16T02:11:29 1710555089

I agree that it's not a fair comparison, but I will add that this is a big barrier to newcomers. Everyone experienced with nix builds their own ivory tower of a nix flake (myself included), so it's hard to find actually good examples of how to do basic things without wading through a bunch of other bullshit.

rapnie · 2024-03-16T02:57:46 1710557866

Newcomer here. Could anyone tell if std [0] is a good way to bring more sanity into flake design, esp. in avoiding ivory towery custom approaches? Using devenv.sh is another option, but I liked emphasis on creating a common mental picture of the architecture and focus on SLDC that std provides.

[0] https://std.divnix.com

hamandcheese · 2024-03-16T03:11:03 1710558663

I haven't used std, but I would like to point you at what I think is the ideal way to organize a lot of nix: readTree from the virus lounge[0].

It doesn't add kitschy terms like "Cell" and "growOn", it's just a way to standardize attribute names according to where things live in the filesystem.

So in their repo, /foo/bar/baz.nix has the attribute path depot.foo.bar.baz

I will say that to understand how it works you need to have a solid grasp of the nix language. But once established I think it's a pattern that noobs could learn in a very quick and superficial way.

[0]: https://cs.tvl.fyi/depot/-/blob/nix/readTree/README.md

Cu3PO42 · 2024-03-17T01:17:22 1710638242

I really like this. In fact, I have an extremely similar home-grown abstraction. I might look into standardizing on this.

rapnie · 2024-03-16T03:20:14 1710559214

Thank you! I already bumped into the virus lounge, with their TVIX project [0] that I found quite interesting.

[0] https://code.tvl.fyi/about/tvix

OJFord · 2024-03-16T14:19:38 1710598778

What on Earth is the background reading supposed to be for https://std.divnix.com/explain/why-std.html, a supposedly motivating page for newcomers?

I've even used Nix a little bit (though early days, before flakes) and it makes absolutely no sense to me.

rapnie · 2024-03-16T15:41:41 1710603701

Well, I think that becomes clearer when reading comments on this HN thread. I found std after spending significant time to find out 1) wth is Nix/NixOS? 2) what would I use it for exactly? and 3) How to get going? Then on 3) I found reams of outdated or confusing docs making me doubt 1) and 2) again, as well as the frustrations of others on this.

Then this std background of "We bring clarity, clear mental picture, manageable Nix projects throughout the lifecycle." appealed a lot.

OJFord · 2024-03-16T16:13:43 1710605623

Does cell/cell block/target/actions terminology come from Nix then? I assumed that was std's solution (since it is under the heading Solution, and I haven't heard of it) so found the 'explanation' of it baffling. But if it comes from Nix it can be read sort of like a style guide for how to do what you're already doing with Nix?

If the target audience is limited to those already highly experienced (and yet frustrated) with Nix then I guess it's fine.

rapnie · 2024-03-16T17:41:42 1710610902

Ah, sorry I misinterpreted you before. Yes, these are std's abstractions to organize your Nix code and gradually 'grow' your solution. Rationale and explainers on these concepts are more spread about in the docs. The 'sales pitch' is another high-level txt than the one you passed on why to use std.

(PS. This cell breakdown reminded me a bit of Atomic Design for front-end UI to make that easier.)

Cyph0n · 2024-03-16T02:33:34 1710556414

I'm a beginner myself and have been trying my best to keep things simple. But I do agree that the complexity creep is quite tempting with Nix. Not sure why though..

hamandcheese · 2024-03-16T03:15:04 1710558904

I think nix has a fair amount of gravity - once you've started, assuming like it, you will quickly want to use it for everything.

I don't think most people's flakes are more complex than the alternative (which would be, I don't know, a bunch of different script, maybe some ansible playbooks?) but it is a bit daunting when all that complexity is wrangled into a single abstraction.

MadnessASAP · 2024-03-16T04:04:11 1710561851

I thought I was weird for my bespoke ivory tower monorepo flake.nix, glad to hear I'm not the only one. It has been a tremendous help in managing my homelab.

mkleczek · 2024-03-16T04:40:04 1710564004

Does Nix make it difficult to properly modularise nix files (flakes or not)?

Because it certainly looks like the things you listed are separate/orthogonal and should be in separate modules/files.

Having many years of Java experience this is the reason why I stick to Maven (not moving to Gradle) - it is opinionated and strongly encourages fine grained modularisation.

rfoo · 2024-03-16T10:05:23 1710583523

The thing I hate most about a Java codebase is 100 separate .java files with 30-50 lines each and somehow it indeed worked but I have no idea where to look at if I want to find out how is something implemented.

mkleczek · 2024-03-16T11:33:14 1710588794

Indeed - that’s very often the case - the right balance between too many and too big is not easy to find.

Having said that - Java is in general IDE targeted language and once you have an IDE - many small files is not an issue anymore.

yjftsjthsd-h · 2024-03-16T05:00:45 1710565245

> Because it certainly looks like the things you listed are separate/orthogonal and should be in separate modules/files.

Nix absolutely allows that, to the point where I'm surprised that the linked example doesn't separate them. Most of my flakes have a flake.nix that's just a thin wrapper around a block that looks like

    devShell = import ./shell.nix { inherit pkgs; };
    defaultPackage = import ./default.nix { inherit pkgs; };
    ...

yencabulator · 2024-03-20T16:56:22 1710953782

The thing that makes that miserable is all the inputs need to be declared in the top-level flake. They really should make using flakes in subdirectories of the same git repo painless. (Right now they're treated as if they were remote, and you need to "upgrade" them after every change; unusable.)

mkleczek · 2024-03-16T05:11:50 1710565910

And what's the story with generic libraries that can later be used in your nix files to produce desired output?

I am aware of flake-parts that are supposed to offer that but the ecosystem of flake-parts is on the smaller side of things...

lkjdsklf · 2024-03-16T07:43:07 1710574987

Maybe I'm not understanding your question, but this is what the inputs of flakes are for.

You can pull in arbitrary code from pretty much anywhere as an input

mkleczek · 2024-03-16T08:29:42 1710577782

The question is if there actually _is_ a rich ecosystem of such libraries - similar to the rich ecosystem of Maven plugins.

georgyo · 2024-03-16T08:50:47 1710579047

I think something got lost along the way. Nix does not replace maven, you call maven from nix.

https://ryantm.github.io/nixpkgs/languages-frameworks/maven/

nix is a build tool a similae way that docker is a build tool. You define build scripts that call the tools you already use. The major difference is that a docker file gives you no way tl be sure you can reproduce that build in the future. A flake on the other hand gives you a high degree of trust.

mkleczek · 2024-03-16T09:50:49 1710582649

Nothing got lost: both Nix and Maven are dependency management tools. Both are also build tools. The difference is that Maven was created as a Java build tool (and it stayed that way in general).

What we have today is that there is a multitude of dependency managers and - what's worse - all of them are _also_ build tools targeted at specific language.

Nix has a unique position because it is not language specific. Where it is lacking is missing standards and reusable libraries that would simplify common tasks.

I am comparing to Maven because I am looking for multi-platform Maven alternative. There is Bazel but its dependency management is non-existent. There is Buck2 which is great in theory but the lack of ecosystem makes it a non-starter.

Nix is the only contender in the space that offers almost everything and has a chance to become a de-facto standard of software delivery thanks to this.

What's missing though is... easy to use canned solutions similar to maven plugins.

EDIT: grammar

georgyo · 2024-03-16T10:18:41 1710584321

In the link I referenced, it shows you how to use maven plugins in nix.

Nix has composable, reusable, and shareable functions. Nix is a full, be awkward, programming language. You'll find functions and flakes for nearly everything you might want to do. An example of one that is more plugin like is sops-nix.

Though have never used maven or maven plugins, I may be missing your overall point.

mkleczek · 2024-03-16T10:43:34 1710585814

In Java/Maven ecosystem a lot of things is simple because there is a huge ecosystem of easy to integrate libraries/plugins.

Want a packaged spring boot application? There is a plugin for that. Want to package it in a container image? Just add a plugin dependency. Want to build an RPM or deb package? Add another two plugin dependencies. All various artifacts are going to be uploaded to a repository and made available as dependencies.

Missing a specific plugin? You can easily implement it in Java as a module and use it inside your project (and expose it as a standalone artifact as well).

I can’t find anything similar in Nix ecosystem.

Having a language allowing for this is not the same as having solutions already available.

elbear · 2024-03-16T10:58:43 1710586723

I was reading this thread and now I finally understood what you mean by plugins. Plugins kind of exist in Nix, except they're not called that. They are functions available in some specific module.

For example, `dockerTools` provides different functions like creating an image and other things. There is a module of fetchers, functions that retrieve source files from GitHub and other sources.

But I don't think there are many language-specific functions, like the ones you are describing. I can't think of any, except for the ones that build a Nix package from a language-specific package.

mkleczek · 2024-03-16T11:26:14 1710588374

There are attempts like

https://flake.parts/

or

https://github.com/nix-community/flakelight

Their aim is to create an ecosystem of reusable Nix libraries. But it is tiny.

elbear · 2024-03-16T18:22:03 1710613323

Yes, I know about flake.parts (didn't know about the other one). But I'm not aware of the kind of libraries you mentioned. There's FlakeHub[0], which is like a package index for flakes, so maybe we'll start to see there reusable stuff.

[0]: https://flakehub.com/flakes

mkleczek · 2024-03-16T20:48:40 1710622120

To be honest, now when I'm thinking about it - it seems to me Nix main weakness here is that it is a separate language and runtime.

Writing a Maven plugin (ie. a reusable piece of configuration management logic) is easy because you can use any library from the vast Java ecosystem (just add a library as a dependency of your plugin).

Doing the same in Nix requires recreating these libraries in Nix.

Looks like Maven might simply be the right choice...

elbear · 2024-03-18T16:42:21 1710780141

You can have a hybrid approach, where you use Nix to provide the build environment (which includes Maven, for example). Then you use the Java-specific tools for the build itself. This should ensure your build has a high degree of reproducibility.

mkleczek · 2024-03-18T18:23:31 1710786211

That won't cut it as I need a multi-language build.

And don't get me wrong - I know it _can_ be done in Nix (as it is a programming language) - the question is _how easy_ it is to create/maintain it.

Let's say my solution requires a Java Spring Boot application, React client, custom Postgres extension and Python ML code.

elbear · 2024-03-19T06:01:11 1710828071

Well, you can provide in your environment all the binaries needed:

- Java stuff

- node and npm

- I don't know what you would need for Postgres extensions

- Python, pip or poetry (or whatever you use for Python package management)

Here is an example flake: https://bpa.st/ZUQQ

Under `devShells`, you have `packages`. You can put there any package from nixpkgs, not just Python packages like I have there.

To make it cleaner, you could have a variable for each platform and concatenate them under packages like

    `packages = javaPkgs ++ nodePkgs ++ pgPkgs ++ pyPkgs`.

Cyph0n · 2024-03-16T05:07:29 1710565649

Yes, to the point where it can become more confusing than helpful. The fact that Nix handles merging data structures for you makes it easy to fall into over modularization.

viraptor · 2024-03-16T01:34:42 1710552882

Those 120 lines are not exactly representative. For example if I was writing this for a single service, I wouldn't bother making a new module and would inline it instead. Then, you've got many lines which would map 1:1 to an inlined systemd service description, so you're not getting rid of those whatever the system you choose. There's also a fancy way to declare multiple systems with an override.

This example is a "let's do a trivial thing the way you'd do a big serious thing". If you wanted to treat it as a one-off, I'm sure you could cut it down to 40 lines or so.

whydoineedthis · 2024-03-16T02:31:44 1710556304

So I can write a Dockerfile using 17 verbs (13, actually), and it's understandable language to a 4th grader... or I can write 120 lines of complete abstract nix code that means nothing to someone doing software for 20 years.

Hrmmmm...this is such a TOUGH decision.

m463 · 2024-03-16T02:57:33 1710557853

I would like to mention one pet peeve of mine wrt docker...

cramming everything into one RUN line to save space in a layer.

I really wish instead of:

  RUN foo && \
      bar && \
      bletch

You could do:

  LAYER
  RUN foo
  RUN bar
  RUN bletch
  LAYER

or something similar.

maybe even during development you could do:

  docker build --ignore-layer .

then at the end:

  docker build .

sre2 · 2024-03-16T08:45:39 1710578739

Heredocs (described in the docs[1] under "shell form") was introduced 3 years ago[1]. The flag '--squash' has been in podman for quite a while[2]. It might be time for you to upgrade your tooling.

[0]: https://docs.docker.com/reference/dockerfile/#shell-form

[1]: https://www.docker.com/blog/introduction-to-heredocs-in-dock...

[2]: https://docs.podman.io/en/latest/markdown/podman-build.1.htm...

viraptor · 2024-03-16T03:43:16 1710560596

Cramming things into one layer can save gigabytes in size. Even if it saves a 100MB, it adds up between multiple CI runs and in deployment latencies.

Docker should address it one day, but until then, we just have to do it in real world scenarios.

a_t48 · 2024-03-16T04:03:46 1710561826

Parent comment agrees with you, is just asking for more ergonomic syntax

viraptor · 2024-03-16T04:06:42 1710562002

I get it, just providing more context. Should've phrased it nicer.

yjftsjthsd-h · 2024-03-16T05:04:07 1710565447

An alternative solution... for some definition of the term... is to write a "naive" Dockerfile like

    RUN foo
    RUN bar
    RUN baz

and then just build it with something like... I think kaniko did this last I looked?... that smashes the whole thing into a single layer. Obviously that has other tradeoffs (no reuse of layers) but depending on your usecase it can be a good trade to make (why yes, I did cut my teeth in an environment where very few images shared a base layer, why do you ask?).

IshKebab · 2024-03-16T08:30:57 1710577857

Docker itself can do that now too.

m463 · 2024-03-17T08:24:06 1710663846

I think granular layering is still useful.

It's not always best or most efficient to squash everything into one giant layer.

viraptor · 2024-03-16T03:39:55 1710560395

I addressed the 120 lines already and they're not completely abstract. You seem uncomfortable with an alternative approach and that's fine. But this is not a good intentions argument.

xena · 2024-03-16T02:51:50 1710557510

I've used this app for like four different talks over the years, I could clean it up, but then I break code samples in my talks.

kaba0 · 2024-03-16T11:25:33 1710588333

Dockerfile’s are absolutely not something a 4th grader would understand. It looks familiar to you, because you have already learnt it. They are definitely not trivial to understand before that, and the same is also true for Nix.

klntsky · 2024-03-16T00:53:07 1710550387

These 120 lines do quite a lot more, don't they?

zer00eyz · 2024-03-16T03:24:36 1710559476

its 120 lines of code to deal with a binary and its systemd setup.

That binary + config file are effectively as close as we're going to get to a "flat pack" on linux.

Im not sure what is the forest and where are the trees but this example shows exactly what we have lost sight of.

djaouen · 2024-03-15T21:15:04 1710537304

I just wanted to chime in here and say that Guix also has a nice and easy-to-use Docker option with "guix pack -f docker" [1]. Guix also has the advantage of using an already-used language (Guile/Scheme) rather than its own bespoke one. :)

[1] https://guix.gnu.org/manual/en/html_node/Invoking-guix-pack....

j-bos · 2024-03-15T22:00:24 1710540024

I like the article but had a hard time following the specifics of the configs and commamds. Feels like it's more meant for people already familiar with nix, or sufficiently interested to study up while reading

earthling8118 · 2024-03-16T01:47:56 1710553676

That's very interesting, because as someone familiar with nix my take was that this was information I considered to be aimed at people who weren't familiar.

denysvitali · 2024-03-16T03:11:46 1710558706

Unfortunately the result of a Nix Docker image is an image that is 100+ MB for no particular reason :(

l0b0 · 2024-03-16T03:36:55 1710560215

Where on Earth are you getting that result from? [1] gives an 11 MB image.

[1] https://nix.dev/tutorials/nixos/building-and-running-docker-...

denysvitali · 2024-03-16T06:58:23 1710572303

From experience - but I'm more than happy to be proven wrong as I would love to build all the Docker images with Nix.

What you linked is the equivalent of:

FROM scratch

COPY hello-world /hello-world

Of course that's small (hello-world is statically linked). Try to add coreutils (or any other small package) and you'll see what I mean. In my experience the size of a Docker image built with some nix packages is greater than the Debian counterpart. I don't know why though.

georgyo · 2024-03-16T09:13:21 1710580401

No, that dockerfile is not equivalent because the hello-world is not statically built in the nix version.

However I'll give you that it could be smaller in more complex examples. For example glibcLocales is for all locales which is quite chunky but your application only needs one locale.

TeeMassive · 2024-03-16T03:36:08 1710560168

There are ways to properly build a nix container image so this kind of things doesn't happen. You'll find plenty of projects on GitHub dedicated to only that.

denysvitali · 2024-03-16T15:21:08 1710602468

Can you please provide an example? Everything I've tried ended up being way bigger than I think it should

torcete · 2024-03-16T10:52:59 1710586379

Coincidentally, Two days ago I was trying to adapt a flake to include a docker derivation. I came across with xelaso's page and inspired by the example provided (and after a few tries) I manage to compose a docker image. That was very cool! BTW: Thanks Xelaso.

febed · 2024-03-15T22:09:08 1710540548

I didn’t fully grok how this works - what is the base image for the generated image? Also wouldn’t the image size be large if the glibc is copied over again

aidenn0 · 2024-03-15T22:20:31 1710541231

> what is the base image for the generated image?

Default is none (i.e. like "FROM scratch" in a Dockerfile); you can specify a baseImage if needed, but I haven't had to yet. It works by copying parts of the nix store into the image as needed, but see also below.

> wouldn’t the image size be large if the glibc is copied over again

The original Nix docker-tools buildImage did suffer from poor reuse of common dependencies. Docker already has a way to reuse parts of images (e.g. if you build 7 images where the first N lines of a Dockerfile are the same, the 7 images will use a shared store for the results of running the first N lines). There are several backends for Docker storage that accomplish this in various ways (e.g. FS overlays, tricks with ZFS/btrfs snapshots).

Nix docker-tools now has a "buildLayeredImage" that uses this ability of Docker to share much of the storage for the dependencies, so if you build several images that all rely on glibc, you only pay the cost of storing glibc in docker once.

febed · 2024-03-15T22:42:39 1710542559

Thanks, that made the article clearer for me