How io_uring and eBPF Will Revolutionize Programming in Linux (scylladb.com)
709 points by harporoeder on Nov 26, 2020 | 320 comments


Ah, the funny things we read about in 2020.

In 1985... yes I said 1985, the Amiga did all I/O through sending and receiving messages. You queued a message to the port of the device/disk you wanted, and when the I/O was complete you received a reply on your port.

The same message port system was used to receive UI messages. And filesystems, built on top of the drive system, also used ports/messages. So did serial devices. Everything.

Simple, asynchronous by nature.

As a matter of fact, it was even more elegant than this. Devices were just DLLs with a message port.


And it worked well, with 512K of memory, in 1985.

The multitasking was co-operative, and there was no paging or memory protection. That didn't work as well (But worked surprisingly well, especially compared to Win3.1 which came 5-6 years later and needed much more memory to be usable).

I suspect if Commodore/Amiga had done a cheaper version and not sucked so badly at planning and management, we would have been much farther along on software and hardware by now. The Amiga had 4-channel 8-bit DMA stereo sound in 1985 (which with some effort could become 13-bit 2-channel DMA stereo sound), a working multitasking system, 12-bit color high-resolution graphics, and more. I think the PC had these specs as "standard" only in 1993 or so, and by "standard" I mean "you could assume there was hardware to support them, but your software needed to include specific support for at least two or three different vendors, such as Creative Labs SoundBlaster and Gravis UltraSound for sound."


Something else that's mentioned less than the hardware side is AmigaDOS and AmigaShell, which were considerably more sophisticated than MS-DOS, and closer to Unix in power (e.g. scripting, pipes, etc.).

The fate of Amiga is so infuriating. It's mind-boggling to think how Microsoft was able to dominate for so long with clearly inferior technology, while vastly superior tech (NeXT, Amiga, BeOS) lost out.

There are many such unhappy stories, and I often think about the millions of hours spent on building tech that should have conquered the world, but didn't. The macOS platform is a rare instance of something (NeXT) eventually winning out, but the Amiga was a different kind of dead end.


If you think about it, the triumph of "good enough in the right place at the right time" describes most of the history of computing. Unix was that, as well, compared to many of its contemporary OSes. C was several steps back from the state of the art in PLs. Java, JavaScript, PHP... the list goes on and on.


There’s also the “tearing your competitors to shreds, regardless of the law or ethics” which is how I think of Microsoft in the pre-iPhone era.

As someone who loves software, there was a very clear feeling at that time that Microsoft was putting a huge chilling effect on the whole industry, and that the entire industry was stagnating under their control.

Thank god for Netscape, Google, Apple, Facebook, and Amazon (in that order) who were able to wrest that control from them. Now at least there are multiple software ecosystems to move between. When one of these massive companies poisons the water around them, there are other ecosystems doing interesting things.


Got bad news for you my friend. All of them (well, of course Netscape doesn't exist anymore) poisoned the waters around them. So regardless of where you move, you still inhale some poisoned air.


“C was several steps back from the state of the art in PLs”

This very accurately describes Go


Sure, if literally the only metric you judge a language by is how state of the art its expressiveness is.

Sometimes it feels like all the hate on HN toward Go is ignorance that there is a whole domain of software outside of scripting and low-level systems programming, and that some enterprises value 20-year maintenance over the constant churn of change, e.g. Rust and JavaScript. And yes, I often hear people saying "you can still do that in x or y", but the point is that Go does it better than most languages because it was purposely designed with those goals in mind - hence exactly why it suffers on expressiveness, state-of-the-art features et al. And I say this from 30 years of experience writing and managing enterprise software development projects across more than a dozen different languages.

Go might not be cool nor pretty, but it’s extremely effective at accomplishing its goal.


I think this is a very two-dimensional way of looking at the problem.

Go reduces complexity in order to make it easier to build resilient systems.

A language like Perl has bucketloads more features, and more expressive syntax, but I’d still say Go is many steps ahead of Perl.

On another note, I’d actually argue that some of Go’s features, such as “dynamically typed” interfaces and first-class concurrency support are streets ahead of most other languages. Not to mention its tooling, which is better than any language I’ve used, full stop (a language is so much more than simply its syntax).

I believe that functional languages, with proper, fully-fledged type systems, are the best way to model computation. But if I had to write a resilient production system, I’m choosing Go any day.


> This very accurately describes Go

...and was a deliberate design objective made by an ex bell labs guy.


C was designed to fill its main purpose of writing a portable OS in a higher-level language, and given that the majority of today's OSes are written in C, that is a testament to its success.

It is interesting to note that while Brian Kernighan and Ken Thompson were involved in the initial Go language design, C was largely Dennis Ritchie's baby, and he had a complete PhD thesis on programming language design, meaning that he was well aware of the state of the art of programming language design at that time.

The main argument for "several steps back" is probably the lack of functional language features like closures, and that feature is probably at the very bottom of the list of language features you want when porting an OS, given the CPU and memory limits of computer systems at the time. The other is object orientation, but you can do object-oriented programming in C inside the kernel just fine, if not as gung-ho as things like the multiple inheritance nonsense [1].

The jury is still out on Go. The fact that Kubernetes is very popular for the cloud now does not mean it will be as successful as 50 years of C. Someone somewhere will probably come up with better Kubernetes alternatives soon that use different languages. To stay relevant today and in the future, Go needs to adopt generics, and its designers are well aware of the deficiency of not having generics in the current Go implementation.

[1] https://lwn.net/Articles/444910/


Not at all. C was designed to fill its main purpose of writing a portable OS in a higher-level language at Bell Labs; the rest of the world had been doing that since 1961. Quite easy to find out for anyone going through digital archives from bitsavers, ACM and IEEE.

The majority of today's OSes are written in C as a testament to the success of a free-beer OS given away alongside tapes with source code, while other mainframe platforms required a mortgage just to get started.

Had Bell Labs been allowed to sell UNIX, there wouldn't be a testament to anything.


The main competitor to UNIX, namely VAX/VMS, is mainly written in C, as is its natural successor, the Windows NT kernel, which is probably the second most popular OS in the world. The more modern BeOS and MacOS kernels are written in C. Even the popular JVM (equivalent to a Java mini OS) is written in C. Why have these UNIX alternatives chosen to use C while other programming languages were readily available at the time, for example Pascal, Objective-C, and even the safe Ada?

And were the mainframe OSes even written for portability in the first place, like UNIX?


VAX/VMS was written in BLISS; it only adopted C after UNIX started to spread and they needed to cater to the competition and their own in-house UNIX implementation. Learn history properly.

https://en.wikipedia.org/wiki/BLISS

Even the popular JVM is written in a mix of Java and C++, with plans to port most of the stuff to Java now that GraalVM has been productised, https://openjdk.java.net/projects/metropolis/

Speaking of which, there are at least two well-known versions of the JVM written in Java, GraalVM and JikesRVM. Better learn the Java ecosystem.

UNIX was written in assembly for the PDP-7; C only came into play when they ported it to the PDP-11, and UNIX V6 was the first release where most of the code was finally written in C.

IBM i, z/OS or Unisys ClearPath in 2020 have completely different hardware than when they appeared in 1988, 1967 and 1961 respectively, yet PL/S, PL/X and NEWP are still heavily used on them. Looks like portable code to me.

Mac OS, you know, the predecessor of macOS, was written in Object Pascal, even though eventually Apple added support for C and C++, which then made C++ with PowerPlant the way to code on the Mac, not C.

BeOS, Symbian were written in C++, not C.

Outside of the kernel space, Windows and OS/2 always favoured C++ and nowadays Windows 10 is a mix of .NET, .NET Native (on UWP) and C++ (kernel supports C++ code since Windows Vista).

NeXT used Objective-C in the kernel, that's right, NeXT drivers were written in Objective-C. Only the BSD/Mach stuff used C.

macOS replaced the Objective-C driver framework with IO Kit, based on Embedded C++ again not C. Nowadays with userspace drivers the C++ framework is called DriverKit in homage to the original Objective-C NeXT framework.

Arduino and ARM mbed are written in C++, not C.

Android uses C only for the Linux kernel; everything else is a mix of Java and C++, and since Project Treble you can even write drivers in Java, just as Android Things has allowed since version 1.0.

Safe Ada is used alongside C++ on the GenodeOS research project.

Inferno, the last iteration of the hacker beloved Plan 9, uses C on the kernel and the complete userspace makes use of Limbo.

F-Secure, you might have heard of them, has their own bare metal Go implementation for writing firmware and is used in production via the Armory products.

IBM used PL.8 to write an LLVM-like compiler toolchain and OS during their RISC research, and only pivoted to AIX because that was what the market wanted RISC for.

Contrary to the cargo cult that Multics was a failure, the OS continued without Bell Labs and was even assessed to be more secure by DOJ thanks to its use of PL/I instead of C.

There is so much more to the world of operating systems than the tunnel vision of UNIX and C.


Personally, prior to 2010 I'd consider C++ to be "C with classes", an object-oriented extension of C. The modern C++ after that is more of a standalone language, after adopting features from other languages (e.g. D). Objective-C on the other hand is totally a separate language.

The original JVM written by Sun was in C, not C++ or Java.

The Windows NT kernel is mainly written in C. The chief developer of Windows NT, Dave Cutler, is probably the most anti-UNIX person in the world, but the fact that he chose C to write the Windows NT kernel is probably the biggest testament you can get. Dave Cutler was also one of the original developers of VMS; if BLISS, with its typeless nature, were better for developing an OS than C, he'd probably have chosen it.

For whatever reasons, Multics failed to capture widespread adoption compared to UNIX, and its name lives on mainly in operating system books as the precursor OS to UNIX. For most people, Multics is like the B language: just a precursor to C. I know it's a shame that Multics has become a mere footnote inside OS textbooks despite its superior design compared to UNIX.

The PL/I language is interesting in that it was quite advanced at the time, but as I mentioned in my original comment, Dennis Ritchie had to accommodate the fact that some language features were over-engineered relative to the hardware of the day and had to compromise accordingly. Go's designers, however, chose to compromise not based on the state of the art of the hardware but on what they thought was good for Google developers at the time of the original language design proposal.


>" Unix was that, as well, compared to many of its contemporary OSes"

Which other OSes from the era are you referring to here?


Yes, the Worse is Better thing.


On Microsoft dominating - because developers don't matter, users do. None of these superior technologies would have been willing to make an exception in the kernel so that SimCity could run (look up the story from Raymond Chen if you are not familiar). Linux found considerably more success in servers, as the users themselves are "developer like".


Closest thing I can find is from Joel Spolsky:

I first heard about this from one of the developers of the hit game SimCity, who told me that there was a critical bug in his application: it used memory right after freeing it, a major no-no that happened to work OK on DOS but would not work under Windows where memory that is freed is likely to be snatched up by another running application right away. The testers on the Windows team were going through various popular applications, testing them to make sure they worked OK, but SimCity kept crashing. They reported this to the Windows developers, who disassembled SimCity, stepped through it in a debugger, found the bug, and added special code that checked if SimCity was running, and if it did, ran the memory allocator in a special mode in which you could still use memory after freeing it.

https://www.joelonsoftware.com/2004/06/13/how-microsoft-lost...


Yes, I could not find the original link either. The closest I could find is (the link mentioned in the paragraph below is broken) -

"Interesting defense. While Chen has a good point about Microsoft taking the blame unfairly over this and perhaps similar issues its not like Microsoft is renown for their code quality. Indeed, check out an item in J.B. Surveyer's Keep an Open Eye blog from September 2004 which details how Chen's team added code to allow Windows to work around a bug in Sim City!"

https://www.networkworld.com/article/2356556/microsoft-code-...


Here you go:

https://web.archive.org/web/20070114082053/http://www.theope...

It actually refers back to the Joel on Software blog post in the comment above.


The Amiga lost the war for completely different reasons. The competition at the time was MS-DOS 4 or 5; Windows 2 (and 3.0) was still a toy — and they had compatibility problems.

BeOS had no chance to be evaluated on its own merits because by that time Microsoft had already applied anticompetitive and illegal leverage on PC vendors - for which they were convicted and paid a hefty fine (which was likely a calculated and very successful investment, all things considered).

The Amiga wasn’t even expensive for what it gave: it was significantly cheaper than a PC or Mac with comparable performance, and even had decently fast PC emulation and ran Mac software faster than the Mac.

It did not have a cheap “entry level” model, though, which was one big problem. The other (not unrelated) problem was incredible incompetence among Commodore management.


Do you not consider the A500 cheap? I believe they were going for around $600 in late-'80s money, at least in the US. This wasn't Atari ST cheap but still not a bad deal.


A starter no-brand beige box was always cheaper. And iirc, for a long time you couldn’t get an el-cheapo monochrome monitor for the Amiga - only color or TV, which was ok for the C64-upgraders but not for the PC competition.


True. PC clones were always cheaper.

You could use a composite monitor on the A500/2000. It was only in monochrome. I did that for the first couple months I had my A500.


Around me, they still cost twice as much as a CGA or Hercules (or even dual) monochrome monitor. The starter Amiga cost twice the starter PC until 1993 or so, and by then the war was lost. It was too expensive for a middle class family where I lived.


There's a similar explanation for Linux's success on servers: Linus is very strict about backward compatibility for the kernel. But for Linux on the desktop, the rest of the stack (GUI environments) is made by a bunch of CADT devs who don't care about backward compatibility, so of course it failed.


What kind of backward compatibility for GUI environments are you talking about?


The kind where you can still use an application 2 years later without having to recompile it.


That generally isn't a problem if you pin/vendor your dependencies... the same thing that most developers seem to do on Windows and mac anyway.


CADT? What do you mean?


Cascade of Attention-Deficit Teenagers (open-source software development model)



google "CADT jwz". For various reasons it can't be linked from HN.


For those curious about the story https://www.joelonsoftware.com/2000/05/24/strategy-letter-ii... is a great read with other useful tidbits as well.


There is something to be said about the two companies mentioned other than MS that did the right backwards-compatible thing, Transmeta and PayMyBills: one is bust and the other doesn't show up on the first page of a search.


> the other doesn't show up on the first page of a search.

It merged with PayTrust, which was acquired by Metavante, who then sold the customers to Intuit. All this happened in the early 2000s. More recently it seems that Intuit sold Paytrust back to Metavante. It's still operating a service at Paytrust.com.


My first Linux box felt pretty comfortable after cutting my teeth on an Amiga's shell, which was largely inspired by Unix and still similar enough in concepts to make the transition easy.


I was an Amiga user for most of the late 80's and early 90's. The hardware didn't change much over the years. Software wise, OS 2.0 was a huge upgrade, but hardware wise, it felt like little changed until AGA. AGA machines (1200/4000) were too little, too late. If they had come out in 1990 instead of 1993, it might've been enough of a lead. Maybe in an alternate universe where the A3000 had AGA.


AGA was the only significant hardware revision.

They should have ditched m68k too. I loved it, but with 68040 and 486, the writing was on the wall for everyone to see.

By the time of the Pentium, the writing was on the walls, the floors, the windows, the ceiling.

Yes, the 68060 held a candle to the early Pentiums, but it was not intended as a personal computer CPU, more "fast embedded".

The Amiga OS was great. No memory protection, but Win 3.1 had none, DOS had none, and Win 95 had some but somehow crashed relentlessly anyway. It took years for them to discover that it had a max uptime of 49.7 days because of a timer running out of bits.

AGA could and should have been incrementally upgraded with more modes, ever keeping backwards compatibility. (Like AGA did with the original chipset.) They could have sold Amigas on PCI boards, with a cheap 68000 to boot legacy Amiga OS until the transition was complete with emulation or whatever, and using the PC x86 for games code. So many possibilities, but R&D was on a shoestring budget.

The "game console like" conformity was the strength and ultimately the downfall of the platform, but not because that's bad inherently, but because the revisions stopped coming. The original PS2 was compatible with the PS1, and the original PS3 was compatible with the PS2.

The iPhone also shows the strength of vertical integration, Commodore had a great chess board opening but traded all its pieces for nothing, except in the end, pork for the CEO and board.


My grampa gave me my first PC, and it had a CLI I learned; then years later, when I was introduced to Linux, I just had an intuition for the basic commands and usage. I haven't been able to track down what that PC was (it was CLI-only but could load games from floppies). I think it may have been an Amiga (this was in '98, but the PC was a decade+ old at the time).


macOS may have technical advantages, though less and less over time. But it always had a dramatically more restrictive business model from the very beginning.

This is what made it unattractive to business and continues to make it unattractive to many.

The restrictiveness of Apple is likely an advantage for novice mobile users, and other vendors copied it.


You say it's unattractive to business and yet MacBook seems to be the single most popular brand of laptop at most tech companies these days.


Maybe the most visible brand, among web-dev, but HP, Lenovo, and Dell own the business/enterprise laptop market in a big way.


I'm aware, I'm a full-time Windows developer.


Around here it is a mix. Macs aren't extremely popular (3 - 5 of 20 or something) and Linux has quickly become more common and seems to be eating into the Windows market share.


Every time there's a huge media article about a newly discovered tracking mechanism in Windows 10, you immediately see posts with newcomer questions in Linux-specific areas.

A lot of people seem to have switched to Ubuntu or Arch due to Windows 10 tracking. And these are also non-technical people that have no idea what they are doing, which is kinda awesome.

I always love it when Linux gets a bit easier to use as a desktop for the wider audience.


50/50 with Linux among developers in the case of my company. In my team there are like 4 Dells and only 1 Mac.


Software developers are a tiny fraction of business users. In our industry - vaccines - most people use Windows laptops to get work done. Execs do use Apple stuff, but they're not doing the actual ground-level work, so it's probably the same everywhere. Apple hardware is expensive to buy, expensive to maintain, and difficult to service due to its flawed design (everything is soldered, no easy access to components, no third-party repair, no access to parts, generates lots of e-waste, etc). The OS also is not capable enough to be easily administered by IT.


The PC was an open, free specification, so the hardware was cheaper. I think that was the main driver behind the PC winning, not Microsoft.


Only thanks to Compaq; that wasn't part of IBM's plans.


Maybe it's only clear to you. I don't see what is "clearly inferior" about Microsoft tech. It is reliable and rock solid and has worked for our industry very well (vaccines).


I'm of course referring to the periods when Amiga and Apple competed with MS-DOS and Windows, which were vastly inferior technologically.


Amiga multitasking was actually preemptive. It was only cooperative in the sense that all processes / tasks were in the same address space and without memory protection...


Between the preemptive multitasking and purely message-based communication it sounds a lot like Erlang.


Unlike Erlang, though, when it crashed, it crashed.

RIP Guru meditation.


I learned C programming on an Amiga. It made me very careful. If you messed up, you were looking at the guru followed by a couple minutes for a reboot. Fun times...


We have to "thank" Compaq for it as well.

Even with all its management flaws, the Amiga might have survived without the mass production of PC clones.


Minor nitpick: The Amiga had preemptive multitasking (and thus didn't depend on user code to willingly give up their time slice, in that regard it was more like UNIX, and unlike early Windows and MacOS versions).


Another amazing thing is that Carl Sassenrath, creator of the Amiga OS kernel, also went on to create the REBOL language, which seemed quite innovative too - I've checked it out some - though it is kind of dormant now, and there is now the Red language, based somewhat on REBOL. https://en.m.wikipedia.org/wiki/Carl_Sassenrath


I think the multitasking was actually preemptive. But yes, it had no memory protection: the message passing infrastructure relied on it and it would have been very hard to retrofit even on cpus with an MMU (although I think recent versions might have actually tried).


True, but as I've read from more knowledgeable sources than myself, the problem with the Amiga was that the software was intimately linked to, and effectively exposed, hardware implementation details.

This made upgrading chips nigh impossible without full software rewrites, which ultimately caused stagnation.

Indeed, as an A500 kid I used to laugh and was horrified by my first PC...


The OS actually had a very advanced abstraction layer. The problem is most games bypassed it (and the OS itself in fact) and talked directly with the hardware.


Thanks for the clarification


A friend of mine was amazed by this capability of the Amiga when I showed him that on one screen I could play mod.DasBoot in NoiseTracker, pull the screen down partly then go on the BBS in the terminal by manually dialing atdt454074 and entering, without my A500 even skipping one beat...

All I had was the 512 kB expander; he had a 386 with a 387 and could only run a single-tasking OS.


Linux was originally built for the 386.


He could've done the same thing with DOS if he bought DesqView. That let you multitask multiple DOS applications on a 386.


Not quite. The multiple screen thing allowed several full screen graphical applications with different resolutions on screen at once, divided by a vertical barrier (a title bar similar to those on windows). This was a hardware feature at its core, if memory serves me right.


Yes, I remember the screen feature on my A500. It was neat.

To say a 386 is limited to single tasking is wrong though. That was my main point.


I remember NetWare's IPX/SPX network stack used a similar async mechanism. The caller submits a buffer for read and continues to do whatever. When the network card receives the data, it puts them in the caller's buffer. The caller is notified via a callback when the data is ready. All these were fitted in a few K's of memory in a DOS TSR.

All the DOS games at the time used IPX for network play for a reason. TCP was too "big" to fit in memory.


"In 1985... yes I said 1985, the Amiga did all I/O through sending and receiving messages"

I do remember that, and it was cool. But, lightweight efficient message passing is pretty easy when all processes share the same unprotected memory space :)


L4 uses a similar model, and the last ~20 years of research around L4 has mostly focused on improving IPC performance and security. The core abstraction is a mechanism to control message passing between apps by routing it through lightweight kernel invocations (which is indeed practically the only thing the kernel does, it being a microkernel architecture).

Memory access is enforced, although not technically via the kernel. Rather at boot time the kernel owns all memory, then during init it slices off all the memory it doesn't need for itself and passes it to a user space memory service, and thereafter all memory requests get routed through that process. L4 uses a security model where permissions (including resource access) and their derivatives can be passed from one process to another. Using that system the memory manager process can slice off chunks of its memory and delegate access to those chunks to other processes.


When you want to squeeze every bit of performance out of a system, you want to avoid doing system calls as much as possible. io_uring lets you check if some I/O is done by just checking a piece of memory, instead of using read, poll, or such.
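With liburing that check looks roughly like this (a minimal sketch; the function name is a placeholder, and in the common case io_uring_peek_cqe just reads the shared completion ring without a syscall):

  /* Sketch: see whether any I/O has completed by looking at the CQ ring. */
  #include <errno.h>
  #include <liburing.h>

  int check_done(struct io_uring *ring)
  {
      struct io_uring_cqe *cqe;

      if (io_uring_peek_cqe(ring, &cqe) == 0) {
          int res = cqe->res;            /* result of the original request */
          io_uring_cqe_seen(ring, cqe);  /* mark the CQE as consumed */
          return res;
      }
      return -EAGAIN;                    /* nothing finished yet, keep working */
  }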


One thing that doesn't change is that every decade people will look at the Amiga and admire it the same, no matter how many advances have been made since.


This over-romanticizes the Amiga (a beautiful system no doubt) because there have been message-passing OSes since the 1960s (see Brinch Hansen's Nucleus for example). The key difference with io_uring is that it is an incredibly efficient and general mechanism for async everything. It really is a wonderful piece of technology and an advance over the long line of "message passing" OSes (which always were too slow).


Purely for entertainment, what is the alternate history that might have allowed Amiga to survive and thrive? Here's my stab:

- in the late 80s, Commodore ports AmigaOS to 386

- re-engineers Original Chipset as an ISA card

- OCS combines VGA output and multimedia (no SoundBlaster needed)

- offers AmigaOS to everyone, but it requires their ISA card to run

- runs DOS apps in Virtual 8086 mode, in desktop windows or full-screen


The reverse was actually possible: there were PC-compatible expansion cards for the Amiga [1]. The issue is that they were very expensive and 8088-only.

[1] for example: https://en.wikipedia.org/wiki/Amiga_Sidecar although "card" is stretching it :)


Yes, those PC Card addons for Mac/Amiga/etc are endlessly fascinating to me. But with the benefit of hindsight, the crucial factor wasn't just being able to run DOS applications on your fancy proprietary computer, it was riding the PC Compatible rocketship as it blasted off. Creative Labs and 3Com and Tseng and many others showed that there was more value in manufacturing a popular expansion card for the massive PC world than in owning your own closed platform bow-to-stern.



Nice. I did not know those.


> Devices were just DLLs with a message port.

Reminds me of: https://en.wikipedia.org/wiki/Unikernel


Just like Erlang + receive.


All this fuss because Linux wouldn't just implement kqueue ... Sigh.


Please explain to me how kqueue facilitates submitting arbitrarily large numbers of syscalls to the kernel as a single syscall, to be performed asynchronously no less. Even potentially submitted using no syscall at all, in polling mode.
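To make it concrete, with liburing it looks roughly like this (a sketch; the fds, buffers and length are placeholders, and error/NULL checks are omitted):

  /* Queue a batch of reads, then hand them all to the kernel at once. */
  #include <liburing.h>

  void submit_batch(struct io_uring *ring, int *fds, char **bufs, int n, unsigned len)
  {
      for (int i = 0; i < n; i++) {
          struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
          io_uring_prep_read(sqe, fds[i], bufs[i], len, 0);
          io_uring_sqe_set_data(sqe, (void *)(long)i);  /* tag to match completions */
      }
      io_uring_submit(ring);  /* one io_uring_enter() for the whole batch */
  }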


Linux should have had kqueue instead of epoll. But io_uring is a different thing.


AFAIK it's unnecessary at this point, Linux has most of the equivalent functionality and there is a shim library for it: https://github.com/mheily/libkqueue


Yes, these days you can get a file descriptor for pretty much everything, so epoll is sufficient.

I think that epoll timeout granularity is still in milliseconds, so if you want to build high-res timers on top of it for your event loop you have to either use zero-timeout polling or an explicit timerfd, which adds overhead. I guess you can use plain ppoll (which has ns-resolution timeouts) on the epoll fd.


This is corrected in io_uring too: IORING_OP_TIMEOUT takes a timespec64.
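Roughly like this, assuming an already initialized struct io_uring named ring (a sketch):

  /* A nanosecond-resolution timeout is just another SQE. */
  struct __kernel_timespec ts = { .tv_sec = 0, .tv_nsec = 250000 };  /* 250 us */
  struct io_uring_sqe *sqe = io_uring_get_sqe(&ring);

  io_uring_prep_timeout(sqe, &ts, 0, 0);  /* count=0: complete on expiry only */
  io_uring_submit(&ring);                 /* the expiry shows up as a normal CQE */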


Solaris didn't port kqueue either. We're doomed to reinvent the wheel.


And Bryan Cantrill has expressed quite a bit of remorse about that.


This reminds me of David Wheeler's adage:

  All problems in computer science can be solved by another level of indirection.
The rejoinder, and I don't know who gets credit for it, is:

  All performance problems can be solved by removing a layer of indirection.


An often cited corollary to the first one is, "...except for the problem of too many layers of indirection." :)

https://en.wikipedia.org/wiki/Indirection


Incorrect: that problem, too, can be solved by adding another layer of indirection - then adding an optimized implementation underneath it.


Have we stopped solving all performance problems with introducing a cache? Why wasn't I told? Will I have to hand in my union card?


No worries, your union card is safe: a cache is just a particular kind of added indirection. :)


And some architectures have magically improved performance by reconfiguring caches as scratch pad memories.

http://rexcomputing.com/


I don't think io_uring and ebpf will revolutionize programming on Linux. In fact I hope they don't. The most important aspect of a program is correctness, not speed. Writing asynchronous code is much harder to get right.

Sure, I still write asynchronous code. Mostly to find out if I can. My experience has been that async code is hard to write, is larger, hard to read, hard to verify as correct and may not even be faster for many common use cases.

I also wrote some kernel code, for the same reason. To find out if I could. Most programmers have this drive, I think. They want to push themselves.

And sure, go for it! Just realize that you are experimenting, and you are probably in over your head.

Most of us are most of the time.

Someone will have to be able to fix bugs in your code when you are unavailable. Consider how hard it is to maintain other people's code even if it is just a well-formed, synchronous series of statements. Then consider how much worse it is if that code is asynchronous and maybe has subtle timing bugs, side channels and race conditions.

If I haven't convinced you yet, let me try one last argument.

I invite you to profile how much actual time you spend doing syscalls. Syscalls are amazingly well optimized on Linux. The overhead is practically negligible. You can do hundreds of thousands of syscalls per second, even on old hardware. You can also easily open thousands of threads. Those also scale really well on Linux.


I don't know what kind of programming you're doing, but in network apps, if you have a thread per client and lots of clients (like a web server), you end up with lots of threads waiting on responses from slow clients, and that takes up memory. The time blocked on the syscall has nothing to do with your own machine's performance.

But on the other hand, if your server is behind a buffering proxy so it's not streaming directly over the Internet, it might not be a problem.


> But on the other hand, if your server is behind a buffering proxy so it's not streaming directly over the Internet, it might not be a problem.

This is one instance of a larger pattern I've been noticing. When using some languages (like Python and Ruby) in the natural, blocking way, a back-end web application typically needs multiple processes per machine, because it doesn't handle many concurrent requests per process. Combine this with the fact that each thread has to block while waiting on the client, and you have to add more complexity around the application server processes to regain efficiency. The proxy in front of those servers is one example. Another is an external database connection pool like PgBouncer. Speaking of the database, to avoid wasting memory while waiting on it, you may end up introducing caching sooner than you otherwise would. And when you do, the cache will be an external component like Redis, so all of your many processes can use it. Or you might use a background job queue just to avoid tying up one of your precious blocking threads, even for something that has to happen right away (e.g. sending email). And so on.

Contrast that with something like Go or Erlang (and by extension Elixir), where the runtime offers cheap concurrency that can fully use all of your cores in a single process, built on lightweight userland threads and asynchronous I/O, while the language lets you write straightforward, sequential code. In such an environment, a lot of the operational complexity that I described above can just go away. Simple code and simple ops -- seems like a winning combination to me.


Cooperative multitasking is much easier to implement and administer than preemptive multitasking, and always has been. But there are cases where it isn't good enough, and if you hit those then you need a system that can do preemptive multitasking gracefully - which often means you end up with just as much complexity as if you'd used preemptive multitasking from the start, but with the complex parts being less well-tested.


>"But there are cases where it isn't good enough, and if you hit those then you need a system that can do preemptive multitasking gracefully ..."

What are some of those use cases where userland threads are no longer good enough? In what areas do they fall short?


Essentially any time you have to run something that's not completely trusted to not block a thread - which could be user-supplied code (or "code" - matching a regex is unsafe in most implementations, rendering PostScript is famously Turing-complete) or just a third-party dependency.

At my first job we had a prototype that performed 2x faster (on average) by using Go-style async, but we couldn't trust our libraries enough to eliminate bugs from blocking dispatcher threads. So we stuck with traditional multithreading.


It's all true, and yet most webservers were like that 20 years ago - and they still managed to run even fairly high-traffic websites on hardware much less powerful than what we have today. I would argue that >90% of the web doesn't really need the extra throughput that async gives you at the cost of extra complexity.


Writing asynchronous code is trying to fix how your code is executed in the code itself. It is the wrong solution for a real problem.

But I think what many people get wrong (not the person I'm replying to) is that how you write code and how you execute code does not have to be the same.

This is essentially why google made their N:M threading patches: https://lore.kernel.org/lkml/20200722234538.166697-1-posk@po...

This is why Golang uses goroutines. This is why Javascript made async/await. This is why project loom exists. This is why erlang uses erlang processes.

All of these initiatives make it possible to write synchronous code and execute it as if it was written asynchronously.

And I think all of this also makes it clear that how you write code and how code is executed is not the same, so yes, I'm in agreement with the person I'm replying to, I don't think this will change how code is written that much, because this can't make writing code asynchronously any less of a bad idea than it is now.


> This is why Golang uses goroutines. This is why Javascript made async/await. This is why project loom exists. This is why erlang uses erlang processes.

JavaScript async/await is different from the others. It requires two colors of functions [1], and it conflates how the code is written with how it's executed, so it has the same problem you were talking about at the start of your comment.

Also, JavaScript async/await is suboptimal in that it's ultimately built on top of unstructured callbacks. Or, as Nathaniel J. Smith put it in a post about Python's asyncio module, which has the same problem, "Your async/await functions are dumplings of local structure floating on top of callback soup, and this has far-reaching implications for the simplicity and correctness of your code." [2] That whole post is well worth a read IMO.

[1]: https://journal.stuffwithstuff.com/2015/02/01/what-color-is-...

[2]: https://vorpus.org/blog/some-thoughts-on-asynchronous-api-de...


> JavaScript async/await is different from the others.

It allows me to write synchronous code and execute it asynchronously. The mechanism is different - but the purpose is the same. I'm not endorsing the implementation. But I do use it, because it is way better than writing asynchronous code.


What a wonderfully dogmatic comment that completely misses the point of io_uring.


Given the article's over-the-top opening, I think it's good to have a reality check that reminds us of fundamentals like correctness over speed, and clarity over cleverness.


io_uring is correct. It's nothing clever - in fact, it's quite boring. It is specifically meant for applications that must handle high volumes of asynchronous I/O.

Yes, believe it or not, you can achieve correctness and speed, together, without compromise.


For 99.99% of Linux programming, io_uring and eBPF are beside the point, and developers couldn't care less about them.


Care to shed some light on the point of io_uring the OP misses? (honestly interested)


What are your thoughts on Rust?


Coincidentally last night I announced [0] a little io_uring systemd-journald tool I've been hacking on recently for fun.

No ebpf component at this time, but I do wonder if ebpf could perform journal searches in the kernel side and only send the matches back to userspace.

Another thing this little project brought to my attention is the need for a compatibility layer on pre-io_uring kernels. I asked on io_uring@vger [1] last night, but nobody's responded yet; does anyone here know if there's already such a thing in existence?

[0] https://lists.freedesktop.org/archives/systemd-devel/2020-No...

[1] https://lore.kernel.org/io-uring/20201126043016.3yb5ggpkgvuz...


I'd like something roughly similar, to make the rr reverse debugger support io_uring. That likely can't work like most other syscalls, due to the memory only interface...


I have some thoughts about io_uring support in rr: https://github.com/rr-debugger/rr/issues/2613


I was thinking about doing this for an event loop I was working on, but no code to show yet... you probably can get away easily with using pthreads and a sparse memfd to store the buffers.


(Reply since I can't edit anymore) The only catch is that with that approach, you can't poll on the fd like you can with the real thing.


Assuming you're talking about emulating io_uring in userspace at the liburing API level, couldn't you just use a pthreads thread pool for the syscall dispatching from submitted SQEs, and when those complete, serialize their results into CQEs?

For fd-based monitoring of the CQE, wouldn't a simple pipe or eventfd suffice? When CQEs get added, write to the fd, it just happens to all be in-process.

I must admit I haven't gone deep into the liburing internals or the low-level io_uring API, but conceptually speaking there doesn't seem to be anything happening that can't be done in-process in userspace atop pthreads for the blocking syscalls. It just won't be fast.

Am I missing some critical show-stopping detail?
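For the fd-monitoring part I'm imagining roughly this (hypothetical shim code; names are made up), which mirrors what io_uring_register_eventfd() gives you on a real ring:

  /* Worker threads do the blocking syscalls, append results to an in-process
     "CQ", then bump an eventfd that callers can select/poll/epoll on. */
  #include <stdint.h>
  #include <sys/eventfd.h>
  #include <unistd.h>

  static int cq_efd;  /* created once: cq_efd = eventfd(0, EFD_CLOEXEC | EFD_NONBLOCK); */

  static void cq_push_and_signal(void)
  {
      /* ...append the fake CQE to the emulated ring under a lock/barrier... */
      uint64_t one = 1;
      write(cq_efd, &one, sizeof(one));  /* wakes anyone waiting on cq_efd */
  }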


This feels very, very similar to IO completion ports / IOCP on Windows. More modern versions of Windows even have registered buffers for completion, which can be even more performant in certain scenarios. I'm looking forward to trying this out on Linux.

I'm curious to see how this might work its way into libuv and c++ ASIO libraries, too.


io_uring allows the kernel and the user program to communicate purely via shared memory without having to perform a system call, i.e. a context switch to the kernel.

Do windows completion ports also work that way or do they involve a system call to be performed in order to consume completion events?


Windows registered IO (RIO) does imho the same (https://docs.microsoft.com/en-us/previous-versions/windows/i...). When enqueuing reads/writes with RIO there at least exist flags to specify that the kernel should not immediately be woken up, and thereby to batch syscalls as with io_uring.


You do still need a single system call (io_uring_submit) to submit each batch of entries in the submission queue.

Edit: actually no it's not required in all cases. Thanks for the correction.


I've only read about io_uring without yet having a chance to actually use it, so take this with a grain of salt:

I read that io_uring has two modes, one where you signal via a system call and another that uses memory mapped polling.

https://unixism.net/loti/tutorial/sq_poll.html states:

> Reducing the number of system calls is a major aim for io_uring. To this end, io_uring lets you submit I/O requests without you having to make a single system call. This is done via a special submission queue polling feature that io_uring supports.


Submit is not a syscall. io_uring_enter is the only syscall that is used while running a ring. That one may submit, wait or both at the same time. Strictly speaking it isn't necessary but to avoid it you require elevated privileges.


From Linux 5.10 you only need CAP_SYS_NICE to perform SQPOLL: https://git.kernel.dk/cgit/linux-block/commit/?h=io_uring-fi...


Yes, hence "elevated" and not "root". It is still higher than default, right?


You can ask the kernel to poll the submission queue and skip io_uring_submit too. (Though you need elevated privileges to do this IIRC)


Not with SQPOLL. You can eliminate all syscalls with SQPOLL.
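Setup is roughly this (a sketch); liburing's submit path then only falls back to io_uring_enter() when the kernel polling thread has gone idle and needs a wakeup:

  /* Ask the kernel to poll the submission queue itself
     (needs elevated privileges, see the sibling comments). */
  struct io_uring ring;
  struct io_uring_params p = { 0 };

  p.flags = IORING_SETUP_SQPOLL;
  p.sq_thread_idle = 2000;  /* ms of inactivity before the poller sleeps */
  io_uring_queue_init_params(8, &ring, &p);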



Tracking that issue was a motivator for me to begin adding support to glib: https://gitlab.gnome.org/GNOME/glib/-/issues/2084


You're quite right -- it's basically the same idea as IOCP on Windows, kqueue on FreeBSD, and Event Ports on Solaris.


Isn't kqueue for sockets still readiness based? I know most runtimes (like libuv) just use it in the same fashion as epoll, and await readability/writeability through the queue. Not sure if it also has completion based options.


You're quite right, I muddled things up (and now the edit window has elapsed). epoll is the Linux equivalent of kqueue, IOCP, and Event Ports (all readiness based). Not sure how I screwed that one up...


IOCP is actually submission+completion based and thereby closer to io_uring than to epoll. The main difference between IOCP and io_uring at this point seems to be the use of a ringbuffer based submit interface instead of a syscall based one. But that is more of a performance optimization than a huge difference in the programming model.


It is already integrated with asio. Third-party, of course, because that's the whole point: io_uring does not need to know anything about asio, nor does asio need to know anything about io_uring, to get optimal performance.

It's all on github, with accompanying CppCon talk. Asio, by the way, will be C++23's network layer.


There's currently a lot of talk about io_uring, but most articles around it and usages still seem more in the exploration, research and toy project state.

I'm however wondering what the actual quality level is, whether people used it successfully in production and whether there is an overview with which kernel level which feature works without any [known] bugs.

When looking at the mailing list at https://lore.kernel.org/io-uring/ it seems like it is still a very fast-moving project, with a fair amount of bugfixes. Given that, is it realistic to think about using a kernel version between 5.5 and 5.7 in production where any bug would incur an availability impact, or should this still be considered an ongoing implementation effort and revisited at some 5.xy version?

An extensive set of unit tests would make it a bit easier to gain trust that everything works reliably and stays working, but unfortunately those are still not a thing in most low-level projects.


Don't use io_uring until at least 5.10 rc3, if not 5.11. SQPOLL is still to be properly added and fixed and there are some security concerns (e.g. CAP_SYS_ADMIN being replaced by CAP_SYS_NICE to start a kernel submission queue polling thread).

io_uring has many tests in the companion user space library liburing, maintained by the same person that made the kernel patches (Jens Axboe). They test both the library as well as expected functionality in the kernel.

io_uring is not going to give you speed ups if you use it in the same way as you would epoll or kqueue. Thus, simply sticking it into e.g. libuv without changing how the applications are built probably won't give you a lot of benefit (speculating).

It comes down to how you work with the ring buffers and how much you take advantage of the highly out-of-order, memory-barrier-based shared memory approach as opposed to more "discrete" (maybe not the right word) syscalls.

As of yet, I haven't personally come across a published example of a production framework that utilizes these features adequately. We have some internal IP that does, but probably won't be open sourced.


This article recently featured on HN may be of interest to answer your question: https://itnext.io/modern-storage-is-plenty-fast-it-is-the-ap...

io_uring allows for better utilization of fast storage.


> Things will never be the same again after the dust settles. And yes, I’m talking about Linux.

One has to be in quite a techie bubble to equate Linux kernel features with actual world-changing events, as the author goes on to do.

More on-topic though, having read the rest of the article, my guess is that while these features will let companies squeeze some more efficiency out of high-end servers, they won't change how most of us develop applications.


Any async or event-loop runtime can be almost entirely powered by io_uring. Timers, waiting for work when you're out of CPU-bound tasks, most IO syscalls, it all can go through io_uring.

You'll still need a few worker threads for blocking syscalls that haven't been ported to io_uring yet but that need is greatly reduced compared to the previous state of things.

So even if you're not using io_uring yourself the language standard libraries or server frameworks will.

There are WIPs for netty, libuv, nginx. Other projects are exploring it or have announced intent to use it.


Another example, Zig landed io_uring in the std lib a month ago: https://github.com/ziglang/zig/pull/6356

I'm also really excited by how you can use io_uring to power everything (fs, networking etc.) with one easy api and a single-threaded event loop: https://github.com/coilhq/tigerbeetle/tree/master/demos/io_u...

io_uring makes thread-per-core designs so much easier.


He also brings up 2020 because OMG, it's the worst year EVAR.

It's not a tech bubble as much as it's a journo bubble. People are reading before they're writing, so he's seeing trendy topics like 2020 and the virus. He feels he needs a hook to get his readers engaged, so he's reaching for things readers can relate to.

It's a bad hook. I think an editor would have cut that whole intro.


Oh please. It's bad hook, yes. But once we get over that, we should acknowledge that this is an extremely well-written article. It has been a long time since I've stumbled over an article on HN that was such a joy to read.


I agree it is well-written overall and should have been clearer about that; in my defense, my conclusion was simply that an editor would cut the intro.

The intro is both highly visible, and, because of how writing and thinking work, it's also the spot where you're either collecting your thoughts or trying to hook the reader.

If you don't have an editor and you're done with your first draft, try deleting your first few paragraphs. It's often a simple way to vastly improve a piece.


If the premise is nonsense how can the article be well-written?

Most applications will not change the way they work with the kernel, because they don't work with it, they hide it as well as possible under libraries and frameworks. Even so, most applications need neither io_uring, nor eBPF. Hardly a revolution.


It was written (or, published) in May. The zeitgeist was very much Corona without the fatigue we have now.


Well, getting GB/sec speeds instead of 100s of MB/sec is a pretty impressive improvement in disk utilization.


Without any doubt, but the impact on the world as a whole is going to be barely noticeable.


My real hope is that eventually, you can use some higher-level language to write device drivers for things like crappy IoT gadgets using eBPF, without any chance of crashing the machine due to a pointer fu or so.

Knowing that with eBPF I simply cannot crash the laptop I'm working on is a huge deal, and reduces the great psychological hurdle that kernel development always had (for me, at least).


I am impressed with the level of linux knowledge in this thread. How do people become linux kernel hackers? Most of the developers I know (including myself) use linux but have very little awareness beyond application level programming.


You don't necessarily have to be a kernel hacker to be familiar with many of the features that the kernel provides. Just doing application debugging often requires digging deeper until you hit some kernel balrogs.

Container problems? Namespaces, Cgroups, ...

Network problems? Netfilter, tc, lots of sysctl knobs, tcp algorithms (cue 1287947th thread on nagle/delayed acks/cork)

Slow disk IO? Now you need to read up on syscalls and maybe find more efficient uses. Copy_file_range doesn't work as expected? Suddenly you're reading kernel release notes or source code.


> How do people become linux kernel hackers?

Honestly, by hacking it.

There's a famous book about Linux internals whose name I don't remember (but it has "Linux" and "internals" in it). But I have never seen anybody doing it by reading a book (despite how excellent it can be). You just go change what you want or read the submodule you are interested in understanding, and use the book, site or whatever when you have a problem.


>There's a famous book about Linux internals that I don't remember the name (but has "Linux" and "internals" on it)

this one? https://0xax.gitbooks.io/linux-insides/content/index.html

though it says insides instead of internals


Yes, this one. Thanks.


I have never run into a problem that I thought needed to be solved in the kernel. What kinds of things have you wanted to change?


The first time I went into it was to write a driver for a device in my undergrad. After that I've changed a driver here or there (never anything worth merging), and needed the documentation of the sound systems.

It's not an easy thing, by any means. Just locating where you have to touch on the source tree is a problem that will lead you to plenty of books or sites. But don't try reading those before you have a problem to solve, you will lose time and drown in information.

(By the way, I am assuming you know how syscalls work. If you don't, go study that before you start anything.)


I learned 90% of the things I know about operating systems by solving problems I myself caused.


Once I found myself reading about device module programming since the common USB-Serial device module (I forgot its name, cdc something) wasn't properly working for a Chinese multiserial port chip inside a GSM cluster modem (one USB device to multiple serial ports).

I was attempting to hack away a simple example but I found the USB-Serial (more) generic driver intended for "test only" and... it just worked.

Another reason for reading about IO calls, schedulers, etc? That "I'm still writing data into your USB flash drive even when the GUI says it finished 5 minutes ago" that I hate so much.


Conversely every problem I’ve run into in a kernel that I could potentially solve requires me to be an expert in 1-3 things outside of the kernel.


Apart from Linux hw support for things at work, I implemented a fairly simple pseudo-device for establishing TCP connections from a process in capability mode on FreeBSD. The device driver has support for a denylist to disallow connections to specific IP ranges. It has multiple syscalls wrapped into one ioctl, and sockets opened from the device always had TCP_NODELAY, O_CLOEXEC and SOCK_NONBLOCK set. Worked pretty well for its intended use case.

https://github.com/sebcat/yans/blob/master/drivers/freebsd/t... https://github.com/sebcat/yans/blob/master/drivers/freebsd/t...


In my case, which is probably typical, there was a bug in a device driver for some obscure thing we were using where I worked. So I had to dive into the world of kernel modules and fix it. I think a lot of kernel knowledge and development is commercially-driven, in this sense.


I think you were thinking of "Linux Core Kernel" by Scott Maxwell. And yes, it's an awesome book that copied the style of the same type of book that annotates the SVR4 kernel.


For the most part, it's just software. If you have the time and the interest, you can learn it like anything else. At some level, it requires an awareness of how the hardware works(page tables/MMUs/IOMMUs, interrupts, SMP, NUMA, etc).

I don't mean to downplay the investment, but if you're already an experienced software engineer you can get into it if it interests you. There is a different mindset among systems software programmers though. Reliability comes first, performance and functionality come second. It's a world away from hacking python scripts that only need to run once to perform their function.


I learned a TON about the Linux kernel through writing custom device drivers for FPGAs. Granted most of my experience is in the driver area and not in any of the subsystems, but even still I have a much better grasp of how the kernel operates now (and even more importantly, I know how to navigate it and how to find relevant documentation).


As others have said, hacking it, certainly. But if you're not up for that and would like something more passive, read LWN.net (and possibly subscribe!)


I learned a lot by trying to make Go talk to ALSA without using any existing C interfaces. Just happy exploration goes a long ways!


UTSL


Also from Glauber Costa, a thread-per-core framework using io_uring written in Rust[1] and discussed in HN[2].

[1]: https://github.com/DataDog/glommio [2]: https://news.ycombinator.com/item?id=24976533


Today I am grateful for the brilliant minds around the world that continually open up fundamentally revolutionary new ways to develop applications. To Jens, to Alexei, and to Glauber, and to all of their kindred and ilk, we raise a glass!


The title of the HN post is missing a suffix of "for a few niche applications".

My work is "programming in Linux", but it's not impacted by any of this since I'm working in a different area.

I'm sure this is important work, but maybe tone down such claims a bit.


"few niche applications" being any application that touches files, network or want to run code in the kernel. Sounds like a bigger target than just "niche", but I'm no Linux developer so what do I know.


eBPF has been available for a while now, one year ago there were even two books published about it, one by an author of the "bcc" mentioned in this ad disguised as a technical article. It didn't revolutionize programming in Linux that I'm aware of. It found its market in observability and performance analysis.

io_uring seems to be relevant mostly to people using or wanting to use AIO. Outside of a "few niche applications" this is unimportant for the majority of Linux developers. Libraries like ASIO would likely wrap it anyway since these low-level APIs are not pleasant to use.



GHC RTS integration already well in the works too :) http://wjwh.eu/posts/2020-07-26-haskell-iouring-manager.html


At SCO in the mid-90s we were playing with very similar ideas to boost DB performance. The main motivation was the same then as it is now, don't block and avoid making system calls into the kernel once up and running. Don't recall if any of the work made it into product.


eBPF is still a bit rough, but what you can do with it already is very cool.

It would be nice to see it at a higher level at the syscall interface, i.e. currently if I want to attach a probe I have to find the function myself or use a library, but it would be nice to have it understand ELF files.


One thing that I haven't been able to get is if this makes things like DPDK or user mode tcp stack unnecessary since the system call overhead is gone.


io_uring reduces but doesn't remove the system call overhead.

Only with in-kernel polling mode is it close to removed. But kernel polling mode has its own cost. If the system call overhead is nowhere close to being a bottleneck, i.e. you don't do system calls "that" much, e.g. because your endpoints take longer to complete, then using kernel polling mode can degrade overall system performance, and potentially increase power consumption and as such heat generation.

Besides that user mode tcp stacks can be more tailored for your use case which can increase performance.

So all in all I would say that it depends on your use case. For some it will make user mode tcp useless or at least not worth it but for others it doesn't.
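
For reference, the kernel-side polling mode discussed above is opt-in at ring setup time; here is a minimal sketch assuming liburing, with an arbitrary example idle timeout (not taken from the article):

    #include <liburing.h>
    #include <string.h>

    /* Ask the kernel to spawn an SQ polling thread: submissions can then be
     * picked up without a syscall, at the cost of a busy kernel thread. */
    int setup_sqpoll_ring(struct io_uring *ring)
    {
        struct io_uring_params p;

        memset(&p, 0, sizeof(p));
        p.flags = IORING_SETUP_SQPOLL;
        p.sq_thread_idle = 2000;   /* ms of idleness before the poller sleeps again */
        return io_uring_queue_init_params(256, ring, &p);
    }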


I am not sure how user-space TCP stacks or DPDK would get around the power consumption issues of kernel polling. In fact, the usage I am aware of pretty much involves polling in user mode because any context switch or O/S scheduling related overhead is excessive. The only thing you can do is to keep your task queue full so as to always be doing something.


io_uring does allow removing a lot of the syscall overhead, without polling. Many operations can be submitted with just one syscall, and ready completions can be consumed without a syscall at all.

Additionally, compared to using epoll/select/.. for network IO, one can just submit a send/recv, instead of patterns like recv -> EAGAIN, epoll, recv
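
To illustrate, a rough sketch of the batched pattern using liburing (the fds and buffer sizes are hypothetical, error handling omitted):

    #include <liburing.h>

    /* Sketch only: queue several recvs and submit them with one syscall. */
    void submit_recv_batch(struct io_uring *ring, int *fds, char (*bufs)[4096], int n)
    {
        for (int i = 0; i < n; i++) {
            struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
            io_uring_prep_recv(sqe, fds[i], bufs[i], sizeof(bufs[i]), 0);
            io_uring_sqe_set_data(sqe, (void *)(long)fds[i]);  /* tag with the fd */
        }
        io_uring_submit(ring);   /* one syscall for all n receives */
    }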


It does remove the syscall overhead, but since the IO itself is still performed by the kernel, the CPU will still need to switch regularly between user and kernel level. With a full user-level network stack and correct interrupt steering, the kernel need not be involved at all and the CPU can stay in userspace all the time.

Or you can run the kernel IO thread on another CPU, but that itself has overhead compared to performing IO and handling the data all in the same thread.


I see, so the extra kernel IO thread (which can be spin-waiting) is one extra busy core, and the latency of getting data from one core to the other is the additional overhead.


Are ready completions strictly determined by continuous polling if there's no system call involved? If lots of applications end up using this method, will it increase power consumption due to many processes actively idling until a new consumable shows up in the completion queue?


They said without polling.

If the queue completely empties, then a normal application will use a system call to go to sleep.

But as long as it's not empty, the application can keep receiving events with neither system calls nor active polling.

You'd only actively poll in very specialized/niche cases.
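
To make that concrete, a minimal sketch using liburing (the handler name is made up; a real event loop would also handle errors and resubmission):

    #include <liburing.h>

    extern void handle_event(void *tag, int res);   /* hypothetical application callback */

    /* Drain completions that are already there without any syscall; only when
     * the completion queue is empty do we make a syscall and sleep in the kernel. */
    void completion_loop(struct io_uring *ring)
    {
        struct io_uring_cqe *cqe;

        for (;;) {
            if (io_uring_peek_cqe(ring, &cqe) != 0) {      /* queue is empty */
                if (io_uring_wait_cqe(ring, &cqe) < 0)     /* syscall: block for work */
                    break;
            }
            handle_event(io_uring_cqe_get_data(cqe), cqe->res);
            io_uring_cqe_seen(ring, cqe);
        }
    }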


Thanks for explaining. I was confused by the "no system call on consumption" example in the blog post, but if it uses a system call after emptying the completion queue then that'll work just fine.


I'm genuinely curious; both of these changes seem to be exciting due to the ability for people to extend and implement specialized code/features using the kernel. Since the Linux kernel is GPLed (v2, I believe?), does this mean that the number of GPL requests related to products' operating systems is likely to increase, since groups using this extensibility will be writing code covered by the GPL which might actually be of value to other people? Or does the way io_uring and eBPF are implemented isolate the code in such a way that extensions built through their frameworks won't be affected by the GPL license?


I don’t know about io_uring, but for BPF programs only the kernel space needs to be licensed as GPLv2. Everything on the user space side is handled with system calls or higher level libraries that aren’t GPL licensed (libbpf).


io_uring is a data structure, not code. It's not Turing complete, so there is absolutely no way it would extend GPL virality from the kernel into userspace.

eBPF is code, and follows similar rules to kernel modules. That is, non-GPL-compatible eBPF code is allowed, but a subset of APIs (helpers, like module symbols) are only available to GPL-compatible eBPF programs.
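
As a concrete illustration, here is a minimal libbpf-style program (a sketch, not from the article); the license string is what gates access to GPL-only helpers such as bpf_trace_printk:

    #include <linux/bpf.h>
    #include <bpf/bpf_helpers.h>

    char LICENSE[] SEC("license") = "GPL";   /* declare a GPL-compatible license */

    SEC("tracepoint/syscalls/sys_enter_openat")
    int trace_openat(void *ctx)
    {
        /* bpf_printk wraps bpf_trace_printk, a GPL-only helper; the kernel
         * refuses this program if the license above is not GPL-compatible. */
        bpf_printk("openat observed\n");
        return 0;
    }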


What seems to prevent any GPL issues with io_uring is the "linking" part of GPLv2. Covered here: https://www.gnu.org/licenses/gpl-faq.en.html#GPLStaticVsDyna...

Glibc being the entry point for the syscall, and glibc being LGPL is specifically why it's "okay". If you were to directly link an application to the kernel code, it would be viral.


Licenses only matter to the extent that the resulting product is a "derivative work" of the GPL code. If it's not derivative, then you have no copyright claim that requires the license to permit you to use it.

While the exact nature of when a software project is a "derivative work" of the libraries it depends on is still somewhat of an open legal question, I would be very surprised if anyone were to find that a computer application were a derivative of the OS it runs on. The typical understanding of the industry is essentially a process boundary, and the boundary a system call represents is closer to a process boundary than it is to a library call.


> The typical understanding of the industry is essentially a process boundary,

I agree that this is the typical thinking but I've always found it a little silly and arbitrary. It implies that if I write a GPL-licensed library and release it along with a thin wrapper program that gives it a command-line interface, say it does something like a complicated calculation which reads some data and outputs a single number; then someone could come along and write a program that would not work without it, say something that transforms another input format and then passes it to my calculation. As long as that program calls my "library" as a "program" (using "system()" for example) then they are not bound by the GPL, but if they link to my library and call the calculation directly, then all of a sudden they are?

This linking vs. process boundary thing always seemed like the wrong way to determine if a program is a derivative work of another. If someone writes a program that does not work without the GPL code, they should be bound by the GPL, regardless of whether it's linked, loaded into the same process, called through the command line, or over the wire.

This last one would obviously be controversial, but frankly a lot of companies do hide their use of open source code behind a REST API, and avoid adhering to any particular licenses that way, since they are not "distributing" the software.


That goes beyond what most people consider the GPL to cover. There are other licenses with stronger copylefts specifically to cover that last case -- notably, the AGPL.

I suspect trying to make the case that GPL's viral copyleft isn't limited to strictly linking but potentially any interaction with it would probably have a chilling effect on the use of GPL code, and this reinterpretation would only reinforce some people's prejudice against the GPL, a la Ballmer's "Linux is cancer" line.

Maybe it's the pragmatism in me, but I think it would have a net negative effect long term, unless it managed to flip all of the tables and convince everyone to use all GPL code, instead of making people reject copyleft wholesale.


> potentially any interaction with it

But that's not what I said. I said programs that do not work without some other program, is, in my opinion, a derivative work. I just don't see how the calling mechanism even plays into that judgement.

I do agree that there are other licenses such as the AGPL that try to cover these cases.

And arguably the online thing is a whole different ball of wax, because you can talk about software using a service, etc. It really is tricky in that case.

But I don't see the reason to distinguish between calling a function via the C stdcall mechanism, vs. "popen" and capturing stdout. It's exactly the same, logically; the only differences are details that imho should not matter for the legal case.

Right now, if I release a GPL library, what stops someone from coming along and writing a CLI program that just wraps every function with some textual interface, and including that with their closed-source program? The GPL becomes pretty toothless if it's bypassed so easily.


I'm under the impression that's what the Remote Network Interaction clause of the Affero GPL license is supposed to do. The "boundary" is then if someone is interacting with the AGPL code at all, so when you use the AGPL-licensed code behind a REST API, even if that's on someone else's server, the use of that code in producing any response to the API request requires publishing the AGPL'ed code/modifications.


> If someone writes a program that does not work without the GPL code, they should be bound by the GPL, regardless of whether it's linked, loaded into the same process, called through the command line, or over the wire.

Let's say I'm writing a refinery simulator to sell to people, and I use a GPL command line utility to do some particular calculation about flow rates.

Now I'm GPL just for outsourcing a single equation. But only because that's the only program around for doing that calculation. As soon as someone else reads a paper on the subject and makes an alternate program for that math, my program is no longer GPL?

Those consequences sound like a mess I don't want to deal with.


> Now I'm GPL just for outsourcing a single equation. But only because that's the only program around for doing that calculation. As soon as someone else reads a paper on the subject and makes an alternate program for that math, my program is no longer GPL?

I don't really see the problem. You are saying that if you change your dependency to a non-GPL program, then you are no longer GPL. The answer to your question is simply "yes".

We are not talking about patents here, but copyright. If someone comes up with an alternative implementation with a different license, you are perfectly free to start using it instead, what's the issue?


> if you change your dependency

Depends on what you mean by changing the dependency. Let me lay out the scenario in more detail.

The program is still exactly the same. It asks to be pointed at a fluid sim program, and then uses that for some of the math it needs.

When it was coded, the only dependency it could use was GPL.

Now there's a new non-GPL dependency it could be pointed at, with the same API.

Now it's possible to run the program without using any GPL code. Does that make the program no longer GPL, even though it didn't change?


I see your point now.

I'll answer with my own hypothetical. If I write a program that dynamically links a library performing the same GPL'd fluid sim calculations, it is presumably forced to be GPL, because it links to it. What if someone comes along and runs the program but at runtime uses LD_PRELOAD to override the dynamic linker, linking it to an alternative library that presents the same interface. Is the program still required to be GPL?

I don't really have an answer to your specific proposed loophole, it's pretty clever and is a very good question; but I don't think the calling mechanism is part of the issue. You could make the same argument whether you are talking about a "program" or a library. The calling convention is a meaningless detail imho.

I think you are specifically responding to my "does not work without" interpretation overly literally. Clearly if the program is written for and tested against a specific interface of a GPL'd program, it is intended to work with that program.

On the other hand if it's written to call into some kind of standard interface, it no longer requires that GPL program specifically, but could work with any program implementing that interface. And I will admit that whether a program is written only to work with a GPL program/library/whatever, or is more general, may be up to interpretation, what is considered "standard", etc., but that is exactly my point -- law is nuanced. If it were possible to codify laws perfectly with overly simple rules like "the copyright applies because it's a DLL and not a program", then we wouldn't need lawyers.

In law, intent is important. If I write a non-GPL program that depends on the functionality of a GPL library, I can go find all sorts of ways to not "link" to it but still use it, e.g., as a program, a service, etc. -- and it happens -- but the intent, which was to find a way to use GPL software without adhering to its license, is still quite clear.


> I'll answer with my own hypothetical. If I write a program that dynamically links a library performing the same GPL'd fluid sim calculations, it is presumably forced to be GPL, because it links to it. What if someone comes along and runs the program but at runtime uses LD_PRELOAD to override the dynamic linker, linking it to an alternative library that presents the same interface. Is the program still required to be GPL?

I've never believed that linking made your code necessarily GPL in the first place. I don't care what the FSF says, they're not exactly unbiased.

> I think you are specifically responding to my "does not work without" interpretation overly literally. Clearly if the program is written for and tested against a specific interface of a GPL'd program, it is intended to work with that program.

> On the other hand if it's written to call into some kind of standard interface, it no longer requires that GPL program specifically, but could work with any program implementing that interface.

Well that's basically how the standard already works. If your code is using a specialized enough interface, sharing data structures you got from the GPL code, then it's derivative of the GPL code and needs to follow the GPL.

So while "process boundary" is an inexact tool, your suggestion of "does not work without" doesn't seem significantly better to me.


Yeah, I think you make some great points and I'll give you that; you probably did show here why my idea is not correct. I don't know the right answer, I'm certainly no lawyer ;)

I just know that, to me, "dynamic linking" seems like an arbitrary and imprecise way to define "derivative work". And I'm not sure whether it's really something that _can_ be defined and possible to determine without considering it on a case-by-case basis. It's a good rule of thumb, perhaps, but doesn't strike me as either necessary or sufficient to really define it. We'll never really know, I guess, until someone makes that actual argument in court.


Perhaps I phrased something odd? We are saying the same thing from my perspective. By "prevents GPL issues", I'm saying the user space code wouldn't need to be GPL.


What I took away from your message is that you believe that using a syscall directly would cause your application to require GPL licensing, and that glibc being the code that calls the syscall is what prevents it normally.


Sorry, no. I was saying the opposite. I called out glibc to point out that the only linking (as in the linker) was linking to LGPL code.


FYI, the userspace portion (liburing) is dual licensed LGPL/MIT: https://github.com/axboe/liburing


> If you were to directly link an application to the kernel code, it would be viral.

That doesn't sound right, how is a io_submit syscall different from a read syscall? Obviously if you write a kernel module it links with the kernel, but just issuing a syscall shouldn't be considered linking, otherwise every single proprietary software that issues a raw syscall would be GPL-infringing.


We are saying the same thing. "Directly link" meaning dynamic or static library linking. That's where GPLv2 draws the line.


Stop using the term "viral". It's propaganda made up by microsoft.


I wish people would stop trying to work around the GPL. It causes immense heartache and someday, the kernel developers will revolt and destroy any avenue that you try to use. After all, that's what just happened with the NVIDIA GPU drivers with Linux 5.9.

Also, you're still pulling in parts of Linux into your code, so GPLv2 still applies.


I don't see how that would apply to an application using io_uring through syscalls. It's not linking to any GPLv2 code, and the header file (io_uring.h) is dual licensed as GPLv2 and MIT. Similarly, if you choose to use the higher level liburing, it's dual LGPL and MIT licensed. It's all very deliberately not viral for user-space applications.


That is not true for eBPF, though. That stuff works pretty much exclusively by instrumenting and manipulating Linux itself.

That said, libbpf is LGPLv2+, even though a lot of the stuff you'd pull in to use eBPF at the kernel level forces it to be GPLv2.


The 'workaround' will be for companies to use it for internal projects, and to decline to publish their work. The kernel is licensed under GPLv2 after all, not the AGPL.


Who added the two generic Covid paragraphs to the start of this otherwise good article? _Please_ stop.


Such an odd thing to open an article about IO tech with …


Phoronix showed that a recent bug fix in io_uring negated most of the gains when they profiled redis


How does it make Linux compare to Windows, OSX and *BSD?


The underlying principle of submitting a job to be done and letting the kernel do it (submitting a read() and waiting for it to complete), as opposed to the original model of waiting for the kernel when it's ready to let you do the job (waiting for an fd to become readable so you can call read() on it) is the same as the completion-based model of Windows async I/O (and I think BSD's kqueue too?) as opposed to the readiness-based model of epoll.

The part where this submission and completion information involves a ring buffer mapped to kernel space is unique to Linux, I believe.
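
For contrast, the readiness-based shape looks roughly like this (an epoll sketch under assumed, simplified conditions; the actual read() still happens in your code after the kernel says the fd is ready):

    #include <sys/epoll.h>
    #include <unistd.h>

    /* Readiness model: the kernel only tells you an fd is readable;
     * you still issue the read() syscall yourself afterwards. */
    void readiness_loop(int epfd, char *buf, size_t len)
    {
        struct epoll_event events[64];

        for (;;) {
            int n = epoll_wait(epfd, events, 64, -1);   /* wait for readiness */
            for (int i = 0; i < n; i++)
                read(events[i].data.fd, buf, len);      /* do the work ourselves */
        }
    }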


> The part where this submission and completion information involves a ring buffer mapped to kernel space is unique to Linux

RIO is very similar and predates Linux version by many years: https://docs.microsoft.com/en-us/previous-versions/windows/i...

Main downside, that thing is only for sockets.


And plain old overlapped I/O can lock pages in memory. With Direct I/O, devices can directly DMA into these buffers, so theoretically any device can provide similar functionality and use completion ports for notification.

(or polling, since the possible high "interrupt rate" bottleneck of completion notifications is one of the things that motivated RIO)


Mapping pages is relatively expensive. Not the mapping itself, but as a consequence of such an update the CPU has to flush at least a portion of the TLB cache. With overlapped I/O the kernel has to do that for every I/O request.

With RIO and now io_uring, kernels map buffers to both kernel and user address spaces just once on initial setup, and reuse the same buffer for many I/O operations.
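
liburing exposes this as registered buffers; a rough sketch (the fd, buffer size and error handling are all assumptions, not from the article):

    #include <liburing.h>
    #include <sys/uio.h>

    /* Register a buffer once, then reuse it for many reads without the
     * per-operation mapping/pinning cost. */
    void fixed_buffer_read(struct io_uring *ring, int fd)
    {
        static char buf[64 * 1024];
        struct iovec iov = { .iov_base = buf, .iov_len = sizeof(buf) };

        io_uring_register_buffers(ring, &iov, 1);   /* one-time setup */

        struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
        io_uring_prep_read_fixed(sqe, fd, buf, sizeof(buf), 0, 0);  /* buf_index 0 */
        io_uring_submit(ring);
    }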


You're right, I forgot about the cost of locking pages for every overlapped I/O.


The batch system call part is not so hard on its own: https://github.com/c-blake/batch


The io_uring interface is designed to allow encoding arbitrary syscalls, although it seems that only a handful are actually supported


It's under active development, more syscalls keep getting added. Contributions are welcome.


Well, it was not hard for me to add batching for every syscall. The whole module is under 200 lines of C. Granted, I only did x86_64.

I think it has some bearing to those using eBPF to just batch calls, too. Unless I am missing something, I do not think there needs to be any super-user/root/capability restriction on syscall batching since all the syscalls check permission "on the inside". That gives it maybe more scope for applications.

That sys_batch is kind of a tiny "jump-forward-only" assembly language where you can use the output of prior calls in later ones. The jump forward only (no loops) I do should also guarantee termination { at least conditioned upon all syscalls terminating...but that's a whole other domain ;-) }. (EDIT: IIRC, the article that this conversation is about was excited about this aspect. In my examples/ I have an "mmap a whole file in one syscall" example.)


I don't personally find such naive syscall batching flexible enough to be all that useful.

No ability to do such common things as concatenate strings from syscall results, or branch according to stat() output for example, makes for a severely limited interface.

io_uring is already in mainline, and it delivers syscall batching as a side-effect while bringing async to the table. I just don't see the point in adding another, severely limited syscall batching thingy, certainly not now, even moreso with talk about ebpf logic joining the party.

BTW as mentioned in a sibling comment, you might want to check out mingo's syslets proposition, which has similar naive batching, but was also async.

https://lwn.net/Articles/221887/


It can branch based on the syscall return value, and it can copy word-values like file descriptors from an open to an fstat and so on. That copying could perhaps be extended to strings in some limited way, but I agree it is generally much more limited.

Not needing superuser/any special capability is also nice, though. Not sure of the current status/plans, but I am pretty sure eBPF needed root for a very long time.

Anyway, I was not trying to "compete" with you or try to "get into mainline". A module works fine for me. Was just exhibiting an easy possibility about some points discussed.

EDIT: and thanks for the pointer. I will check it out.

EDIT2: and much like the word copy is a fake syscall, other fake syscalls like a "value test" could be added to forge a sort of if condition jump forward thing. My little repo there is more a proof of concept than anything else.


> I was not trying to "compete" with you

It's not like I have a dog in this race, I'm just another consumer of these kernel interfaces...

But I am happy something has finally landed upstream that we can start writing generic userspace programs against and actually expect them to work on distro kernels in the future. But we probably still need a compatibility layer for emulating it in userspace; that looks feasible.


Ah. "Contributions are welcome" made me think of you as one of the welcomers.

Compatibility-layer-wise, I did actually do that for my batch system. In the tiny user-space entry point I check if sys_batch is working and if not I fall back to just a loop of userspace making syscalls. That also checks a BATCH_EMUL environment variable to force that emulation mode for benchmarking purposes { so I don't have to unload/reload the module. :-) }

So, user code would always just work, but work faster on kernels with the module loaded.


I've just been following the io_uring mailing list as of late. It appears to be a welcoming environment to outside contribution, assuming quality and relevance of course.


Interesting. This is from 2007. What happened with the idea?


I'd love a TL;DR explanation for why a blanket interface is not possible. I can guess that there are different ways that syscalls handle parameters and there are different families of syscall behaviours, but I'm not sure if that's the reason. I'd love a quick intro and pointers to more details.


While I won't be writing a TL;DR summary, this isn't exactly unexplored territory, and it's been documented by lwn:

https://lwn.net/Articles/316806/

https://lwn.net/Articles/221887/

https://lwn.net/Articles/219954/



Those articles (and syslets) all seem to have this strong async focus. That could simply reflect pengaru's bias/interest. So, I do not know for sure, but I suspect the TL;DR is "async & scheduling in the mix makes it hard to get right". That complexity may also relate to the missing syscalls.

My approach has both the virtue and curse of being too simple to worry about all that, but it _does_ remove basic syscall overhead.


You can look at my code in the link I gave. It's pretty short and should work on kernels 3.x series to 5.x series. { EDIT: I realize this may make you ask your question even more strongly. :-) }


I wouldn't be doing my job if I failed to mention that both Alexei (eBPF) and Jens (io_uring, block) work at Facebook. Beyond them, we've got a bunch of folks working on the primitives as well as low-level userspace libraries [0] that enable us to use all of this stuff in production, so, by the time you're seeing it, we've demonstrated that it works well for all of Facebook's load balancers, container systems, etc.

[0] https://github.com/facebook/folly/blob/16d6394130b0961f6d688...


Are I/O libraries like Tokio for Rust using io_uring?


Some are, others will, and others won't.

One problem with io_uring is that it's completion-based I/O, where you move ownership of a buffer to the kernel, which then writes to it until the operation completes.

This means you might not be able to (sync) cancel an operation occurring in the background.

This makes it harder to integrate into some I/O libraries, as the previous fact conflicts with RAII patterns.

Another thing making adoption harder is that the interfaces for reading/writing with io_uring are conceptually slightly different.

Because of this, e.g. Tokio hasn't switched to io_uring yet but still uses readiness-based async I/O as far as I know. (Which doesn't mean it won't support it in the future.)

This issue might be relevant (mio is internally used by tokio for async I/O): https://github.com/tokio-rs/mio/issues/923


The glommio library from DataDog was specifically built around io_uring: https://www.datadoghq.com/blog/engineering/introducing-glomm...


There's the ringbahn[0] project, which is a couple of layers wrapping the liburing library; it seems like it's presenting itself as an orthogonal runtime to tokio. Meanwhile, looks like tokio is looking into the feasibility of using io_uring[1], though I'm not sure if it would be using ringbahn or not.

[0]: https://github.com/ringbahn

[1]: https://github.com/tokio-rs/tokio/issues/2411


Apart from tokio (which the sibling comment covers), you can use rio [1] which specifically uses io_uring, and can be used as a Future [2] so that you can use it as part of the wider Future ecosystem, with tokio or any other executor.

[1]: https://crates.io/crates/rio [2]: https://docs.rs/rio/0.9.4/rio/struct.Completion.html#impl-Fu...


Looks like it's on the roadmap: https://github.com/tokio-rs/tokio/issues/2411


Is this similar to XNU's Mach messaging?


So at line 12, it's a macro for a loop, right? Or am I missing something? https://gist.github.com/PeterCorless/f83c09cc62ccd60e595e4eb...


  #define io_uring_for_each_cqe(ring, head, cqe)                              \
          /*                                                                  \
           * io_uring_smp_load_acquire() enforces the order of tail           \
           * and CQE reads.                                                   \
           */                                                                 \
          for (head = *(ring)->cq.khead;                                      \
               (cqe = (head != io_uring_smp_load_acquire((ring)->cq.ktail) ?  \
                       &(ring)->cq.cqes[head & (*(ring)->cq.kring_mask)] : NULL)); \
               head++)
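
Yes, it expands to a for loop walking the completion ring. Typical usage looks roughly like this (a sketch; the handler is hypothetical):

    #include <liburing.h>

    extern void handle(int res, void *tag);   /* hypothetical application handler */

    /* Walk all currently-available completions, then mark them consumed. */
    void drain(struct io_uring *ring)
    {
        struct io_uring_cqe *cqe;
        unsigned head, seen = 0;

        io_uring_for_each_cqe(ring, head, cqe) {
            handle(cqe->res, io_uring_cqe_get_data(cqe));
            seen++;
        }
        io_uring_cq_advance(ring, seen);
    }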


The author states:

>"It’s beyond our scope to explain why, but this readiness mechanism really works only for network sockets and pipes — to the point that epoll() doesn’t even accept storage files."

Could someone say here explain why this readiness mechanism really works only for network sockets and pipes and not for disk?


I suppose this will help for the big corporate users of linux. And I suppose that's where most of the programming gets done for linux. But the rate of change and feature adoption by the big commercial pushers of linux has made linux as a desktop more troublesome due to the constant futureshock.


Io_uring will be big under the hood for frameworks and/or programming languages too. But yes, adoption will take a while..


Given it uses a queue that has a producer and a consumer, I wonder if a monitor will be required?


I think that either futexes or polling can be used for notification.


Can you elaborate? What do you mean by monitor?


I believe they are referring to the synchronization primitive [1], conceptually a condition variable plus a mutex.

[1] https://en.wikipedia.org/wiki/Monitor_(synchronization)


So asynchronous message passing is faster than syscalls? Andy Tanenbaum laughs last.


Thanks. Now I understand what netdata ebpf.plugin process is doing.


So, after all async io converges on io completion port design?


hopefully this will bubble up to higher-level C-esque languages such as PHP, for which asynchronicity is still a pain


This won't help the lack of async I/O in those languages that do not support the concept as a whole. If it can't handle epoll, it certainly won't handle io_uring.


> Joyful things like the introduction of the automobile, which forever changed the landscape of cities around the world.

What?!


The author of the black swan book explained that the covid pandemic was not what he meant by a black swan event. Because it was not something entirely unpredictable.. if we look back, we have been talking about pandemics for decades.


> Because it was not something entirely unpredictable..

To the point that as part of the Obama-Trump transition, a literal playbook was created for pandemics, with Coronaviruses (MERS-COV, SARS) explicitly mentioned:

* https://assets.documentcloud.org/documents/6819268/Pandemic-...

They had tabletop exercise on pandemics:

* https://www.politico.com/news/2020/03/16/trump-inauguration-...


That someone would completely disband the team and then ignore the knowledge acquired was kind of unpredictable. I mean, what kind of person would do that?!


People who see government not as a way to (hopefully) better society through good implementations of good policies, but rather see government as source of power to wield for their own benefit.

> Friends of the government win state contracts at high prices and borrow on easy terms from the central bank. Those on the inside grow rich by favoritism; those on the outside suffer from the general deterioration of the economy. As one shrewd observer told me on a recent visit [to Hungary], “The benefit of controlling a modern state is less the power to persecute the innocent, more the power to protect the guilty.”

* https://www.theatlantic.com/magazine/archive/2017/03/how-to-...

* https://archive.is/ZIzCm


Maybe, but I fail to see how this relates to Linux kernel (wrong thread, I suppose)


Fourth paragraph of the article brings it up.


[warning: slight offtopic]

TLDR: Any recommendations on the best way to clone one harddrive to another that doesn't take forever?

> Storage I/O gained an asynchronous interface tailored-fit to work with the kind of applications that really needed it at the moment and nothing else.

Say you have 2x 2TB SSD harddrives and one needs to be cloned to the other.

Being the clever hacker I am who grew up using linux, I simply tried unmounting the drives and trying the usual `dd` approach (using macOS). The problem: it took >20hrs for a direct duplication of the disk. The other problem: this was legal evidence from my spouse's work on a harddisk provided by police, so I assumed this was the best approach. Ultimately she had to give it in late because of my genius idea which I told her wouldn't take long.

Given a time constraint the next time this happened, we gave up on `dd` and did the old mounted-disk copy/paste via Finder approach... which took only 3hrs to get 1.2TB of files across into the other HD - via usb-c interfaces.

I've been speculating why one was 5x+ faster than the other (besides the fact `dd` does a bit-by-bit copy of the filesystem). My initial suspicion was the options provided to `dd`:

> sudo dd if=/dev/rdisk2 of=/dev/rdisk3 bs=1m conv=noerror,sync

I'm not 100% familiar with the options for `dd` but I do remember a time when changing `bs=1M` to `bs=8M` helped speed up a transfer.

But I didn't do it for the sake of following the instructions on StackOverflow.


I wrote some Python glue that constructed a bunch of dd commands to run concurrently which helped when I last cloned a 1TB NVMe drive. Resulting commands:

    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=0 seek=0 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=1302349 seek=1302349 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=2604698 seek=2604698 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=3907047 seek=3907047 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=5209396 seek=5209396 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=6511745 seek=6511745 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=7814094 seek=7814094 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=9116443 seek=9116443 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=10418792 seek=10418792 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=11721141 seek=11721141 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=13023490 seek=13023490 count=1302349 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=65536 skip=14325839 seek=14325839 count=1302358 status=progress &
    dd if=/dev/nvd0 of=/dev/nvd1 bs=4096 skip=15628197 seek=15628197 count=6 status=progress &


Probably faster to copy via finder since it’s not actually copying every byte, but only the utilized bytes.

It might be faster to have multiple rsync operations (via xargs or the like), but if the disk is relatively empty I can see this being faster. Finding the right level of parallelism isn’t something I can help you with, probably needs some experimentation.


[flagged]


Please read the site guidelines and follow them: https://news.ycombinator.com/newsguidelines.html. Note this: "Eschew flamebait. Don't introduce flamewar topics unless you have something genuinely new to say. Avoid unrelated controversies and generic tangents."

We detached this subthread from https://news.ycombinator.com/item?id=25222895.


I despise Facebook, but this is a horrible take. They’re not actively in the business of genocide.


Even if it's through inaction, they're still liable.


Unless you're Rohingya


Myanmar?


Is that an accusation that eBPF & io_uring are somehow tools of genocide? That seems a stretch. At some point, good work is good work.

It isn't like Alexei & Jens working for a different company and implementing the same features would stop people using them. As TFA mentions, these are likely to be hot, high demand nuggets for many companies and developers.


You're stretching big with the implication that they're tools of genocide, since that's not really how his comment (however snarky) reads. Good work does not remove it from a morally messed up situation.

Facebook has caused serious issues with society and it's not cool to just gloss over this any time their contribution to technology comes up.


I understand your feelings but it's not fair to blame developers for what the management is doing.


If you choose to give your labor to a company, you tacitly approve of what the company does with the fruits of that labor.


> you tacitly approve

This view seems quite reductionist. It smells like guilt by association. It excludes the middle ground where you do not approve but ignore that due to other priorities (such as money or excitement about technology you get to work on). Approval is not a binary choice. It also is subject to tradeoffs.

So all you can derive from someone working at facebook is that their preference against indirect contribution (by several hops) to genocide is weaker than the combination of some other preferences. This may also be due to how they're discounting that responsibility distance.


Frankly, I don't understand this logic. This would mean all workers approve what their companies are doing. There are many warehouse workers at Amazon who hate several aspects of their company. Nevertheless, they still work there as they need to feed their families. That's one point.

Second, Facebook is not a tobacco or gun manufacturing company. Their main business is advertising, and they made several grave errors on the way, and they continue to do certain wrong things, but as much as you can hate them you can't morally equate any communication platform with planned genocide. It's shifting responsibility from actual people who commit atrocities to a platform that makes it easier for them to communicate. Yes, FB is easier to blame, but this approach misses the main point.


Maybe if the management changed the company's direction for the worse after the developers were hired - which seems to have happened at Google, but not at Facebook. Nevertheless, I think Facebook contributing to infrastructure for everyone is a net positive due to the large amount of outside users vs. the usefulness for Facebook.

Personally, I have a few things I won't do, which are different from what other people won't do. In a certain "Overton Window", I don't argue about it.


respectfully disagree.


[flagged]


Personal attacks and flamebait are not allowed here. Please read https://news.ycombinator.com/newsguidelines.html and stick to the rules.

We detached this comment from https://news.ycombinator.com/item?id=25222895.


BSD folks are yawning right now.


Why?


kqueue on FreeBSD is effectively like io_uring but has existed for much longer (and Windows I/O Completion Ports predate kqueue). kqueue also gives you a way to get system events which you can't get on Linux (it has an equivalent of cn_proc that isn't awful).


Huh? Kqueue is nice, but I don't think you'll get anywhere near the performance of io_uring out of kqueue:

- kqueue only tells you there's data to read (/space in the write buffer). You still need to call read() or write(), including paying the cost of the syscall. io_uring lets you batch a lot of read/write calls together and either issue a single syscall to the kernel for all calls, or have the kernel poll and never syscall at all.

- kqueue doesn't let you issue fsync, or any of the other syscalls now in io_uring. fsync is essential on the write path for correctness in lots of cases, and for that you still need to dispatch to a local thread pool or something.

So yeah, I prefer kqueue over linux's epoll. But io_uring seems like the new king.


As the other commenter pointed out, I was wrong -- I'd mixed up the correspondence between kqueue and epoll with io_uring. Yeah, kqueue doesn't allow asynchronous operations. Sadly I can't delete or edit my comment.


I would argue the api of kqueue is clunkier and less general.

io_uring can keep track of events by using eventfds, but yes, that is perhaps not optimal.


Yes, sorry I muddled things up (and now the edit window has elapsed). io_uring isn't the same as kqueue because kqueue doesn't permit asynchronous notification of job completion, nor can you do more complicated chained operations; epoll is the Linux equivalent of kqueue (and in that comparison, kqueue is better).
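
On the chained-operations point, io_uring lets you link submissions so the next one only runs if the previous one succeeds; roughly like this (a liburing sketch, error handling omitted):

    #include <liburing.h>

    /* Queue a write followed by an fdatasync that only runs if the write
     * succeeds, and submit both with a single syscall. */
    void write_then_sync(struct io_uring *ring, int fd, const void *buf, unsigned len)
    {
        struct io_uring_sqe *sqe;

        sqe = io_uring_get_sqe(ring);
        io_uring_prep_write(sqe, fd, buf, len, 0);
        io_uring_sqe_set_flags(sqe, IOSQE_IO_LINK);   /* chain to the next SQE */

        sqe = io_uring_get_sqe(ring);
        io_uring_prep_fsync(sqe, fd, IORING_FSYNC_DATASYNC);

        io_uring_submit(ring);
    }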


wasm in the kernel when?



Pity it's based on commonwa instead of wasi


Perhaps you would enjoy the sequel, then: https://github.com/wasmerio/wasmer


Never. eBPF is a good interface for running code in the kernel because of how restricted it is. Before loading an eBPF module, the kernel does a lot of verification on it, including proving that it terminates. This is famously rather hard for Turing-complete languages. (Although in this case it could be solved by giving programs a fixed time budget per invocation.)


Guaranteed termination isn't the only analysis worth having, especially since if your aim is termination (as opposed to leveraging that simplicity for other analysis) it's much easier to simply impose termination and be done with it.

So, WASM isn't entirely unconstrained, and those constraints do allow some interesting analyses that true native code cannot support, e.g. symbolic execution: https://blog.trailofbits.com/2020/01/31/symbolically-executi...

By its very design it's aimed at running untrusted, even malicious code without exposing the host to security risks, while allowing the imposition of resource constraints. It doesn't sound completely absurd to me (not an expert in this) that some project might seek to use those safety guarantees as a way to avoid paying for the more heavyweight guarantees provided by the kernelspace/userspace split.

A quick google finds lots of people trying to get this to work; who knows - it might pay off.


eBPF is already it; apparently people keep forgetting that WASM isn't the first, nor is it going to be the last, polyglot bytecode format.


I haven't seen anyone make that claim. WASM's value has always been in its pragmatic integration plan using what is already there.


Then you haven't been paying attention to the "rewrite in WASM" advocacy force and how WASM "has invented" polyglot bytecode.


I've been paying pretty close attention and read a lot of articles and discussions, starting when it was being created and I have never seen anything about either of those. No one actually writes WASM directly, so no one says to "rewrite in WASM". It is only an optimization, so the two uses are to speed up javascript and to compile native languages.

No one even says "polyglot bytecode", let alone says that WASM 'invented it'. Most programmers have already used something that uses a bytecode format, why would anyone say that?


Hopefully never


Interesting. I was just surprised as this:

> Joyful things like the introduction of the automobile

Cars cause so much pollution, noise, traffic, and take up so much space... How can you say its introduction is joyful?

About the new api: while I’m not very knowledgeable about the kernel, it seems like very good news for performance, the improvements are drastic!


> How can you say it’s introduction is joyful?

Sure, if you want to sidestep the innumerable ways the automobile, or more accurately the internal combustion engine, has completely revolutionised society, you could make an argument...

Except you can’t. Think about the increase in distance and speed at which goods and services can be rendered compared to prior modes of transportation. One simple example that comes to mind is the ambulance, in which that increase may well be the difference between life or death for an unknown but surely enormous population. A similar argument can be made for logistic supply chains delivering medicine, food, sanitary products, waste removal, without which mortality would surge. Plague, famine, disease are some of the largest killers in human history.

Please tell me again how the absolute gains measured in human lifetimes are anything other than joyful, outside of subjectivity, which is an unresolvable debate.


> Cars cause so much pollution, noise, traffic, and take up so much space... How can you say its introduction is joyful?

Horses generated literal tons of pollution in cities, were complicated to look after, and slow. The arrival of the car cleaned up the streets, made it easier to travel (and travel far) and very rapidly became the preferred mode of transport all over the world.


No it will not. Two rather specialized tools to help with rather specific issues are no reason to throw out heaps and mounds of existing and perfectly working code and solutions.


What excites me is if you're working down at the high frequency, low latency I/O domain... linux traditionally has actually sort of sucked.

We're _very_ good at buffering things up and handling large chunks to gain high throughput. We have layers and layers of magic that make us _very_ good at that.

For cases where you have lots of devices throwing small chunks at high frequency and you have to respond (in this one, do a small computation, and out that one...) we've sucked bad.

That's why bare metal RTOS's still exist.

The io_uring / eBPF combo looks _very_ promising for opening up that domain in a tidy fashion.

I also hope there will be no reason to throw away "heaps and mounds of existing and perfectly working code and solutions".

I hope to replace my finely tuned epoll / read/write reactor pattern inner loop with io_uring / eBPF and leave 99% of the code untouched... just better throughput / lower latencies.


> I hope to replace my finely tuned epoll / read/write reactor pattern inner loop with io_uring / eBPF and leave 99% of the code untouched... just better throughput / lower latencies.

I hope to do an update of Asio at some point and it will start to use io_uring automatically as a backend


io_uring is no more specialized than an SSD is today; our storage interfaces went from handling between 1 and 4 requests in parallel for the past 50 years to suddenly handling literally hundreds on consumer-grade hardware (IIRC NVMe is specced either for up to 64k queue depth or unlimited queue depth). Of course the software environment must change to keep up; is it really reasonable to continue feeding such devices one IO at a time because that's how it was done in 1970?

It is not fun to program against io_uring just as BPF can be a nightmare, but that's a problem for userspace to solve with libraries and better abstractions, much in the same way userspace usually doesn't have to deal with parsing the shared ELF segment exported by the kernel (glibc does that) or writing complex routines for fetching the system time (the ELF segment does that), both of which are implementation details necessary for extracting the full performance from modern hardware.

We'll catch up eventually, but first the lower-level interfaces must exist. In the meantime, I cringe Every. Single. Time. I run UNIX find or du over my home directory, realizing it could have completed in a fraction of the time for almost 10 years now if only our traditional software environment was awakened to the reality of the hardware it has long since run on.
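
To give a flavour of what that could look like, stat() calls during a tree walk could be issued in bulk through the ring instead of one blocking syscall per file. A liburing sketch (paths and counts are hypothetical; assumes kernel-side statx support):

    #include <liburing.h>
    #include <fcntl.h>
    #include <linux/stat.h>

    /* Submit one statx per path as a single batch, then reap the completions. */
    void stat_many(struct io_uring *ring, const char **paths, struct statx *out, int n)
    {
        for (int i = 0; i < n; i++) {
            struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
            io_uring_prep_statx(sqe, AT_FDCWD, paths[i], 0, STATX_BASIC_STATS, &out[i]);
        }
        io_uring_submit(ring);   /* one syscall for n stat requests */

        for (int i = 0; i < n; i++) {
            struct io_uring_cqe *cqe;
            io_uring_wait_cqe(ring, &cqe);
            io_uring_cqe_seen(ring, cqe);
        }
    }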


Well, at least in the near future I don't think you will see it on any other UNIX than Linux, as, unlike aio, it is not a standard but a Linux-specific API.


BPF and io_uring architectural styles both have heritage in and close equivalents at least on BSD. You could consider io_uring as nothing but a slightly fancier cousin of FreeBSD's netmap that exists for exactly the same reason, they just wear different shades of lipstick. BPF of course came directly from BSD into Linux

Any inevitable standard will likely come from userspace, much in the same way libpcap successfully papered over raw packet capture for a huge variety of operating systems. The underlying kernel interface is basically just details. We're decades past the point where application portability required standards like POSIX to make progress, I imagine comparatively few modern programmers even know much about those kinds of standards any more.


Huh? io_uring from what I've seen is mostly about direct disk async IO, which some databases like (and I prefer databases that mmap :P), while netmap is kernel bypass for networking – "mmap" for NIC ring buffers – without monster frameworks like DPDK, with a universal API that keeps all NIC specifics in the kernel drivers.


Cousins not twins.. io_uring with O_DIRECT to a block device is already architecturally equivalent to netmap with a network device. Its kernel-side internals have been kept sufficiently generic that we might eventually even see functional equivalence (but if Axboe is listening, please give us getdents64() via uring first!).


Hopefully you are right and software will not begin to lean too much on direct io_uring instead of some kind of wrapper. Thanks for the comment, it was pretty educational.


I expect these technologies to be integrated into language runtimes and webservers such that most developers won't even know it being used similar to EPOLL today. While revolutionary is an extreme characterization the performance and improved API changes are at least non trivial.


There's a very niche conference called eBPF Summit* that has presentations from people at companies with first class engineering orgs talking about what they are doing with eBPF. The problems they are solving and the breadth of problems being solved are very impressive.

[*] - https://ebpf.io/summit-2020/


yep, at the Linux Audio Conference we have a paper for using eBPF to process network audio packets this year for instance (https://lac2020.sciencesconf.org/program / paper is https://lac2020.sciencesconf.org/307835) - presentation is tomorrow (happens online)


This conference was more a Cilium conference. Google will use cilium as the default cni on gke https://news.ycombinator.com/item?id=24212021

Not speaking was Kinvolk’s CTO Alban Crequy https://news.ycombinator.com/item?id=23042722


There are tons of applications that are built upon higher-level APIs that can be updated to take advantage of this automatically. Furthermore you don't need to have every single application updated to revolutionize something. The apps that need this will be quite happy to make the jump.


You are right. For most programs, these two are more work for not much payoff.

Though they are great for some things.


Neither ebpf or io_uring are "specialized", in fact they are specifically not specialized which is what makes them revolutionary (along with being better than currently specialized apis like aio).


Project Loom is gonna make Java threads automatically use io_uring and restartable sequences. Btw Netty is currently actively working on io_uring support, which could enable truly non-blocking sockets, which could enable truly asynchronous connections to PostgreSQL through JDBC, which would enable state of the art Spring performance on the TechEmpower benchmarks


What do you mean "truly non blocking sockets"? There's API for non-blocking network sockets in Java since forever. You just have to explicitly use it. I think you mean using non-blocking sockets even for code which explicitly uses blocking sockets.



