The whole point of BPF is that you can't run this code in a separate context _and_ have the code work. The original use case was packet filtering in interrupts because waiting for user space took too long.
The whole point of DPDK is to avoid the context switches by dedicating the adapter to a single application and letting that application take over management. Once you reintroduce multiplexing, you're right back where you started. There has been a secure multiplexing scheme based on packet filters... the Berkeley Packet Filter (BPF), i.e. exactly what this paper is discussing as prior art.
No, you can simply have memory-mapped ring buffers between processes, with everything not required for the specific application cut away. You don't need any extra context switches that way.
No need for a traditional socket API, while still being able to access the adapter from multiple applications.
I have no interest in creating such a beast, but I'd be truly shocked if it couldn't at least beat a generic kernel-based stack.
It'll lose some performance compared to the single-application approach, but there might still be a niche for this kind of design.
Sure, you'll need to copy memory, but the data should almost always be in L3 cache anyway.
My intention was, however, to have only one copy (other than the initial DMA, hopefully directly into L3 cache).
Yeah, it hurts if it's 400 Gbps Ethernet: that's 50 GB/s of payload, and L3 bandwidth is only something like 50-90 GB/s. x86 just doesn't have enough bandwidth even to the caches! Better have a pretty high CPU frequency, as I think L3 bandwidth scales roughly in proportion to clock speed (not completely sure). 200 Gbps should be somewhat fine.
Also pretty bad if the data travels over a QPI link... better have both processes in the same NUMA node. And the Ethernet PCIe adapter too... :-)
Regardless, I do think it'd still be way faster than anything a reasonably general kernel stack could manage. It might be a sensible compromise when process and permission isolation is required.
BPF can handle 60 million packets per second; adding a user/kernel context switch would kill that by a factor of 1000x. So while your code may not care about it, there are definitely low-latency applications where milliseconds equate to dollars.
Tell that to the various libc maintainers. They have been blocking C11 Annex K (the bounds-checked interfaces) for over a decade now.
And talking about performance: with good compilers my secure memcpy_s is actually faster than the glibc or BSD libc memcpy, thanks to compile-time constant checks.
Then you are delusional. None of their safety guarantees are real: memory, type, concurrency, none. But talking to them falls on deaf ears. It's called hype-driven development and it's very popular among HN folks. There exist plenty of genuinely safe languages, though.