OpenBSD 6.0 released

mrb · on Sept 1, 2016

"To deter code reuse exploits, rc(8) re-links libc.so on startup, placing the objects in a random order."

I love this defense in depth, buried in the release notes... https://www.openbsd.org/60.html

armitron · on Sept 1, 2016

This is pretty much useless. Attackers have moved on to using information leaks in order to determine memory layout and placement of objects.

So in practice, this doesn't really deter anything. Information leaks are everywhere.

aweinstock · on Sept 1, 2016

Without this patch, using an information leak to bypass ASLR would be as simple as leaking any address in libc (__libc_start_main is often convenient since it's where main returns to when the progam ends, and hence is already on the stack), subtracting the (fixed, public) offset from the leak to calculate the base address libc was loaded at, and adding the (again fixed and public) offset from libc's base to some useful function (e.g. system(3)).

With this patch, the order of symbols are randomized every boot, so the offsets that were previously fixed and public aren't.

It's probably still possible, if you have multiple adaptive memory reads, to traverse the datastructures that legitimate programs use to find the order of the symbols, but a) not all information leaks can be used to read arbitrary addresses multiple times, and b) the traversal code becomes more complicated for the attacker to {write,maintain} relative to the simple arithmetic that was previously possible.

The above analysis is for remote exploits, local privilege escalation exploits can just read libc to determine the order (since it's re-linked on boot, not per process).

CraigJPerry · on Sept 1, 2016

I don't know why you're being down voted. We've had ASLR in Linux for years and various attacks have been successful.

The string format attack is the one I remember most clearly which basically renders ASLR useless.

aweinstock · on Sept 1, 2016

Attacker-controlled format strings are very convenient bugs, but they can't do everything. Consider the program:

    int main() {
        char buf[20];
        fgets(buf, sizeof buf, stdin);
        printf(buf);
        return 0;
    }

An attacker writing to the program's stdin can read at offsets to the stack (e.g. "%42$x"), read the contents of arbitrary non-null memory (e.g. "ABCD%5$s", where ABCD is a 32-bit memory address, and 5 is the positional parameter corresponding to the start of buf), and write an arbitrary value to an arbitrary address (e.g. "ABCD%38x%5$n", to write the value 42 to address 0x44434241).

A significant limitation of the vulnerability in the above program is the attacker can't, in a single execution of the program, read a value, then do computations on it locally, then write a value based on those computations. This flexibility is needed in order to bypass ASLR.

CraigJPerry · on Sept 1, 2016

Kudos for the worked example!

_pfxa · on Sept 1, 2016

Obviously it's not useless. Not all attackers have agreed collectively to use differents methods. Also, if it leaks in multiple points, it's better to start from a hole instead of letting it all go before finding a patch big enough for all.

aomix · on Sept 1, 2016

Pretty much any security measure in isolation can be bypassed. So you need to stack as many as are reasonable on top of each other to limit the attacker's toolkit.

This change is basically free and makes certain things more difficult for attackers, so why not?

justin66 · on Sept 1, 2016

This seems like a big deal:

One thing to note: this will be the last version of OpenBSD to be pressed on CD. The project will now focus on internet-only distribution, giving much more flexibility in the release schedule.

kgwxd · on Sept 1, 2016

It does? I was under the impression the only reason anyone still did that was to give something physical to donators.

throwaway7767 · on Sept 1, 2016

CDs are read-only, whereas a USB stick has (generally insecure) firmware that can be modified. So in a setting where the user is concerned about persistent USB malware, it can make sense to use them.

It's a niche concern of course, but I suspect it's a little bit more common in the OpenBSD community than in many others. Of course it shouldn't be too hard for those users to build their own install boot CDs with the tarballs for installation, just skip the big package sets and install them later if the whole won't fit on the disc.

soccerdave · on Sept 1, 2016

What happens if your cd that comes in the mail is swapped out for a different cd by a malicious actor?

kabdib · on Sept 1, 2016

You check hashes (from the web site) after you copy the files to media you trust, and then any tampering is really interesting . . .

aruggirello · on Sept 1, 2016

You can find well-known hashes to check your install media with online (assuming you can trust the media because it comes wrapped up in a nice package is a bit naive...). What you cannot check however, is a USB key firmware (ring-0?) malware, that lives completely outside of disk space, and thus is beyond reach, but might have limited (or even complete) access to the system it is plugged to. CDs and DVDs (don't know, but perhaps SD/compact flash/etc. cards too?) are thus the last mainstream media that come with 'no firmware attached'.

joyfulgerard · on Sept 2, 2016

> What happens when a malicious actor leaves physical evidence all over?

They get caught.

ue_ · on Sept 1, 2016

Perhaps I'm the only person using computers in 2016 who only installs OpenBSD via CD :-)

agumonkey · on Sept 1, 2016

I only do to enjoy the pleasure of old ways. Interacting with CDs and CD drives reminds me of when it was cutting edge. Tray mechanics, the speed intake of spinning motors. Even the latency and seek sounds. And somehow, the (almost since it's on DVD RW) immutability. In some ways, the sheer speed of SSD is ... boring when you're not in a hurry.

ps: a bit like vynil lovers who take their player out for similar reasons.

kgwxd · on Sept 1, 2016

I do that too. I also like to imagine a post-apocalyptic, no-internet time where I have a generator, PC, BSD disk and the knowledge to use them to rule the new world while everyone else is stuck on the "Activation Required" screen.

PhantomGremlin · on Sept 2, 2016

Fortunately OS X isn't as bad as Windows. I haven't tried the most recent version, but for previous versions I have successfully installed OS X from a USB stick, without needing any internet access at all.

However, the OS X binaries are signed. So, when the key expires, people are SOL. I had this problem, and easily re-downloaded my various older OS X versions. But in the general case it's quite disturbing: http://www.macrumors.com/2016/03/03/older-os-x-installers-br...

aruggirello · on Sept 1, 2016

Reminds me of my old faithful Amiga, and its 3,5" floppy drive...it made a distinct, almost musical sound. You just turned the machine on and that famous splash screen would appear, then the empty drive would start ticking regularly, waiting for an 880Kb disk. As soon as a disk was inserted, you could hear its typical seek noise while the system booted, track after track...

pjmlp · on Sept 1, 2016

And the famous click of death.

keithpeter · on Sept 1, 2016

I can recommend an offline install of Debian using the DVD1, DVD2 and DVD3 binary iso images. No need for an Internet connection.

Depending on the software you wish to install, you will be swapping between the DVDs a few times (e.g. texlive/gnuplot/r-base).

So far as I am aware, Debian is the only distribution that provides a fair chunk of the packages available in the (main) repository for effectively offline installation.

cyphar · on Sept 2, 2016

SLE (and RHEL) do too, and I think that openSUSE Leap does as well (but I don't think that it's visible from the main download page).

keithpeter · on Sept 2, 2016

Thanks, I'll check the openSUSE Leap arrangements.

cyphar · on Sept 2, 2016

Damn, I just double-checked and it looks that I was mistaken[1]. I'll send an email to opensuse-project@opensuse.org to see if there's a reason we can't produce ISOs like that. Since both SLE and openSUSE use OBS (the Open Build Service), it should be possible to publish those images without much issue.

[1]: http://download.opensuse.org/distribution/leap/42.1/iso/

cyphar · on Sept 2, 2016

Here's the thread: https://lists.opensuse.org/opensuse-project/2016-09/msg00000...

keithpeter · on Sept 2, 2016

So at present, offline installation with a reasonable package selection is Debian off the shelf and Slackware with a carefully curated cache of slackbuild produced binaries.

I suppose one could cache the OpenBSD package binaries and point pkg_add to the relevant storage media to make a third option.

cyphar · on Sept 3, 2016

You could also cache the openSUSE packages as well (zypper allows you to add any directory as a repo). If you _really_ wanted to do it, you could do a recursive copy of http://download.opensuse.org/distribution/leap/42.1/repo/oss... with wget and then point zypper at that directory.

Touche · on Sept 1, 2016

I haven't even owned a computer with a CD/DVD drive in over 10 years.

One thing that drives me mad is that computer manufacturers still distribute recovery images as DVDs, even though the computers the discs are for don't have drives.

SixSigma · on Sept 1, 2016

USB CD/DVD drives exist

Touche · on Sept 1, 2016

Yeah I know, it's just annoying to own something you'll need once every 2 years or so.

dnzm · on Sept 1, 2016

In fact, I prefer to own something that I use once or twice a year (and between different computers at that), and then store in a box somewhere, rather than having a piece of computer suck in dust and whatnot all the time.

Not exactly a counterpoint to what you were saying, but somehow related.

SixSigma · on Sept 1, 2016

Such as re-installation DVDs

kgwxd · on Sept 1, 2016

I still install all my OSes via CD/DVD but I burn them myself from an ISO.

jimktrains2 · on Sept 1, 2016

The last couple installs of any OS I've done have been a few off of USB and the rest off PXE.

anthk · on Sept 1, 2016

I install it from the ramdisk, much better.

GTP · on Sept 1, 2016

How? How can you make a ramdisk resistant to a reboot?

clarry · on Sept 2, 2016

You don't need to. It's created a-fresh for you when you boot bsd.rd. After you're done installing, you don't need the ramdisk as you'll boot your newly installed bsd kernel.

GTP · on Sept 2, 2016

Are you referring to the upgrade method described here [1]?

[1] https://www.openbsd.org/faq/faq4.html#bsd.rd

Gracana · on Sept 1, 2016

One nice thing about the physical CDs is that they're trustworthy source for the release's signify pubkeys (which you can compare with the ones on the website.)

angry-hacker · on Sept 1, 2016

What if someone tampers the CD before it is delivered to you?

dmm · on Sept 1, 2016

Don't forget the awesome stickers!

groovy2shoes · on Sept 1, 2016

Yeah, and even the packaging. The latest release (5.9) was the first "official" OpenBSD CD set I ever bought, and I was astounded by the quality of the set, from the packaging and inserts to the discs themselves. I don't know why, but I didn't expect it to be so... professional. I pre-ordered this release, too. I like being able to help out the OpenBSD project financially and get something nice out of it. Oh, well. RIP, OpenBSD CDs.

justin66 · on Sept 1, 2016

The flexibility in the release schedule strikes me as very useful. The CD sales business is admittedly less interesting.

ghshephard · on Sept 1, 2016

It is - I did not know this. I purchased a USB-DVD player for the sole purpose of getting a physical delivery of OpenBSD. Sad to see it go by the wayside.

cupantae · on Sept 1, 2016

I can't tell if this is a joke.

nayden · on Sept 1, 2016

it is not a joke. http://www.openbsd.org/lyrics.html#60f

orbitingpluto · on Sept 1, 2016

How did I not know about these songs? How?

ghshephard · on Sept 1, 2016

Which part - buying a USB/DVD player just for OpenBSD? In three+ years that's literally the only thing I've used it for. Nothing else.

The part about OpenBSD 6.0 not being on CD? I have no idea - this is the first I heard about it. Couldn't really find anything on the site either other than the song lyrics referenced above...

bch · on Sept 1, 2016

I thought the (forced) schedule was considered a feature, to allow users to know when to expect updates and to have developers know when work had to be finalised for inclusion in the release.

mytec · on Sept 1, 2016

I enjoyed receiving the CD package and the associated artwork.

hackuser · on Sept 1, 2016

This is a good opportunity to ask: Can anyone recommend a laptop I could put OpenBSD on and be fully functional for busy workdays (i.e., when I need to spend 100% of the day being a knowledge worker with a reliable tool I don't have to think about, and 0% being a sysadmin trying to get their tool to work)?

On one hand I've seen threads on HN and Reddit saying how OpenBSD works flawlessly, esp. on various Thinkpads. OTOH I read in-depth reports like these, by OpenBSD insiders who know far more than I ever will, and even their results seem insufficient:

* https://gist.github.com/reyk/80dca43c8bcfa76d2a7ff147ea64d44...

e.g., "The [wlan] connection is sometimes not stable or the firmware shows errors when switching the AP configuration. But stsp@ is actively working on improving it!"

* http://www.tedunangst.com/flak/post/Thinkpad-Carbon-X1-2015

* https://marc.info/?l=openbsd-misc&m=145275871714024&w=2

EDIT: For those interested in this topic, I would start with tedunangst's excellent summary from earlier this year. Thank you Ted!:

http://www.tedunangst.com/flak/post/openbsd-laptops

aaron_m04 · on Sept 1, 2016

I used OpenBSD CURRENT on the Thinkpad Carbon X1 for 3 months earlier this year, and my experience has been that hardware support is not a problem for a "knowledge worker" (I am assuming you don't need things like HDMI audio for that).

The main problem is software that is outdated, unavailable, or buggy on non-Linux platforms.

I came crawling back to Linux in the end.

hackuser · on Sept 1, 2016

Thanks. Though a knowledge worker sometimes does need to connect their laptop to a TV for collaboration and presentations.

> The main problem is software that is outdated, unavailable, or buggy on non-Linux platforms

What kinds of applications could you not find a good solution for?

aaron_m04 · on Sept 1, 2016

> What kinds of applications could you not find a good solution for?

* LibreOffice Writer (lowriter) had an annoying bug where it would sometimes take over a second to redraw the toolbar buttons, and the editor was unavailable during that time. This happened on save, resize, or unhiding the window.

* Konsole's logic to identify the name of the foreground process would occasionally go into an infinite loop and all the konsole windows would lockup and need to be kill -9'd. It can be avoided by using tmux but that's inconvenient. This is sad because it's the most featureful and user-friendly terminal emulator I know of.

* No good C++ IDEs.

* No support for cargo in the rust packages.

* No workable virtualization.

Also, if you use OpenBSD on a laptop, you'd be crazy not to use GNOME 3 in my opinion. Support for the other DEs are not on par.

hackuser · on Sept 1, 2016

Thanks for the tips; that's very helpful

> * No workable virtualization.

For those interested, they are working on it AFAIK. Look up vmm/vmd.

clarry · on Sept 2, 2016

I don't think it's crazy at all to stick with the fine WMs that ship with OpenBSD. Such as cwm. It's well supported.

aaron_m04 · on Sept 1, 2016

I forgot another one:

* No Atom editor (no binary package, and I went through hell trying to compile from source but was unsuccessful).

_lwad · on Sept 2, 2016

dwm works just fine

cylinder714 · on Sept 1, 2016

I run OpenBSD on a T420s and an X220i without any trouble, but I made very sure to get Intel graphics and WiFi as Nvidia and Broadcom are not supported.

carapace · on Sept 1, 2016

Same question, for desktop server/workstation?

clarry · on Sept 2, 2016

If you don't need ridiculous performance, I can recommend the shuttle fanless mini PCs. I have a DS437 and a DS57U7. Both used to be desktops, now the former serves as a server.

I think there's a new one out now (DS67*), I haven't tried it yet. But I might buy one. Or wait for the next gen of intel CPUs.

http://dmesgd.nycbug.org/index.cgi?action=dmesgd&do=view&id=... http://dmesgd.nycbug.org/index.cgi?action=dmesgd&do=view&id=...

Note that I swapped the wlan card on one of these, for an Intel one. I don't know if the original card would've worked; I never tried it.

carapace · on Sept 2, 2016

Thank you. :-)

riffraff · on Sept 1, 2016

I am not an OpenBSD user, but as in every release I am happy to check the lyrics for the release song, this time we got 6 so it's awesome :)

http://www.openbsd.org/lyrics.html#60a

corv · on Sept 1, 2016

That song[0] brings a smile to my face. Keep up the great work OpenBSD!

[0]: http://ftp.openbsd.org/pub/OpenBSD/songs/song60a.mp3

willvarfar · on Sept 1, 2016

One big step in this release is the mandating of W^X by default.

> "Unfortunately there is important third-party code, such as just-in-time compilers, that still uses mmap(2) to make memory both writable and executable, so for the time being, we have to arrange ourselves with it."

If a program wants to JIT on OpenBSD, how should it do it in a secure, OpenBSD-approved way?

avsm · on Sept 1, 2016

Use mprotect(2) on the region of memory that the program wants to make executable. http://man.openbsd.org/OpenBSD-current/man2/mprotect.2

This is good portable programming practise anyway...

willvarfar · on Sept 1, 2016

So they should create a writeable mmapping, write the code into it, then change it to W^X using mprotect?

How does this stop an attacker doing the same via ROP?

ADDED: The approach that comes to my mind is that they could have two processes. One process has the sourcecode, and pages where it can write code. The other process can execute the code. The code updates performance counters which the JITing process can read, so the JITter has feedback to know what to optimise.

However, this sounds a large architectural change, prevents programs JITting programs they generate on the fly, and causes the JIT to lack behind somewhat.

On the Mill CPU (disclaimer: I'm on the Mill team) the CPU can change processes ("turfs" in Mill terms) using a "portal" function call. This alleviates somewhat the performance concerns, as the JITted program can call into the JITer process synchronously and cheaply.

dmm · on Sept 1, 2016

> How does this stop an attacker doing the same via ROP?

Wait, isn't that backwards? Doesn't the use of W^X necessitate the use of ROP? Right now a JIT has lots of memory that is W&X so you just need a memory exploit to inject some code and then find a way to jump to it, no need for ROP.

But if you implement W^X this won't work because now you can't inject code with just a memory exploit, you also have to set it to executable with mprotect(2). So instead you use ROP, and inject only data which includes jumps to carefully selected sections of code in libc, etc, that implements an exploit. I mean you could probably use ROP to run mprotect, but by that point it's pointless because you're already running code on the system right?

willvarfar · on Sept 1, 2016

Forgive me for dreaming of a world where CPI or other approaches close the ROP vector too :)

DasIch · on Sept 1, 2016

It might not make such attacks impossible but it will make them more complicated and that by itself should cause some attackers to fail. Isn't that by itself worth doing?

yegortimoshenko · on Sept 1, 2016

Now, that OpenBSD runs on Xen (and, by extension, on Amazon EC2), is there an official AMI?

microcolonel · on Sept 1, 2016

If I understand correctly, this also means that the main excuse for it not being supported on DigitalOcean is now alleviated. :- )

pyritschard · on Sept 2, 2016

DO relies on KVM which OpenBSD has supported for a good while. If you are interested in something similar - including price-wise - to digital ocean with support for OpenBSD, I encourage you to try https://exoscale.ch

preetamjinka · on Sept 1, 2016

Doesn't DigitalOcean use KVM?

microcolonel · on Sept 1, 2016

I stand corrected, seems it's KVM.

Seems some other people were confused. I've been told in other places that it's Xen. OpenBSD has had working virtio-blk and virtio-net drivers since about 5.3/5.4/5.5 IIRC. hmm...

zzzcpan · on Sept 1, 2016

Talking about security, I'm hoping for a bound-checking memory-safe C compiler to eventually made it into something like OpenBSD or FreeBSD and be used by default for all ports and packages. Almost no software there would suffer from the overhead, and OpenBSD doesn't even promise or try to be very fast, it's a perfect place for it.

willvarfar · on Sept 1, 2016

If you squint at it from the right angle, Capability-Based-Addressing is basically making C memory-safe.

Lots of excellent CBA research comes from the CHERI project http://www.cl.cam.ac.uk/research/security/ctsrd/

However, they struggle to run lots of very routine C programs; its surprising how many normal programs you run happen to have been written in C and happen to rely on unions and memory unsafety etc!

For example, if you read http://www.cl.cam.ac.uk/research/security/ctsrd/pdfs/201503-... and look at Table 1, you'll see common programs and libraries such as libc and Python which cause trouble for 'memory-safe C'.

pedrocr · on Sept 1, 2016

Is a "bound-checking memory-safe C compiler" even possible in the general case without implementing a new Rust-like language?

david-given · on Sept 1, 2016

It certainly used to be in the old C89 days. I can't comment about C99 and above (because I don't know the spec well enough). I know there used to be a gcc patch which did exactly this.

The way it works is that the C89 spec defines memory in terms of objects, and it's illegal to even think about a pointer that doesn't point into an object (or to the byte after). That is, it's not just illegal to dereference it, you're not even allowed to create one.

So, because you know that every pointer must point at an object (or NULL), you encode the object and length of the object into the pointer. So pointers end up being a tuple of three words: object, object length, and offset. Now you have enough information to bounds check all accesses while still allowing pointer arithmetic within an object.

Of course, in real life, people do all kinds of wrong things which de-facto C allows but which de-jure C doesn't, such as pointer arithmetic between objects. But the spec does allow bounds checking.

(I abused this to implement a C to Perl compiler.)

Incidentally, fun fact: the 286 memory segmentation system supports mostly-bounds-checked pointer accesses in hardware.

nanolith · on Sept 1, 2016

Yes. But, you typically have to use a theorem prover to build up static checking for functions, and then enforce proof obligations on callers to said functions. If done in a system like Coq or Isabelle, the proof obligations become a parallel markup to C that is used in conjunction with the source code to enforce policies. Bounds checking is one policy -- and a relatively easy one to implement at that -- and other policies can be stacked.

Take a look at VST for an example already in the wild. I'm currently working on a somewhat different approach that does not have the severe license restrictions that CompCert has.

http://vst.cs.princeton.edu/download/

nickpsecurity · on Sept 1, 2016

That's not true. You just use a compiler transformation. See this:

https://news.ycombinator.com/item?id=12407156

Difference between memory safety and full, formal verification of correctness. You're describing the latter. Definitely check out Myreen et al's CakeML work, COGENT at NICTA, and AutoCorres/Simpl used in seL4. They might have stuff to speed up your own tool development. I wish you great luck on your project. :)

nanolith · on Sept 1, 2016

Compiler transformation can solve most of these concerns, but it is not perfect due to undecidability. Compiler transformations will always be conservative, falling back on runtime enforcement.

The main edge cases I've run into -- hence my need for building a tool like this -- is dealing with tight performance concerns found in realtime and embedded applications. Falling back to runtime enforcement is not an option, and neither is trusting developers to be able to fully analyze complex control-flow and data-flow paths without solid tooling.

For general-purpose applications, of course, compiler transformation with runtime fallbacks is perfectly acceptable.

nickpsecurity · on Sept 2, 2016

"is dealing with tight performance concerns found in realtime and embedded applications"

I thought tools like Astree and SPARK have knocked this out the park. Copilot was also pretty good on runtime side given it works in embedded. Just gotta structure your program to use the tools. I think they should cover plenty of use-cases given stuff like IRONSIDES DNS runs in SPARK.

"For general-purpose applications, of course, compiler transformation with runtime fallbacks is perfectly acceptable."

Also true for many embedded apps given the remaining runtime hit can range from single-digits to 40% depending on scheme used. Even more combined with something like SPARK or Astree for stronger, static analysis than the minimal stuff academics usually use. Certainly a subset will benefit from or even need methods you prefer but many won't. It's just management or a consumer's preference to save a few bucks shooting them in the foot. ;)

Do shoot me an email, though, in case I get some free time to run some ideas by you. I don't know enough formal methodists these days. My mile-high perspective can only let me do so much as a generalist. Gotta have specialists to help me filter the chaff from the wheat. Then pass such recommendations onto more specialists as always. :)

nanolith · on Sept 2, 2016

"Also true for many embedded apps given the remaining runtime hit can range from single-digits to 40% depending on scheme used."

I think that depends on the definition of embedded. When one is lucky enough to work with a chipset in which a 40% performance hit still results in acceptable performance, such a tool is reasonable. I can tell you from my experience in the consumer electronics field that BOM costs often rule over software performance, and as such, firmware engineers are often stuck with finding creative ways to solve problems on nerfed hardware. Wrestling with a tool that can occasionally inject unwanted runtime overhead in unpredictable locations can compound razor-thin time-to-market timelines for last minute BOM changes.

The focus for my tool is to provide people with the ability to build additional propositions that go beyond basic correctness or memory bounding found in tools like Astree. Astree, for instance, may be able to automatically discover whether a subset of C is correct -- because it has a very reasonable policy of assuming incorrect behavior unless otherwise proven -- but it can't solve for custom policies such as, "does this seemingly correct code actually follow my architecture and specification?"

Also, Astree isn't free software. It's an AbsInt product, which means most likely that it comes with a hefty price tag. To give you an idea, their CompCert compiler is licensed in the five to six figure Euro range per seat. Of course, for the sort of customers AbsInt is interested in -- companies like Airbus or Boeing -- this isn't necessarily a deal breaker. They are willing to pay for such a tool because of the mission critical aspect of their work.

But, I'm of the opinion that the industry as a whole needs access to tools like these. The tool I'm building will be released under the LGPL. The security problem is one with which the entire industry needs to engage. My tool may not necessarily be the best one out there -- nor am I trying to build the best tool -- but I am trying to build one that is "good enough" and reasonably sound. I think I have struck the right balance, but time will tell.

I wouldn't say that I'm a formal methodist as much as a software engineer looking to be able to make strong guarantees about critical components. I have a problem to solve, and I've spent a few years studying this problem off and on. Somewhere along the way, I acquired enough hubris, if not enough knowledge, to think that I could build such a tool. So far, things have worked out better than I've thought. But, I still have a few big hurdles to overcome before I can confidently say that this tool will be useful to others. INRIA and other researchers have blazed the trail, and they have rightfully charged a small fortune for what they have discovered. I'm much more interested in ensuring that the average developer has access to such technology. My tool won't have the bells and whistles of commercial offerings, but it's hard to beat the price.

nickpsecurity · on Sept 2, 2016

Alright, this sub-thread started with a person wondering if one could make C programs memory-safe against common errors. You replied with formal verification while I replied with automated tools that make C memory safe. One or two use formally-verified models. :) You countered with the needs of embedded making such tools impractical. There's many in that industry that can use tooling like I recommended, esp if not BOM-sensitive or hard real-time. You brought up the needs of ultra-constrained market in terms of cost or hard real-time. The increasing market-share of expensive ARM's vs 8-16-bitters and embedded JVM's makes me wonder if that's oversimplified, too. Makes me think similar companies might consider tooling like I described if it likewise adds benefit with tiny costs. The majority will do as you say and avoid these tools since they don't fit their use case. So will some non-embedded sectors focused only on performance or lowest cost.

Let's cut to some important stuff, though, as I think the argument got over semantics or at least less interesting stuff. My background is high-assurance security with generalist experience but none with formal methods or building static analyzers. You seem to have experience in those I don't. Appel, who makes the tool you linked, is a formal methodist (among other things) building stuff that's useful, achieves high-end stuff, and sometimes enabling less-skilled people. Rare type of researcher and deliverables I love seeing. You say you want to do something similar. So, let's have a more interesting conversation.

"Also, Astree isn't free software. It's an AbsInt product" (and CompCert)

I saw writing on wall with CompCert when they hesitated to FOSS it or answer questions. So sad such a wonderful tool got locked up. Thanks for confirming with a dollar range what I suspected. Same with Astree. We need replacements for both like you said about CompCert. My first idea was for a team to clone CompCert based on high-level description of what it does with dependent types a la Chlipala or just FLINT ML-style compiler. Add Design-by-Contract, QuickCheck, and a few other things with take little time but collectively knock out tons of errors. Compile it with MLton for development speed and CakeML for release version with equivalence checks. Be 90+% to quality of formal verification without CompCert and have a nice foundation for whoever wants to go fully formal.

Far as what formal people were doing, first I saw were trying to use a micropass approach to piecemeal build one for a MIPS machine where they spent tiny fraction of time of CompCert. Don't have link handy or know their status. Fortunately, Myreen et al's method already got used in seL4. A lot more than that actually haha. Perhaps a short-cut to CompCert replacement is to automate the process of running a C subset through such tooling with possibly guidelines for programmers a la MISRA to keep automated part feasible? Existing tools, esp AutoCorres/Simpl, should have solved hardest problems in that.

"But, I'm of the opinion that the industry as a whole needs access to tools like these." "The security problem is one with which the entire industry needs to engage"

You preaching to the choir. I was designing and preaching high-assurance security, at least for critical stuff (eg kernels, compilers), way before the Snowden leaks. The stuff mainstream, "INFOSEC professionals" said was hypothetical in risk or overkill in solution was often validated by specific attacks in leaks. ;) Odd thing is they still argue with us and dismiss high-assurance as red tape or 100% impractical. Same with things like SPARK. (rolls eyes) Anyway, I published for free a summary of high-security INFOSEC on Schneier's blog in 2013 in an argument about secure code != secure systems:

http://pastebin.com/y3PufJ0V

"but I am trying to build one that is "good enough" and reasonably sound. I think I have struck the right balance, but time will tell."

I encourage people to do exactly what you're doing. There will be no one-size-fits-all. Tools that approach these analyses balancing ease-of-use with properties proven or bugs found are best bet because it's all time-constrained developers will likely use. I'm also glad you're FOSSing it. I don't expect a community to show up as FOSS types ignore most high-security stuff. The model I'm pushing for is businesses licensing software or selling hardware leveraging high-assurance components with a percentage of revenue put into improving or maintaining them. On top of what practical, grant-funded academics contribute. FOSS is still a pre-requisite to this model working as likes of AbsInt or Alt Software keep barrier-to-entry too high with licensing costs. Medium-to-high assurance might cost money and time to develop but access to it has to be cheap and easy for uptake. Ironies of life.

"I wouldn't say that I'm a formal methodist." " Somewhere along the way, I acquired enough hubris, if not enough knowledge, to think that I could build such a tool."

Lol. That's how it starts. You'll be asking me to trust 100 pages of proof because the checker said so within 3-5 years. :P

Seriously, though, I'm glad you're not cocky as being careful & using many methods for correctness is best route. At least one always misses something. Even happened to CompCert where the spec was wrong 2-3 times but at least the code faithfully implemented the bad spec. You might dodge that if more careful & testing every part of lifecycle.

" My tool won't have the bells and whistles of commercial offerings, but it's hard to beat the price."

Good luck to you on that. While we're at it, the last time I heard something like this was someone telling me about Liquid Types analyzer for C programs (below). What do you think of it for this problem area or versus your own approach? Or if someone like me with no specialist skill should attempt to mess with it? Very hard for me to evaluate that aspect. I fear I will waste dozens to hundreds of hours on the wrong approach haha.

http://goto.ucsd.edu/csolve/

nanolith · on Sept 2, 2016

I agree with your synopsis on Pastebin. It's pretty close to what I have been trying to practice. As an interesting aside, I worked with a few researchers about a decade ago on tracking SMM vulnerabilities in various Intel chipsets. They were the experts though. I was just a hired hand. Still, I learned a lot about the very real threat of hardware vulnerabilities.

The Liquid Types approach used in CSolve -- if memory serves correctly -- makes use of an SMT solver to attempt to find counter-examples given a particular set of constraints. This is similar to the approach that Frama-C uses. SMT solvers are interesting and can do quite a bit, but they are still incomplete. Many linters have started incorporating SMT solvers. It's better than much of what is out there, but there are still significant limitations. Furthermore, the markup languages provided are often limited in the amount of customization that can be performed.

My tool is designed as a framework that can be used to build much more comprehensive proofs. Similar to what you posted in Pastebin, the security of a system is intimately tied to the architecture of a system as a whole. I want to formally verify the architecture, then demonstrate that the implementation is operating under the constraints of the architecture using a combination of equivalency and bounding proofs. This, in conjunction with a strong understanding and integration with the sorts of runtime features available in a system (i.e. MMU protections) can be used to build a compelling verified proof of security similar to what was done with seL4.

But, ultimately, even this is not enough. Few systems today are contained within a single machine. IoT has demonstrated just how complex the current software ecosystem is becoming. Security proofs must extend beyond individual applications and systems and cover heterogenous and widely distributed systems with many different potential attack vectors. This can be formally verified, but as far as I'm concerned, such a verification is an architecture-first approach. I don't think that in this sort of environment, a correctness tool is enough. It's a great addition to the toolbox, and it is a damned useful tool at that, but correct software can still violate policy unless this policy is part of the constraint of "correct".

nickpsecurity · on Sept 3, 2016

I'm mixed about the post, esp the science part. The science of developing robust software is great and pretty consistent going back decades varying mostly in specific tools and tactics. Mainstream programming just doesn't apply it although more adoption in past decade of key techniques. Here's some computer science from the 1960's-1980's used in robust and secure system development (esp Orange Book B3 or CC EAL6) people might want to copy. I'm taking an empirical route where I reference techniques that were applied to many real-world projects with lessons learned in papers or studies that were consistent. All one can do with limited data & these aren't in order of importance.

1. Formal, non-English (eg math/logical) specifications of requirements or abstract design. English is ambiguous and misreadings of it caused countless errors, even back then. CompSci researchers tried formal specs with both English as a start and precise notations (eg Z, VDM, ASM's, statecharts) for clarity in specifics. Result was many inconsistencies caught in highly assured systems and protocol specs before coding even began.

2. High assurance stuff often used mathematical (formal) verification. Whether that worked or made sense was hit and miss. More on it later. Yet, virtually all of them said there was benefit in restrictions on the specs, design, and coding style to fit the provers' limitations. Essentially, they used boring constructs that were easy to analyse and this prevented/caught problems. Don't be too clever with design or code. Wirth and Hansen applied this to language design to bake safety & comprehension in with minimal to low loss in performance.

Note: Led to Nick P's Law of Trustworthy Systems: "Tried and true beats novel or new." Always the default.

3. Dijkstra's THE project showed that modular, layered design with careful attention to interfaces (and interface checks) makes for most robust and maintainable software. Later results confirmed this where each module must fit in your head and control graph that's pretty predictable with minimal cycles prevented all kinds of local-becomes-global issues. Many systems flawless (or nearly so) in production were built this way. Dijkstra correctly noted that it was very hard to do this even for smart people and average developer might screw structuring up a lot. Solid prediction... but still worth striving for improvement here.

4. Fagan ran empirical studies at IBM that showed a regular, systematic, code review process caught many problems, even what tests missed. Turned that into formal inspections with the periodicity and prioritizing tuned per organization for right cost-benefit. Was generalized to whole SDLC by others in high robustness areas. Improved every project that used it from then on. Exactly what parameters to use is still open-ended but periodically looking for well-known flaws with reference sheet always works.

5. Testing for every feature, code-path, prior issues outside of code base, and common use-case. All of these have shown repeated benefits. There's a cut-off point for each that's still an open, research problem. However, at a minimum, usage-based testing and regression testing helped many projects achieve either zero or near-zero, user-facing defects in production. That's a very important differentiator as 100 bugs user never experiences is better than 5 that they do regularly. Mills' Cleanroom process combined simple implementation, code review, and usage-testing for insanely-high, statistically-certifiable quality even for amateur teams.

6. By around 60's-70's, it became clear that the language you choose has a significant effect on productivity, defects, maintenance, and integration. Numerous studies were run in industry and military comparing various ones. Certain languages (eg Ada) showed vastly lower defects, equal/better productivity, and great maintenance/integration in every study. Haven't seen many such studies since the 90's and most aren't constructed well to eliminate bias. However, it's grounded in science to claim that certain language choices prevent common negatives and encourage positives. So, it follows to adopt languages that make robust development easier.

7. By the 80's or 90's, it was clear that computers were better at finding certain problems in specs and code than humans. This gave rise to methodologies that put models of system or code into model-checkers and provers to show certain properties always hold (the good) or never show up (the bad). Used successfully with high-assurance safety and security critical systems with results ranging from "somewhat beneficial" to "caught stuff we'd never see or test for." Back then it was unclear how applicable it was. Recent work by Chlipala, Leroy, et al show near perfect results in practice when specs/proofs are right and much wider application than before. Lots of tooling and prior examples means this is a proven way of getting extra quality where high-stakes are worth the cost and where core functionality doesn't change often.. The CompCert C compiler, Eiffel's SCOOP concurrency scheme, and Navy team's EAL7 IPsec VPN are good examples.

8. Static analysis, aka "lightweight formal methods," were devised to deal with specialized skills and labor of above. Getting to the point, tools like Astree Analyzer or SPARK Ada can prove absence of common flaws with little to no false positives without need for mathematicians in the company. Just a half dozen of these tools by themselves found tons of vulnerabilities in real-world software that passed human review and testing. Enough said, eh?

9. Software that succeeded with testing often failed when random stuff came at it, especially malware. This led to various fault-injection methods like fuzz testing to simulate that and find breaking points. The huge number of defects, esp in file formats & protocol engines, found via this method argues for its effectiveness in improving quality. It ties in with stuff above in that well-written code that validates input at interface and preserves invariants throughout execution should simply disregard (or report) such erroneous input.

10. Interface errors themselves posed something like 80+% of problems. This was noted as far back as the 60's in Apollo project when Margaret Hamilton invented software engineering, fault-tolerance, and specification techniques to fight it. Dijkstra and Hoare pushed for pre- and post-conditions plus specific invariants to document the assumptions of code during procedure calls. Modern version is called Design by Contract in Eiffel, Ada, and numerous other languages (even asserts in C). Many deployments and tests showed such interface checks caught many issues, esp assumption violations when new code extended or modified legacy.

11. Concurrency issues caused all kinds of problems. Techniques were devised by Hansen (Concurrent Pascal) and later Meyer et al (SCOOP) to mostly immunize against them at language level with acceptable performance. Languages without that, especially Java, later got brilliant tooling that could reliably find race conditions, deadlocks, or livelocks. Use of any method inevitably found problems in production code that had escaped detection. So, using prior, proven methods to immunize against or detect common errors in concurrency is A Good Thing. Note that shared-nothing, event-driven architectures also emerged but I have less data on them outside that some (NonStop, Erlang) worked extremely well.

The above are just a few things that computer science established with supporting evidence from real-world projects so long ago that Windows didn't exist. Anyone applying these lessons got benefits in terms of code quality, security, and maintainability. The rare few applying most or all of them, mainly high assurance community, got results along lines of space shuttle control code with extremely, low defects or zero in production. So, given the past and present results of these methods every time they're put to the test, I'm irritated every time another person talks like there's no good science to quality software. I just listed a bunch of it, it's been tested in production as scientific method requires thousands of time, tweaked probably hundreds, and core approaches remained even if tactics got modified.

Now people can feel free to use and improve on the science. CompSci continues to in every area I listed with a chunk of proprietary and FOSS developers using a subset of the techniques. Just need more uptake. Use what's proven. And do note that there's plenty of examples for specific design and implementation decisions for common types of functionality. Many things that were shown to work or not work that could be encoded in libraries, DSL's, templates, whatever. No excuse except for our field's continual failure to learn and hand down the lessons from the past.

http://pastebin.com/xZ6m4T8Z

nanolith · on Sept 3, 2016

It is upon the shoulders on these giants where I stand. Dijkstra, Floyd, Hoare, Knuth, Curry, Church, and Turing, among others.

The science exists. The trick now is bringing it all together in a way that is useful enough to others that they actually start using it. :-)

nickpsecurity · on Sept 1, 2016

Yes, there's been many produced over time. Here's two that are in prototype form, one in LLVM, that could use extra attention from OSS community:

https://www.cis.upenn.edu/acg/softbound/

https://github.com/santoshn/softboundcets-3.6.1

http://safecode.cs.illinois.edu/

https://github.com/jtcriswell/safecode-llvm37

When overhead is too high, there's also tools like CPI to at least stop code injections:

http://dslab.epfl.ch/proj/cpi/

There were also safer variants of C to make rewrites for safety easier:

https://www.cs.uic.edu/pub/Main/PhDQualifyingExam/Sample4.pd...

http://www.eg.bucknell.edu/~lwittie/research/Clay.pdf

Note: Clay was used in device drivers and garbage collectors in partnership with Microsoft Research. Some Cyclone techniques were adopted in Rust.

So, the tooling has been around in different forms for a long time. OpenBSD presentation even mentioned a few. They avoid safer languages and these tools on purpose in favor of C, a review process aiming to find all vulnerabilities before hackers, and their after-the-attack mitigations for when they don't. Improving something like Softbound with checks disabled on known-good modules for performance gain would be so much better in terms of time invested and benefits gained. Plus, even average app developers could make memory safe code with same tool.

Not going to happen, though. It's not how mainstream "security" works. They default on using worst tools for the job for what seems to be political/social reasons more than anything. CompSci and private sector innovations long knocked out the technical ones. There's at least niche projects like Redox, GenodeOS, and Muen applying different aspects of best, known methods.

ape4 · on Sept 1, 2016

There's the convention where you put char stuff[0]; at the end of a struct and malloc has necessary.

octotoad · on Sept 1, 2016

Farewell sparc32. You will be missed by the retrocomputing geeks.

dmm · on Sept 1, 2016

VAX too. Looking forward all of the platforms not supported by LLVM are doomed.

rjsw · on Sept 1, 2016

Still supported by NetBSD.

MustardTiger · on Sept 1, 2016

Not supported, "supported". Big difference.

chriscappuccio · on Sept 1, 2016

Through emulation. How useful...

rjsw · on Sept 1, 2016

NetBSD/sparc runs on real hardware as well as emulators.

20yrs_no_equity · on Sept 1, 2016

I love BSD and would like to use it, but we're kinda in a Linux world.

Question: If I am building a custom hardware device (and we will be in the future, I believe) can we run OpenBSD on it using the ARM port, but also invoke binaries created for linux?

It appears that support was removed.

In the near term I need to build for Linux, But eventually we'll be able to target our own hardware, and one of our toolchain items doesn't support BSD right now (and is closed source, though we are considering an open source alternative.)

Any thoughts?

chriscappuccio · on Sept 1, 2016

Linux emulation was only complete for one port - i386. Not amd64, not armv7, nothing else. And, it was emulating a very old Linux kernel interface. What could it have possibly done that you can't do by simply compiling the program using the native toolchain?

rjsw · on Sept 1, 2016

Linux ARM binaries require the OS to map a page at the top of the address space that is shared between the kernel and userspace, you would need to check whether OpenBSD ever included this in their Linux emulation.

The support for this isn't in NetBSD either but there has been a request to add it in order to run a Citrix app.

_lwad · on Sept 2, 2016

They removed Linux emulation support in 6.0.

Johnny_Brahms · on Sept 1, 2016

For those of you who are looking at a reason to play around with openBSD, there might be some progress at getting it to run at the Raspberry pi 2 and 3: http://marc.info/?l=openbsd-cvs&m=147059203101111&w=2

Probably not going to happen, but it runs on some other ARM7 SBCs. Mine is running FreeBSD currently, but where is the geek cred in that?

peatmoss · on Sept 1, 2016

I've wondered about OpenBSD on the NTC CHIP computer. I have one that I haven't quite gotten around to doing anything with that I'd love to OpenBSD-ify.

chriscappuccio · on Sept 1, 2016

It is supported.

peatmoss · on Sept 1, 2016

Oh, well, that makes things easy then. Do you know off hand which image one uses?

nayden · on Sept 1, 2016

this is currently being worked on as I type this reply :)

Johnny_Brahms · on Sept 1, 2016

_nice_.

For my pi I want something that I don't have to think about. A colleague pwned my rpi because I hadn't updated it in ages. OpenBSD seems like a better choice...

chriscappuccio · on Sept 1, 2016

It's happening.

Theizestooke · on Sept 1, 2016

I'm trying to use the austrian mirror, ftp5.eu.openbsd.org, but I'm getting empty directories or "Permission denied" when trying to access the packages folders http://ftp5.eu.openbsd.org/ftp/pub/OpenBSD/6.0/packages/

_cx2w · on Sept 1, 2016

Have you tried ftp2? I had problems today with one of them (can't remember which worked and which didn't).

EDIT: http://ftp2.eu.openbsd.org/pub/OpenBSD/

keithpeter · on Sept 1, 2016

"...the kern.usermount sysctl is also no more. Administrators who want to let users mount devices will need to configure doas(1) for that task."

For convenient laptop use I prefer allowing my user account to mount USB drives and have that done by mouseclicks in some way.

Antoine Jacoutot's toad package is not in the 6.0 packages collection, but xfce4-mount is, so I assume I can set up the appropriate doas rule for a user and then configure xfce4-mount to ignore the local drive. I shall have a play on Sunday.

protomyth · on Sept 1, 2016

I've never really had a separate /usr/local partition, but it looks like that might not be such a bad idea given the upgrade guide: https://www.openbsd.org/faq/upgrade60.html

Esau · on Sept 1, 2016

I like having separate partitions (or slices) for everything. The guy who introduced me to UNIX did it so he could mount certain filesystems "ro" or "noexec". He also told me that partitioning can help avoid inode exhaustion but I really doubt that is an issue with modern filesystems.

I still partition with NetBSD. It just feels right; even if not necessary.

Tharkun · on Sept 1, 2016

I never run out of disk space, but I frequently run out of inodes. It's definitely still an issue. And if you don't stop to think about when you're creating your file system, it tends to bite you in the arse at a later date.

Esau · on Sept 1, 2016

Out of curiosity, can you share what platform/file system you generally use? I had assumed inodes would not be an issue with modern operating systems; especially 64-bit ones.

kazinator · on Sept 1, 2016

On traditional Unix filesystems, a fixed number of inodes is provisioned when a filesystem is created. Though of course the upper bound on that is limited by the width of the numeric type ino_t (or whatever plays that role in kernel space) have you, in practice, the actual limit is lower.

I'm looking at a filesystem I have handy here; the root FS in an Ubuntu Linux VM. dumpe2fs /dev/sda1 says that the inode count is 1146880. Just a million and something: way less than what a even a 32 bit inode number can represent.

That filesystem simply cannot have more objects than that, in total.

With this type of filesystem, you have to estimate how many objects you will ever need to store in it, not only how much total storage. If you estimate too high, you waste inodes. Suppose inodes are 128 bytes wide and you provision four billion of them. Oops, that's like half a terabyte of storage dedicated to the array of inodes!

protomyth · on Sept 1, 2016

Oh, I partition too, but I do /, /usr, /var, /var/log, /home, /tmp, /opt (yeah, yeah, I know), and then anything server specific (mail, maybe www). I just don't do the /usr/local.

Esau · on Sept 1, 2016

When I set up NetBSD, I create /, /usr, /var. /tmp, and /home. I have never create a /usr/local partition but I can see value in it is you want /usr itself to be read-only.

protomyth · on Sept 1, 2016

I do /var/log because of some bad experiences with runaway programs - samba went psycho on me in the 3.x days.

jrcii · on Sept 1, 2016

Yay! I get to spend 5 hours figuring out how to update the syntax for my pf.conf rules.

ben_bai · on Sept 1, 2016

I'm assuming you run 5.9.

There is only one entry in http://www.openbsd.org/plus60.html so I would assume you don't have to change your rules. Also pfctl -n should tell you.

"In pf.conf(5), change the parser to make af-to on pass out rules an error. This fixes a bug where a nonworking configuration could be loaded."

SSLy · on Sept 1, 2016

What? Where do you see notes about it being changed?

SixSigma · on Sept 1, 2016

[flagged]

jrcii · on Sept 1, 2016

Gmail is free, would you be happy if they updated it and your email didn't work anymore? I love OpenBSD but they constantly change the rule syntax for pf, which invalidates huge swaths of forum posts and other online documentation, even published books. The man page doesn't contain every conceivable explanation and example for every scenario either, nor could it, so that is also not a solution. Whether or not the price is free, the use is not, as it requires hours of valuable time to implement.

hackuser · on Sept 1, 2016

> Gmail is free

Perhaps this is a bit doctrinaire, but it's an important distinction:

You pay for Gmail with your privacy and personal information (you can say they are worthless to you, but you do pay). OpenBSD is truly free-as-in-beer.

You also have no end-user control or 'freedom to tinker' with Gmail. OpenBSD is free-as-in-speech, also.

pfortuny · on Sept 1, 2016

You have to ACT proactively to upgrade OpenBSD, unlike gmail.

SixSigma · on Sept 1, 2016

> Gmail is free, would you be happy if they updated it and your email didn't work anymore?

I would shrug, meh and do something else.

I've had my lifestyle made illegal a couple of times. Perspective.

chriscappuccio · on Sept 1, 2016

The last and only major PF syntax change was, what, 5 years ago now ? 4 years ago ? Any other changes are relatively minor and reflect other changes, such as features being removed or added.

nn3 · on Sept 1, 2016

It's good that they have their priorities straight. No more Linux binaries support (who need compatibility anyways?), but instead you get 5 songs sung by the project leader.

IntelMiner · on Sept 1, 2016

If you're running OpenBSD, likely you're running it for the features it provides (security and code correctness)

If you want to run Linux apps, run them on your Linux box and use your OpenBSD machine to firewall it off

chriscappuccio · on Sept 1, 2016

If your priority is to run Linux binaries under OpenBSD, you could improve the support, perhaps even implement 64-bit emulation. But since you didn't do that, and nobody else wanted to, the unused crap was removed instead. Imagine that???

olp2 · on Sept 1, 2016

Only the "Goodbye" song is sung by Theo de Raadt.

MustardTiger · on Sept 1, 2016

How dare they remove the code for a "feature" that hasn't worked in several years! The horror!