I understand why the OP likes C. I was looking at a couple of old projects of mine, and I remember the satisfaction, in at least one of them (a SASL library), of how I made various classes of error impossible using a fairly simple strategy and my own string-handling routines. In a memory-safe language with lots of assistance for threading there would never have been that pleasure. It was not amazing or anything, but it was elegant to my eyes.
I feel I have to develop an economy of effort in C because there usually isn't a dumb and easy way to do things - this pleases me.
C has a mental model that seems relatively simple - I feel I can reason about what's going to happen to a greater degree than higher level languages.
C++ is the worst IMO because it has lots of abstractions but they don't make programming simpler because you pretty well have to understand the low level aspect of what they're doing to get performance - so the complexity is much greater.
Rust seems to put a lot of difficulty up front which is of no value in the kind of typical short program I write most of the time. I just didn't enjoy it. If I had something complicated to do I might eventually appreciate it.
The more high-level the language the less one tends to understand it and that's unsatisfying.
Having said all of that though, I tend to "just write Python" whenever I can. If it's a throwaway program one doesn't need more, and if it turns out to be necessary to interface to some library or other, I know C well enough to sort that kind of thing out. I use threading... never - at least never by choice.
I'm pretty much in the same boat and I don't even like Python the language. The ecosystem and extensibility is just so insanely productive it even rivals Excel...
In re Rust: I feel one of the mistakes most people make is taking Rust to be a general purpose "default language" for everything, like C and C++ used to be. But just as you wouldn't write C++ for most stuff even if you were magically granted the ability to never ever write unsafe code, writing web services in Rust just means making your own life harder. It's a great tool, I'm happy it exists and I follow its development, but we must accept it exists in a different space than, say, Python or Ruby or JavaScript.
I feel physical pain as I say this, but most stuff should probably just be Java.
> I feel physical pain as I say this, but most stuff should probably just be Java.
This has been on my mind a lot lately, causing similar anguish.
I'd note that, from what I hear, C# would also fit the bill, but would be just as unappealing to me.
I really wish that there was a language with the productivity, ecosystem and ergonomics of Python, with the speed and efficiency of Java and C# and a type system that increases productivity rather than word count.
Go seems... fine, but, I dunno, slightly regressive. I'll probably use it where performance and scalability matter.
> I really wish that there was a language with the ~productivity~, ecosystem and ~ergonomics~ of Python, with the speed and efficiency of Java
That would be... Java :D While I'm quite a big Java fan, for the other parts some other JVM language can easily fit the bill, and you even get a choice: Scala, Kotlin, or maybe even Clojure if you don't mind dynamic types.
Yeah maybe, I hear Java has improved markedly since my bad experiences with it (admittedly a long time ago now). I should swallow my distaste and give it a fair trial. From a distance it still looks somewhat verbose and unergonomic.
The others look like really really nice languages and I'd love to use Scala for a solo project.
However I guess I need something that's a) truly general purpose, b) not going to scare away collaborators or potential hires, c) going to be around for a long time, and d) not going to require evangelization at, say, a start-up.
Kotlin... I guess I'm wary of it being the new CoffeeScript - i.e. that it spurs Java into improving to such a degree that it removes the need for Kotlin in the first place.
And yes, having just spent a long time transitioning a project from JS to TS, I'm not sure I want to start anything serious without at least half-decent static typing - a major reason I'm looking for an alternative to Python, which is just not quite there.
For what it's worth, Scala 3 is a really great language; it now even has a compiler option (-Yexplicit-nulls) to exclude nulls, making you use something like `String | Null` to denote nullable values.
Yeah, from a purely personal point of view Scala 3, F# and, maybe, Elixir seem to offer the greatest promise of productivity, safety and expressiveness.
> Kotlin... I guess I'm wary of it being the new CoffeeScript - i.e. that it spurs Java into improving to such a degree that it removes the need for Kotlin in the first place.
I think that's unlikely. The key difference is that CoffeeScript was always dependent on JavaScript because it compiled to JavaScript! Whereas Kotlin doesn't compile to Java; it compiles to JVM bytecode, the same as Java does.
Java's gotten a lot better and maybe some day in the future I'll go back to general purpose java for server-side development, but there's enough velocity increase in Kotlin that I feel more and more productive. The real victory of the language IMHO is absolute null awareness (at least for largely Kotlin codebases). This makes any sort of nullability issue perfectly clear as long as you're not '!!' (null-safe opt out) everywhere like a lunatic (or as a newb when I started). Being able to seamlessly weave java and Kotlin in the same projects has made the slow iterative migration a lot simpler as well. All the same dev toolchains/ecosystems, just a different compiler plugin.
> I really wish that there was a language with the productivity, ecosystem and ergonomics of Python, with the speed and efficiency of Java and C# and a type system that increases productivity rather than word count.
Nim has that except for the ecosystem. And it's also kind of janky, enough that I got so frustrated I stopped trying to use it. But you might have better luck than I did; check it out if you haven't.
Yeah I know lol, just trying to be honest. It's a genuinely fun language to work with; if it wasn't I wouldn't have put in so much time trying to make it work with the way I like to program. I still recommend you check it out, because people like to code in different ways, and it might fit your style of programming more than mine.
Iterators claim to be first-class but aren't. And "zero-cost" iterators that are only zero-cost if you don't value your own time spent debugging them. I've been able to crash the compiler when trying to write what in Python is idiomatic "itertools-y" code. And whenever I complain about this, someone pops up to recommend some third party iteration library. Which is part of the problem; I shouldn't need a third party library full of terrifying-looking macros just to write lazy iterators. They don't live up to what the docs promise, which is a real shame.
If they made iterators work how they seem they should work, I would come right back to Nim. It does so many things right but this one thing is a dealbreaker for me. The UFCS is practically begging me to chain lazy iterators together but actually trying it is like stepping on a rake.
What have you switched to from Nim? Have you gone back to Python and put up with the speed penalty or chosen something else?
Lazy iterators and generators are something I've missed in Rust. Haskell has them just by nature of everything being lazy and I love that. But I've become too familiar with Python, and unless runtime performance is a bottleneck, Haskell hasn't been the path of least resistance for most of my practical problems.
> What have you switched to from Nim? Have you gone back to Python and put up with the speed penalty or chosen something else?
Just Python and shell scripting. All this stuff is for hobbyist programming, not my day job, so I just shelved the stuff that needs performance and dusted off some projects that don't. I'm learning Rust at the minute but it's slower going; I'm not yet at the level where I can make the stuff I want.
I've been really enjoying Dart, I can write code slightly faster than in Python because of the type system and deep IDE support.
Unfortunately, when I use Dart, it's on Android, where the SAF exists, which kills my enjoyment, despite Android/Flutter otherwise being probably my favorite platform.
I really like Kotlin on the backend, it integrates nicely with Spring Boot, etc., but as soon as I say I have Kotlin experience, people say "Oh, so you're an android developer?"
I've never touched android development in my entire life.
Kotlin has its future assured thanks to godfather Google, which is stagnating Android's Java on purpose (only recently did they finally decide to move to a Java 11 LTS subset) while promoting Kotlin as its replacement for everything besides the core platform.
As someone who equally loves C for all the reasons outlined here - I never understand the Java hate. It's an extremely practical, pragmatic language. Not every program has to be a technological marvel. Most code exists to get stuff done, and having code that gets stuff done and ISN'T super complex is a pretty good thing, IMO.
I don't hate Java, I work with it every day and as you say it's a practical language that gets stuff done with minimal hassle.
The reason why it's hard to stomach is that it's the infrastructure of the future with the programming techniques of the past. Java the environment is like living in 2050, Java the language is like living in 1990.
Things are slowly getting better, but everything feels clunky and bureaucratic. The type system manages to be simultaneously ultra-verbose and not expressive enough to represent useful properties. It's a complete travesty that I need to codegen my own tuples and write my own Either<T1, T2> as a literal Böhm-Berarducci construction. No value types (they're coming, but not yet stable). Didn't have records until Java 16. Useless semantics for "final". Getters and setters. Nulls aren't even funny.
The language design feels at odds with the broader goal of providing a safe, efficient, solid programming environment.
It's still a fine choice for most applications, I'd venture to say it's actually the best choice, but that's because the language itself actually matters very little, unlike us nerds would like to think. But it doesn't have to be that way. I'm hopeful it'll get better.
I have found pleasure in combining "just write Python" with some compiled C code linked in and bridged via numpy for the performance-critical loops. I just love how fast I can write code in Python when I do side projects.
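A minimal sketch of that pattern, with illustrative names (the file, build command, and Python call below are my own example, not the commenter's project): the hot loop lives in a tiny C file compiled to a shared library, and Python hands it the numpy array's data pointer.

```c
/* sum_sq.c - an illustrative performance-critical loop.
 *
 * Build:  cc -O2 -shared -fPIC sum_sq.c -o libsumsq.so
 *
 * Python side (ctypes + numpy), roughly:
 *   lib = ctypes.CDLL("./libsumsq.so")
 *   lib.sum_sq.restype = ctypes.c_double
 *   lib.sum_sq.argtypes = [ctypes.POINTER(ctypes.c_double), ctypes.c_long]
 *   lib.sum_sq(xs.ctypes.data_as(ctypes.POINTER(ctypes.c_double)), len(xs))
 */

/* Sum of squares over a contiguous double array. */
double sum_sq(const double *xs, long n) {
    double acc = 0.0;
    for (long i = 0; i < n; i++)
        acc += xs[i] * xs[i];
    return acc;
}
```

The C side stays trivially simple because numpy guarantees a contiguous buffer; all the awkwardness lives in the one-time ctypes declarations on the Python side.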
Pybind11 has significant overhead that makes it non-viable for some projects, even if it has a nice API. For pysimdjson we switched back to Cython for an order-of-magnitude improvement.
Its magic function wrapping comes at a cost, trading ease of use for runtime performance. When you have a single C++ function to call that will run for a "long" time, pybind all the way. But pysimdjson tends to call a single function very quickly, and the per-call overhead is orders of magnitude higher than with Cython when you're explicit with types and signatures. Wrap a class in pybind11 and in Cython and compare the stack traces between the two; the difference is startling.
Ah yeah, that makes sense. I would rather call a single C++ function from Python that calls other C++ functions (or itself). In the case of pysimdjson, however, Cython makes much more sense.
Overall this is way better than writing everything in Rust.
I've found myself thinking, time and again: is Rust exposing to the programmer details that have traditionally been handled, to varying degrees, by sophisticated compiler optimizations? If so, might a great optimizing compiler deliver binaries as performant as a Rust expert could, but with less programmer effort?
(I can't answer that; I only haven't found a counterexample, as a one-time optimizing/parallelizing-compiler person.)
In my experience, the details Rust is exposing are less about performance and more about correctness. For example, Rust forces you to be explicit about who owns a value and when it is freed, whether a reference is allowed to mutate what it points at, whether data can be shared or sent across threads, and whether an operation can fail or produce nothing.
There are some performance gains to be had in that list, but for the most part it's about asking the programmer to be explicit and avoiding convenient-but-risky defaults that lead to lots of bugs.
> a lot of difficulty up front which is of no value in the kind of typical short program I write most of the time
Wonder what are the examples of 'short programs' where you'd just zip out raw C as a default language, and it would be simpler/faster to implement than in Rust?
The comparison is really to Python rather than C. I use C mostly when I'm trying to fix some compilation bug in a library or when I'm trying to call some glibc or other OS-level bit of code to achieve something - e.g. on a Raspberry Pi Pico.
Using C for anything is just rare for me and C++ and Rust even more so because they don't add anything that I need usually.
I love C. I'm good at C. I have extensively researched undefined behavior, how to spot it, how to avoid it. I've written loads of C, and I consider my C to be pretty darn good.
I learned Rust over the last 6 months. I'm not sure I'll ever go back to C. I still love C. But Rust is the future. I'm pretty sure of it. I don't have to worry about undefined behavior unless I opt into it, and I usually write more performant and powerful code in a shorter amount of time.
I've gone down the road of trying to bolt on checkers and verification to C to achieve a better DX when writing safe C. C compilers just aren't extensible enough right now though (Clang would require some serious upgrades to the attributes and tablegen system in order for it to work well).
Rust's compiler is already extensible today, and the language has most of that built in.
Yes. I'll leave it at this: I have serious concerns about the safety of Zig, and I refuse to use the language or anything written in it until its creator changes his approach to responding to critical security vulnerability reports.
Andrew, you've taken this road for years now and have only been rude and dismissive to me on Discord, IRC and GitHub for a while, despite my many attempts to reach common ground and discuss what happened with the DOS vulnerability I found in the standard library, one you acknowledged was unfortunate. You dismissed it saying that Zig should not be used in production until v1, but I (correctly) pointed out that won't stop people from using it in production. Now, for example, we have Bun.sh, which worries me that the standard library has other "unfortunate" vulnerabilities you have also chosen to ignore that are making their way into production.
There's clearly nothing more I can say to you; I'm tired of the emotional and childish responses to my attempts to reach out. I've expressly avoided using your name and have tried to keep my critiques civil when discussing Zig the few times I have. However, you seem to find the comments every time despite this.
I wish you and Zig the best of luck.
---
andrewrk — 04/02/2020
there's no such thing as security vulnerabilities until post-1.0, which is why nobody should be using zig in production yet
I would love some context from either parent commenter here. This is the first I've heard of security concerns with zig, though admittedly I don't use it much.
Yes, I asked Dang to remove them after a conversation with Andy, as I wanted to reconcile this with him privately. Dang said he wouldn't remove the comments but would anonymize them, which I guess results in that username.
It's not really something I want to bring up again. I was asked my opinions on Zig, I gave them. The PR might not have been the 'best' solution but the vulnerability was left unaddressed - Andrew seems to insist I misunderstood something, but has failed several times to explain why.
I hope it's since been fixed, but panicking on UTF-8 decoding errors had the potential for massive damage in my opinion.
I've experimented and liked it, but the odds of a language exploding in popularity without a massive investment from tech giants is nearly zero. It's not "may the best language win."
Title and content don't really match, do they? The article itself is basically this guy's journey with C and the steps he takes to get something other languages now build in - ironically I think it makes a stronger argument for writing everything in Rust than any 'Rust evangelism strike force' has put together.
Neat article; it's always interesting to take a peek at someone's highly idiosyncratic process.
Literally, yes. But I think most people come in expecting the author to justify the choice, otherwise this is no more interesting than "why I turned left at the intersection" or "why I have blue window shades".
If the author's reason is "because I can get the same features as Rust with much more effort", they've failed to justify the choice - hence that reading.
> so the article being an account of the guy's journey using C seems highly appropriate.
More importantly, it goes against the cargo cult narrative that the choice of programming language is the one and only factor determining memory safety, and consequently any use of languages such as C automatically leads to security issues.
It's not a coincidence that we have people in this discussion coping with that concept by spinning the use of frameworks focused on memory safety in C applications as somehow representing writing code in an entirely different programming language.
Even the author mentions that he intends to rewrite everything he ever writes in some memory safe language he's building.
Despite the author's borderline obsessive efforts to write memory-safe C code, it's interesting to note that his implementation of "bc" prominently contains a file documenting memory issues in previous versions. If that's what borderline obsessive gets you, how are we normies - who are only programming to pay the bills, who have to work with other people who also just want to finish what they're doing, or who interact with blobs of code written by other people with less exacting standards - supposed to write memory-safe code?
Yup, that's what I thought was the most interesting part of this, how many memory errors this guy has already admitted to releasing into production in this one project despite all his precautions.
If you wanna write your hobby programs in C because you think it's fun, then fine, you do you. But this seems like a pretty good indication that basically no C written by anyone can realistically be truly safe.
That was my favorite part - it was incredibly virtuous behavior to lay it all out like that. The article would not have been interesting if it had omitted those details.
The point in Rust and C# and Java and others is that memory safety is either mandatory or opt-out instead of opt-in. This means you have some guarantees about the whole ecosystem.
Of course you can augment a language with libraries. But you are one developer and everyone else will make different choices.
> This means you have some guarantees about the whole ecosystem.
Sure, you are free to make that sort of claim. But it's beside the point, isn't it?
The cliche being discussed here is C's hypothetical ties with memory-safety issues, and here we are commenting on a discussion of how someone writes C using a framework focused on memory safety, thus providing "some guarantees" about their whole ecosystem. Yet we're still seeing comments mindlessly parroting myths in spite of the facts being dangled right in front of their nose.
I feel like you are missing the point. Using glibc in avionics software is insane, as is any other application software library. In other words, it’s asinine to pretend C is somehow more reliable because it’s used in safety critical applications. There simply isn’t anything in common with the C running on your servers and the C running on an avionics package.
I don't run servers much; I mainly write for critical applications (a decade writing for geophysical airframes, telling pilots where to go and where the pylons are when 80m off the deck for millions of line kilometres, lots of other stuff going back a long way).
> Using glibc in avionics software is insane,
As I said above .. there's no need for glibc et al.
> I feel like you are missing the point.
We seem to be in agreement here.
> In other words, it’s asinine to pretend C is somehow more reliable because ...
What most people call "C" is (preprocessor) + (actual language) + (stdc function library).
What I call C (and hey, maybe that's just me) is just the language component, the "below the fold" library spec in the ANSI Std is easily disregarded .. it's more of a dated "proof of concept" of (for example) one of many ways of handling strings.
C is a nice language - the preprocessor and stdlib have issues.
> As I said above .. there's no need for glibc et al.
Christ, I wrote it as an absurd notion in the first place. This is honestly like writing "go ahead and wear underwear on your head" only to have someone repeatedly tell you that isn't necessary. Well, no shit.
Really? That read to me as a self-aware, even humble recognition of personal shortcomings. But I see communication as inherently more difficult than keeping an idea in one's own head.
C is fun - there's elegance in writing good C code and the mental model you need to understand it is much simpler than the higher level languages.
It's also fun to write data structures yourself instead of being a mere user all the time. In C you can tell yourself that your take on how to do data structures is worthwhile and unique (might be a slight stretch), but in other languages almost anything you can do is redundant.
I used to write loads of C, and recently I came back to write some C bindings to a Rust library, and it was _not_ fun. It was dead simple from the C side - mostly just a few functions that constructed or handled opaque pointers. The problem is that the error handling is atrocious, and that was so hard to present neatly. The consequence is an overly verbose API in which I've essentially manually monomorphised every result type in C. For sure, it's not idiomatic C, but it does properly handle errors, which arguably is one of the main failures of idiomatic C.
It's not just the API either. Once you've got your "nice" error-handling API, you have to manually handle resource cleanup - no branches that exit for you; every function call is accompanied by a result check and a goto to cleanup code (so your pointers need to be declared prior to any calls and initialised to NULL). Honestly, I don't know how I ever lived with this mess. It was a painful and laborious experience to get to something that was just about OK.
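The goto-cleanup pattern being described looks roughly like this minimal sketch (function and file names are illustrative): every resource pointer is initialised to NULL up front, every call is checked, and all exit paths funnel through a single cleanup label.

```c
#include <stdio.h>
#include <stdlib.h>

/* Returns 0 on success, -1 on any failure. Every acquired resource is
 * declared and NULL-initialised before the first call that can fail, so
 * the cleanup label can unconditionally release whatever was acquired. */
int process(const char *path) {
    int rc = -1;
    FILE *f = NULL;
    char *buf = NULL;

    f = fopen(path, "rb");
    if (!f)
        goto cleanup;

    buf = malloc(4096);
    if (!buf)
        goto cleanup;

    if (fread(buf, 1, 4096, f) == 0)
        goto cleanup;

    /* ... do real work with buf here ... */
    rc = 0;

cleanup:
    free(buf);        /* free(NULL) is a defined no-op */
    if (f)
        fclose(f);
    return rc;
}
```

The verbosity the commenter describes is visible even here: two resources already cost a check-and-goto per call, and the pattern grows linearly with every additional resource.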
> So once you've got your "nice" error handling API, you have to manually handle resource clean up - no branches that exit for you; every function call is accompanied by a result check and a goto to cleanup code (so your pointers need to be defined properly prior to any calls and initialised to null).
I have a stack allocator that will release everything allocated in the function on exit, and I have macros to replace my use of the return keyword that also deallocate everything in the stack allocator before returning.
So my code actually has code cleanup on return.
C does not have to be laborious. Maybe the fact that I have done things like that is part of why I like C?
Where can I get such a stack cleanup system? Also, can you call arbitrary cleanup code (this is hardware that needs cleaning up, as well as structures allocated in Rust that need to be freed in Rust)?
I use a stack allocator that knows about setjmp()/longjmp(), functions, scopes, and destructors. (Destructors are a function pointer type in my code.)
Basically, you tell the stack allocator to store a jmp_buf, then you setjmp() on it. Then keep going.
For a function, a special destructor is used as a marker. Same with scopes.
The same is also true of a setjmp() destructor, which is actually what activates longjmp().
The stack allocator should have three functions to unwind itself until it reaches the end of the next scope, function, or jmp, respectively.
Then, when you need an exception, or to exit a function or scope, you call the correct stack allocator function to clean up everything to that point. Then you return or whatever.
This can run arbitrary code on cleanup for an item if you write a "destructor" with that arbitrary code. For example, one idea I had was to use the stack allocator to ensure a mutex or other lock was always released. This would be done by storing a pointer to the mutex and writing a "destructor" that unlocks the mutex when given a pointer to the mutex. Boom. Scope-based mutex unlocking.
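I haven't seen the commenter's actual allocator, but the core mechanism described above can be sketched as a stack of (pointer, destructor) pairs plus a macro that replaces `return` (all names below are my own illustration; the real thing also tracks scope markers and jmp_bufs, which this sketch omits):

```c
/* A destructor is just a function pointer that knows how to release one item. */
typedef void (*destructor_fn)(void *);

typedef struct {
    void *item;
    destructor_fn dtor;
} cleanup_entry;

#define MAX_ENTRIES 64

typedef struct {
    cleanup_entry entries[MAX_ENTRIES];
    int top;
} cleanup_stack;

/* Register an item and the code that will release it. */
void push_cleanup(cleanup_stack *s, void *item, destructor_fn dtor) {
    if (s->top >= MAX_ENTRIES)
        return; /* a real implementation would grow the stack or abort */
    s->entries[s->top].item = item;
    s->entries[s->top].dtor = dtor;
    s->top++;
}

/* Run destructors in LIFO order for everything pushed so far. */
void unwind(cleanup_stack *s) {
    while (s->top > 0) {
        s->top--;
        s->entries[s->top].dtor(s->entries[s->top].item);
    }
}

/* Replaces the `return` keyword, as the comment describes:
 * everything registered in the function is released before returning. */
#define RETURN(stack, val) do { unwind(stack); return (val); } while (0)
```

The mutex idea from the comment falls out directly: push the mutex pointer with a destructor that calls the unlock function, and unwinding the stack gives scope-based unlocking.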
I think people use the word "simple" to mean different things. You mean "easy to do," while other people mean "easy to understand what's happening." C lovers love the second definition of simple.
“C lovers” love the appearance of understanding what happens. At the end of the day they still feed their programs to awe-inspiring optimizing compilers. Sure, they can probably explain what any snippet of code in their program does, but they can't tell what, if anything, from that snippet is executed by the machine.
You only need to know, by heart, _a_ list (even a subset) of the documented behaviour of C; you can simply refer to the standard whenever you happen upon something not in your list of known good (or known undefined) behaviour. It's entirely possible, assuming you can keep this up honestly, to write perfectly well-defined C without hitting undefined behaviour. It is equivalent to writing brainfuck in a language which is a strict superset of brainfuck. As long as you pick a Turing-complete subset of C whose semantics you know by heart, you can write perfectly safe C (the only obstacle being human fallibility). The idea that you need to learn all the documented undefined behaviours in C is a myth and is, in fact, fundamentally wrong: anything not explicitly defined by the C standard is automatically undefined, so the documented undefined behaviours are only an infinitesimal subset of the undefined behaviours in C.
If you have a mine-field with an uncountable number of mines, knowing the locations of 200 mines won't help you cross the mine-field safely. If, instead, you learn how to spot areas which are known to be safe, you can, assuming you don't make a mistake, at least attempt to cross it safely (and if you fail to cross it, you can go back and learn how to spot other areas which are known to be safe).
That is the difference between your claim that you need to know all the documented instances of UB and my claim that you just need to know enough defined behaviour to write your program.
It has nothing to do with security assessments or pentests.
Yes - like enjoying woodwork == Stockholm syndrome; why not go to IKEA??
Also, to the GP: the title matches perfectly - he points out why he believes in memory safety when using C, every time also pointing out the flaws in his thinking and why this only applies to him in his completely owned projects. Not sure what's wrong with the critics here.
Great write-up on the author's personal perspective. This part in particular I think is a good summary of why this works for him:
The next reason is that during my time with C, I have developed a custom software stack designed around memory safety.
[...]
In other words, I am not using C; I am actually using the partially memory-safe Gavin D. Howard dialect of C.
In other words, don't try this at home? I definitely should not try it at home.
> I can write C that frees memory properly...that basically doesn't suffer from memory corruption...I can do that, because I'm controlling heaven and earth in my software. It makes it very hard to compose software. Because even if you and I both know how to write memory safe C, it's very hard for us to have an interface boundary where we can agree about who does what.
What a clear way to describe the difficulty with manual memory management.
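The interface-boundary problem shows up even in a two-function toy (everything below is my own illustration, not from the linked talk): nothing in a C prototype says who owns the returned memory, so two equally common conventions are indistinguishable at the boundary.

```c
#include <stdlib.h>
#include <string.h>

struct widget { char name[32]; };

/* Convention A: return a pointer into the widget. The caller must NOT
 * free the result, and the pointer dies with the widget. */
const char *widget_name_borrow(const struct widget *w) {
    return w->name;
}

/* Convention B: return a fresh heap copy. The caller MUST free the
 * result. Nothing in the signature distinguishes this from convention A
 * (the const on A's return is a hint, but not enforced as ownership). */
char *widget_name_copy(const struct widget *w) {
    char *s = malloc(strlen(w->name) + 1);
    if (s)
        strcpy(s, w->name);
    return s;
}
```

Call `free` on A's result or forget it on B's, and you get corruption or a leak; the only defense is documentation and convention, which is exactly the "agree about who does what" problem the quote describes.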
So like Lisp, C might end up being a solo programmer’s tool too. But not because it is a breeding ground for DSLs but because everyone is adhering to different conventions in the absence of the compiler enforcing one.
I don't think the composability/interface-boundary/memory-management argument stands for C, and even less so for C++. There's nothing difficult or ambiguous about it.
Did you actually watch the linked presentation? If so, I would be curious to hear an earnest rebuttal: I very much stand by my conclusions -- and even more so after having spent the years since on Rust-based systems and C/Rust hybrids. Specifically, I have not seen a BTree implementation in C that I would consider to be composable; can you point me to one?
I did not watch the presentation, sorry. It depends on what definition of composability you have in mind, so if you share what you mean by a "composable BTree" implementation, I'd be happy to give my opinion. I ask because there's not much I can see about the composability of a B-tree implementation other than supporting multiple types, and that is certainly doable, although not as elegant as in C++.
While I appreciate your candor, you may want to know what argument, exactly, you are putatively refuting? (Specifically: the argument is not about types -- it's about ownership.) If it's not too much to ask, take two minutes to watch the linked clip. As for the composable BTree implementation, I actually am not asking for your opinion -- I'm asking for a link to the implementation itself. The details here matter (and indeed, that's the whole point).
I watched the video now and it is literally what one of the parent comments paraphrased and what I was responding to.
What you said is:
> I can write C that frees memory properly...that basically doesn't suffer from memory corruption...I can do that, because I'm controlling heaven and earth in my software. It makes it very hard to compose software. Because even if you and I both know how to write memory safe C, it's very hard for us to have an interface boundary where we can agree about who does what.
And I stand by my point: the single-responsibility principle has you covered, yes, even in C.
> I actually am not asking for your opinion --
You could show some respect.
> I'm asking for a link to the implementation itself. The details here matter
A B-tree, btw, is a silly example; it's not exactly among the hard design problems out there, and there are dozens of implementations lying around. Obvious examples to look at would be in transactional database systems implemented in C.
> (and indeed, that's the whole point).
... and with the point not being quite clear, even after I politely asked for clarification - so perhaps next time don't try to play the authority; try to provide an actual example to support your otherwise questionable claim. I'm sure the Rust community will profit from individuals like yourself.
I saw it expressed by multiple people whose language of choice was Lisp but the only one that I specifically remember is Ron Garret’s recent post called Lisping at JPL Revisited[1]:
All this is a reflection of the so-called Lisp curse, the fundamental problem with Lisp -- its great strength is simultaneously its great weakness. It is super-simple to customize Lisp to suit your personal tastes, and so everyone does, and so you end up with a fragmented ecosystem of little sub-languages, not all of which (to put it mildly) are particularly well designed.
He doesn’t specifically say here though that this makes Lisp a solo tool. I can’t find a source for that right now unfortunately.
Lisp is the most powerful and elegant language. That is a problem because you're giving weak, fallible monkeys phenomenal cosmic powers. Now, many monkeys are well-behaved and disciplined, so they create reasonable, pleasant codebases. But some monkeys go bananas.
I also like to program in C; the fundamental error is not programming in C (or liking C), but thinking that you’re special or uniquely able to use C safely.
The author adequately defends their predilection for C, which was never doubted; they don’t produce a great argument for why C is good (which, perhaps, they didn’t intend to). Most of the defensive programming techniques covered in the post reflect this: asserts, for example, only catch the bugs you know about.
Exactly. Objectively, there's no such thing as a disciplined and effective C programmer. Everybody makes mistakes. You can't guarantee that you won't. Believing that you can is in itself dangerous. It's delusional. Experienced programmers know they make mistakes and even anticipate that they will. My attitude these days is "I wonder where I messed up; let's find out!". Basically, like a good scientist, I try to falsify the hypothesis that I messed things up. Only when I start failing to do so do I move on. I make all sorts of silly mistakes: off-by-one errors, logic errors, inverted conditions, etc. You name it, I do it.
There are all sorts of valid arguments for not immediately porting the huge legacy of C/C++ code bases to something less likely to blow up in your face. However, the number of valid arguments for starting new code bases in those languages is rapidly declining. I'm sure there are some corners where you just have no other valid choice, and that's fine. But increasingly it's just a matter of people being stubborn, rigid, and unwilling. Half the success is just understanding your own limitations and mitigating them. The rest is just dealing with the inevitability of people messing up.
I have yet to touch Rust, as I am currently busy with other things and my work is in other computer languages. And mostly gluing modules together.
But yes, I am one of those weirdos who use C for their personal projects, and yes, I find it "fun" to fire up the debugger to look for my own mistakes, to watch my code run an order of magnitude slower through Valgrind to find my memory errors. And yes, I know my code will be riddled with UB and other kinds of "fun" errors, but again, that is expected because (well) it is C. And I do it on my commute while ssh-ing with my phone.
And while I am very impressed by the progress of Rust over the years, there are two aspects that kind of put me off: the community comes across as very preachy, and the language sounds very close to Ada (as in "not fun to work with"). So far, neither impression has been disproven.
So maybe the day I have to write life-critical software as a hobby I would consider writing it in Rust. Or maybe, in order for Rust to succeed, it would have to make the other languages far less fun.
Rust is actually really fun to work with. Don't let random naysayers on hn convince you otherwise, try it for a month or three and form your own opinion.
There is something magical about writing code that is as fast as C++ but with a fraction of the bugs and effort.
About the fun aspect of using Rust. Once I learned it, nothing feels as fun as Rust for two reasons - when I'm done writing the program it almost always works first try and second, when it runs I know this is the fastest possible program that I could have written.
It takes a while to get to the point where writing Rust feels easy, but when you do it's so much fun. That's why Rust users love the language so much.
To elaborate on the 2nd part: idiomatic Rust is usually slower than the absolute fastest program you could have written. Making sure you are not allocating (usually with respect to Strings and Vecs) in the hot path often takes a non-trivial amount of work compared to the naive solution (for example, fighting the borrow checker instead of cloning, or allocating your Vecs up front and passing results through a mut ref parameter rather than a return value), and it also makes your code uglier.
That said, writing an ultra-performant program in Rust is easier than in any other language in existence right now.
>This means that I get many of the advantages of Rust
Arguably, you don't. The big difference is whether your compiler enforces and tracks your documentation and the assumptions used by your code, which are often critically important for correctness. Unfortunately, Rust is far from perfect in this regard (e.g. SPARK probably does a better job), but it significantly improves on C/C++.
>I Work Alone
Unfortunately, even if you are the only soul touching the code, it often quickly becomes false. On a big enough project, "alone" inevitably becomes "me and that guy who wrote this part of the code". This also can be formulated as "me and 2-years-ago-me". In other words, we quickly forget a lot of context used for writing a piece of code.
>Rust is too big for my small brain, unfortunately.
I think the lack of hubris, the understanding that small human brains are not enough to keep track of all the context required for writing correct software, is the key element for liking Rust and similar languages. Rust is not about building ivory towers of abstraction for their own sake (though some people try to walk in that direction), but about offloading as much of the context as possible onto the compiler. Unlike humans, the compiler does not get tired and does not forget. Yes, it's limited, and because of those limitations it requires us to restructure our code and sprinkle it with additional information, but more often than not that's a good thing.
As seL4 and other formally verified software demonstrate, it absolutely can be done in C. But one could say that the C used in seL4 is not the C language anymore. After all, you cannot take a random C library and use it in seL4.
> When it comes to the enjoyment of programming, I hate all of them. With a passion.
> Except for C.
I get the sense that this is the real reason, and the rest of it is justification that they understand the risks involved but choose to do it anyway. And honestly, I think that's fine? I'd argue that it's a little inconsistent with their statement above that avoiding C/C++ in favor of memory-safe languages should be the default, but they subsequently admit to being hypocritical about this, so I'm not really concerned that they don't understand the benefits of memory safety. When it comes down to it, I have middling confidence that I could convince someone who at this point still doubts that Rust is safer than C, but I have absolutely no illusions that I can convince someone to change a personal preference. If anything, it's a bit refreshing to see it freely admitted, given how often it seems to be the real reason for a lot of choices of what language to use (and I don't just mean this for C/C++, but for any language, including Rust).
I think he's fairly transparent that it is the only reason. "I program in C because I enjoy it even though I know objectively it is not a terribly good choice for building software" is pretty much the entire article.
There are 2 things I've obsessed about in C & C++ land:
1) Why are we explicitly directing a loop forward or backward most of the time when it's not necessary? The compiler should direct the loop unless we explicitly need a particular order for the body of the loop. My thinking is that there may be performance reasons to reverse a loop or even split it into separate threads. (unlikely)
2) In C++, I've always wondered how template types get instantiated for POD that is represented the same at the bit level. Does the compiler duplicate the templated code or is it smart enough to say "this std::vector<int> is the same as that std::vector<intalias>". Rust prides itself on zero-cost abstractions, I have wondered if they reduce binary bloat in this area. (as a naive C++ person)
1) The compiler is already allowed to reorder loop evaluation if it can prove that it can't have any observable side effects. If it can't, there could be subtle dependencies on the order of evaluation that the person writing the code and tests wasn't aware of, so you're just asking for subtle bugs that are missed by tests and could even turn into vulnerabilities in production.
2) It duplicates, and the standard even comes with asinine "guarantees" that actively sabotage deduplication, such as requiring all functions to have different addresses. Which means the compiler can reuse the same function, but only one function label can start right at the code and the others need to NOP-slide or JMP into it. Which unduly pessimizes precisely those functions that are most likely candidates for deduplication: Those that themselves only consist of 2-3 ALU instructions.
No, (2) is not correct. Compilers have been inlining functions for decades, so the argument that "all functions have different addresses" does not hold. The compiler can even _merge_ two different but similar-looking functions into a single one. And that's just part of the code transformations compilers can do.
And this goes without even mentioning LTO, which has been explicitly designed for optimizing the code layout and size during the linking phase.
So, code deduplication is a real thing and happens regularly with your code. Templates are not much different from non-templated code written for the same purpose.
If you take the address of different functions, the standard requires those to compare unequal.
However if you merely call functions and never take their address, the standard doesn't put any requirements on that, so inlining etc. is legal.
But without LTO, it's impossible for the compiler to know whether a non-inline function with external linkage will be called or have its address taken, so the compiler must ensure that it has a unique address.
There are linker level optimizations aside outside of LTO that can merge functions. gold's --icf option comes to mind, with --icf=safe meant to be conforming.
All that sounds ... reasonable? What I didn't agree with is the "requiring all functions to have different addresses" argument. It's invalidated both by the compiler (e.g. inlining) and linker (e.g. code folding).
If it can prove that nothing can observe the address of the function(s). Inlining renders the whole discussion moot.
The point stands. Compilers cannot merge equivalent functions in cases when it makes sense to even think about this optimization, which is when it actually has to export an externally visible symbol for the function.
And that's what happens in probably 99% of cases. I objdump the code quite often to understand what happens under the hood, and I rarely see similar code being duplicated. There could of course be counterexamples, but I just didn't agree with the "standard ... actively sabotage[s] deduplication" sentiment, which to me reads as something universal rather than an exception.
I think active sabotage is a correct assessment when a simple, obvious optimization is explicitly prohibited for no (defensible) reason and can only be applied by extensive whole-program analysis that allows the nonsensical rule to be bypassed completely. It's still sabotage, it's just mitigated by the extremely smart compilers we have nowadays that basically pick the program apart and rewrite a functionally equivalent one.
In regards to #1, I mean we shouldn't be asked to explicitly direct the iteration of the loop. We should say "loop over these things" rather than "start from 1 and stop at 10". MOST of the time we're explicitly stating these things - that the compiler can undo - so what's the point? If the compiler is doing whatever it likes, why are we expending thought on which direction a loop should iterate?
I love that Rust has eliminated the egoism around "you should know at all times when something is allocated or deallocated". We're trusting the compiler to do that better with zero cost to us. So take it to the extreme: Only specify the loop direction if you have a true requirement to do so.
This is my Ted Talk, thank you.
For #2 - LOL! I had no idea it was essentially aliasing those functions to get around a requirement for uniqueness.
1 isn't really the same thing. I don't know any compiler that would turn
for (i = 0; i < length; i++)
into
for (i = length; i != 0; i--)
Which is what I read 1 to be.
As for why you sometimes do one over the other. I think it is true that computers branch on zero more easily than other values. That said, I'd be a little surprised to know it makes a difference that can be measured in any meaningful program.
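For concreteness, a small C sketch of the two equivalent shapes (a sum stands in for the loop body): the count-down form ends on a compare-with-zero, which on many ISAs falls out of the decrement for free, though as noted the difference is rarely measurable.

```c
#include <stddef.h>

/* Forward: the natural way to write it. */
long sum_fwd(const int *a, size_t n) {
    long s = 0;
    for (size_t i = 0; i < n; i++)
        s += a[i];
    return s;
}

/* Backward: counts down to zero, so the loop branch can test the
 * flags already set by the decrement instead of a separate compare. */
long sum_bwd(const int *a, size_t n) {
    long s = 0;
    for (size_t i = n; i != 0; i--)
        s += a[i - 1];
    return s;
}
```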
See this is why I felt I was obsessing over something very small.
Loop inversion, unrolling, and splitting. The compiler can do so much with a loop to make it run faster on 'this machine' to optimize for speed, memory use, or binary size.
I couldn't quickly figure out what sort of applications this person writes for a living. But for 95% of applications I can think of: whatever, you do you.
Lots of people have pointed out flaws in his arguments here. There are always at least two collaborators on a codebase: you, and you in two years' time. You can enforce your own documentation best practices, but it's always possible to make mistakes unless you have a provably correct way of enforcing those best practices. I'm sure there are other very valid criticisms.
But in the end, there's a good chance that 100% guaranteed memory safety actually doesn't matter for the software they write. If you told me this person writes software for pacemakers or ABS systems or something similarly safety-critical, I'd take issue. But beyond that it's a sliding scale from merely irresponsible to an inconvenience.
Reading the section on how the author just likes C much more than other languages, I was going to comment he should try building his own C-inspired language that he feels comfortable with but fixes some of C's flaws. But it turns out that's what he's already doing: https://gavinhoward.com/2018/08/pl-design-1-principles/
I like C better than every other language... for some of the same reasons. Mind, I'd prefer assembly but it's inconsistent between different versions of the same processor, let alone different processors ...
I CAN code in a lot of languages. The only one I have no interest in ever revisiting is PHP. I rather dislike java's bytecode but the language itself is somewhat ok.
But I prefer C. Not just due to "bare metal", but also because it's fairly easy to communicate with other hardware.
I'll note - when multiple different hardware is involved, you'll never get "safe code". Not unless the hardware is "safe" too. I think the thing I'm wary about rust about is how much its claims depend on the underlying hardware being consistent, reliable and stable.
But - I'll use whatever tools employers or contractors need. I think the hardest language for me is C++ - as above - because it's so "flexible" (and hides so much under the water) that it takes rather a lot of investment to learn every new stack.
I use C on my own projects. I've yet to see a good argument to explore another option and still maintain consistency with communicating with disparate hardware.
I stayed away from C++ for about 15 years, while working in higher level languages (C#, Go). I recently came back to it and my productivity is SOOO much higher, along with my code quality.
I argue that for certain applications, the ability to work directly with memory is paramount. I shaved 35ns off a critical routine the other day with a clever memory hack. And in another example, I doubled the speed of a critical loop simply by rearranging variables within a struct so that the CPU had more efficient access to them, avoiding cache misses.
Mind you, this code already runs hundreds of times faster than anything written in Go or any other garbage collection language can do.
> Rust is too big for my small brain, unfortunately.
Rust is harder because it imposes constraints on the programmer in order to enforce memory safety. These constraints don't magically disappear if you are trying to write memory safe code in C.
The difference is that in C you can use any kind of valid constraints or invariants to prove safety (though unfortunately these live only in your mind or documentation and are not verified by the compiler), while in Rust you have to fit the specific model of constraints the language uses.
The constraints imposed by Rust are more difficult to deal with though. There are so many cases where I know for sure 100% that something is completely safe, but I have to spend a whole lot of brain power trying to come up with a way to convince Rust that it's completely safe which doesn't just involve unnecessarily boxing a value.
We've all been there – you know for sure something is 100% safe, and then the fuzzer finds UB that was outside your mental model's bounds...
Thing is, if some system/concept/abstraction/model is memory-safe, more often than not it translates directly to Rust concepts via lifetimes and the like. Once you've done it a number of times, you will find it very easy to map the former to the latter. In the rare cases when it doesn't fully translate, you can always dip into unsafe (usually at the lowest levels of your abstractions), same as Rust's standard library does, and test/fuzz that part extensively.
I am not against the concept of safety guarantees.
We have all been there where we have written code based on a faulty mental model which resulted in bugs. Yes. And more often than not, Rust would have prevented those bugs or made them less severe. But we have also all been there where we have had a solid mental model of what a system is doing and written correct code.
Rust helps in the former case, but it makes the latter case harder.
And no, it's not the case that "more often than not", if you need one const reference and one non-const reference to some data, your code is wrong. In fact, almost all code in almost all mainstream languages ends up with multiple references where at least some are non-const, and it's fine most of the time. In Rust, all of those situations require type-system and borrow-checker gymnastics.
I'm not against Rust. I think it's a fine language. I also think it's significantly harder to write than most other languages if you don't want to just punt and use Rc<RefCell<T>> all over the place.
Rust can't encode non-linear lifetimes at compile time. This rules out anything where ownership or access patterns can be self-referential or "go up" statically. That includes things like intrusive data structures (linked lists, graphs) and concurrent/callback-based control flow. Rust doesn't have the constructs to model these, so those constraints do in fact "disappear" (at least statically) as far as both languages are concerned.
The good thing about C is that, while its memory management is unsafe, it is at least explicit; there's generally no doubt as to whether a statement allocates memory or not, since "malloc" and "free" have to be written out explicitly in the code.
The whole bulleted list from "Custom Memory Safety"... this makes me want to run away towards the horizon and scream.
In the 90's it was very common that big shops that did some programming would have their own languages, which continued into the 00's in the form of everyone rolling their own C++ standard library. I had the "pleasure" to work with both. Invariably, this stuff was so much worse than off-the-shelf stuff and yet the authors believed themselves to be geniuses.
I mean, there's a chance this guy did something outstanding and actually improved something about C, but so far I've seen so many fail to do that in such a predictable patterned way I just don't want to see any more of that.
My favorite C code is the kind that has no macros, as few typedefs as possible, and no acrobatics with alloca() or any platform-dependent stuff. I think FIO is a good example of what I'm talking about.
The whole thing just sounds like "yeah, it's a swamp, but it's my swamp and I like my swamp!" kind of thing.
> (...) big shops that did some programming would have their own languages, which continued into the 00's in the form of everyone rolling their own C++ standard library.
It sounds like you're talking about using libraries as if it was a bad thing.
> had the "pleasure" to work with both. Invariably, this stuff was so much worse than off-the-shelf stuff (...)
Where do you think "off-the-shelf" stuff comes from? It sounds like you're trying to misrepresent the adoption of subjectively sub-optimal libraries as somehow an issue caused by using libraries and frameworks.
> mean, there's a chance this guy did something outstanding and actually improved something about C, but so far I've seen so many fail to do that in such a predictable patterned way I just don't want to see any more of that.
You're showing a tremendously naive and misguided belief that libraries are somehow bad.
Languages such as Rust flourish with the adoption of third-party libraries, some of them euphemistically described as "not stable". Languages such as C++ flourish with third-party components like Boost and POCO. But applying the same principle to a language like C warrants an automatic and mindless putdown, like blindly accusing anything of being bad? It makes no sense.
A more generous interpretation would be that if you have a communal library, rather than everyone having their own individual libraries, it's easier to vet it for correctness. Rust takes the communal approach. The parent was suggesting that every company has an insular library approach. NIH.
Many people that go off the beaten track often fall into the same ditch.
Correctness, yes, but what about other requirements people have, such as performance, extended functionality, domain-specific optimizations, predictability, etc.?
Third-party libraries were not born out of thin air but because different parties have different, and very often disjoint, requirements, so it's not particularly NIH syndrome.
There is a huge gap between something like boost that is designed to be reused and internal libraries that are subject to the usual corporate constraints (the least amount of work is done if even that).
> There is a huge gap between something like boost that is designed to be reused and internal libraries that are subject to the usual corporate constraints (the least amount of work is done if even that)
You're somehow trying to imply that libraries are not reusable if they are not widely shared, which makes no sense at all.
The best argument you could make is regarding stability, but you simply cannot lay any such claim just from the library's licensing.
Accusing internal libraries of being half-baked, badly designed, or bug-riddled is just a cheap blanket putdown.
> It sounds like you're talking about using libraries as if it was a bad thing.
I don't see the relationship... how do you make such a leap?
> Where do you think "off-the-shelf" stuff comes from?
I worked on proprietary projects that eventually went public domain, so I can tell you where off-the-shelf stuff comes from from personal experience. Proprietary in-house tools have the benefit of not having to deal with many things an off-the-shelf tool would have to deal with, because their authors can choose the kind of hardware to run on, the supporting libraries, the versions and combinations to use, the developer tools to support, and so on. Off-the-shelf tools, in order to be successful, must cover a much wider area of the ecosystem to be relevant.
Here to give you a better example: C doesn't have its own concurrency primitives, no concurrency model etc. But, you could build an extension, let's call it CC to have some sort of concurrency based on pthreads library. If, for your internal needs, you only use CC on Linux -- you are golden and everything works fine. Once you try to make CC into off-the-shelf product you have to do something about pthreads missing from other platforms.
Off-the-shelf products are typically born as in-house products, but they need an extra step to become what their name implies.
In the specific case of what OP described as their own extension to C, I can vividly imagine a language that doesn't cut it as an off-the-shelf one. Making an off-the-shelf language would also have pushed OP toward a much larger departure from C, because memory safety is only one of the language's big problems. And this is what happened to a bunch of other languages which have already walked a big chunk of that departure journey. This is why I would have been frustrated having to use a language like OP's: it would have felt like not enough change, and too much headache to adapt to my target environment compared to using actual off-the-shelf languages.
> You're showing a tremendously naive and misguided belief that libraries are somehow bad.
I have no idea where you get this from. If anything, I say the exact opposite...
C++ is the F-35. Much touted. Expensive to maintain. Expensive to get rid of. Doesn't quite live up to the hype. Borland C++ and Visual C++: never the twain shall meet. They might as well have been entirely different languages.
Given all the software problems that plagued the F-35 throughout its development, including mid-flight avionics reboots, I don't think Bjarne Stroustrup realizes that it isn't a good idea to advertise it.
> My favorite C code is the one that has no macros, as few of typedefs as possible, no acrobatics with alloca() or any platform-dependent stuff.
You've just outlined a perfectly good subset of C. I'm pretty sure people who write C extensively, even to this day, mostly end up there. The beauty of using C these days is that one gets to control every single byte without jumping through hoops. Also, any C code can be reliably integrated into other languages using FFI, which is quite universal. So, yeah, no need to create dirty hacks.
> The beauty of using C these days is that one gets to control every single byte
No, not even close: Struct padding, compiling switch as if-else vs. jump, implicit arithmetic promotion to int, stack layout of local variables, and many other issues.
If you want to control every byte, write in assembly language.
Nit: struct padding and arithmetic promotion don't support this point. In practice, C compilers provide struct-packing extensions that cover the uncommon cases where default padding wouldn't yield the layout you want. And they'll let you opt-in to a style where arithmetic promotions are forced to be explicit as they are in e.g. Go. So it would usually be ridiculous to drop down to assembly for either of these reasons today.
Well put. Another way to think about it: every major OS and systems vendor has had some custom “safe” C variant over the years, to empirically mediocre results. It's unlikely that this author has done better; it’s much more likely they simply haven’t received the same scrutiny.
Software developers tend to love things that give them challenges and material for mental workouts, and it's fine.
However, it has nothing to do with commercial software development, where system architects should think not only about the "coolness" of a technology but also about the people who will support the final product, and where team leaders think about how hard it would be to hire people who know how (and are willing!) to work with said "cool technology". Since we are talking about statements like "I have developed a custom software stack designed around memory safety", i.e. a non-standard solution with no documentation (while there are alternatives where memory safety is assumed as a prerequisite, not achieved as an implementation artifact), the answer is, actually, "close to zero".
I wrote C and classic C++ for 17 years, and I admit it is fun and challenging to design and implement systems with these tools. However, when my team has a library that we're expected to develop and maintain in C, I rewrite it in modern C++17 (I don't say "in Rust" precisely because of that second reason, but modern C++ is safer than old C by an order of magnitude), and the process rarely takes more than a week. As a result, the source code usually becomes about 10 times smaller, the implementation becomes faster about 80% of the time, and the team is spared wasting their time exploring undocumented solutions written decades ago.
> The Rule Of 2 is: Pick no more than 2 of untrustworthy inputs; unsafe implementation language; and high privilege.
I do know of a few C programmers I'd trust: DJB, many of the OpenBSD team, the Dovecot maintainers, and a few others with long track records of security.
But I don't trust myself because I've used fuzzers on my Rust code, trying billions of inputs. And I've found DDOS bugs that would have been potential exploits in C.
What's more damning, the most careful C code I ever wrote has an enormous, sneaky test suite. It was tested with every sanitizer I could find. It used carefully designed error handling conventions. Still, in the last 20 years, it has been the subject of several CVEs. You see, I relied on a high-quality 3rd party XML parser, and that parser had a handful of bugs.
Out of 7 billion people on this planet, the number that I'd personally trust to reliably write CVE-free C code is in the low triple digits. I'm not one of them.
Understanding Rust is a cakewalk compared to understanding "undefined behavior" in the C standard, or to making sure a large C program never overflows an addition, or accesses memory out of bounds. But Rust is not the only option. Use Go or Java or Ada SPARK or a functional language or WUFFS. Any are fine.
As an industry, we need to stop making the same endless security mistakes. It's not OK.
> You see, I relied on a high-quality 3rd party XML parser, and that parser had a handful of bugs.
At that point it's hardly the fault of C. Even in supposedly "memory safe" languages like Java, XML (and json) parsers are often riddled with bugs and design problems that can be exploited for remote execution. It's a joke, but these parsers are still somehow considered "best in class" because many supposed "programmers" are willing to trade security for the ability to write annotations.
I'd say that what you describe in your software is exactly what i like about C. You know it's dangerous, so you take precautions. You then discover that those precautions maybe aren't enough and you go looking for something more stringent (like rust). There's learning there. If we cargo cult Rust as "what people should learn instead of C because it's secure" bad programmers will go write poorly designed rust code that's as unsafe (if not more so) as any C code, except now they won't be careful because "the language is safe".
>I'd say that what you describe in your software is exactly what i like about C. You know it's dangerous, so you take precautions. You then discover that those precautions maybe aren't enough and you go looking for something more stringent (like rust). There's learning there. If we cargo cult Rust as "what people should learn instead of C because it's secure" bad programmers will go write poorly designed rust code that's as unsafe (if not more so) as any C code, except now they won't be careful because "the language is safe".
What level of cope is this? "Bad language is better because we know its bad so we become better!" Every time someone argues for C or C++ I replace them with assembly and C respectively. The argument makes about the same amount of sense.
I don’t think that’s what they’re saying. They’re saying that there is utility in understanding what you’re getting with a memory-safe language like Rust. Why would a programmer care if they’re getting memory safety and thread safety if they don’t know what unsafeness looks like? It’s the same reason we teach history in school; knowing where we came from gives us a better perspective of where we are today.
They can read the Rustonomicon. Done. They can also write C or Unsafe Rust in order to learn about what regular Rust gives them.
But none of that is an argument for learning C and then stumbling into a safer language afterwards, since then you have to learn (say) Rust and also unlearn your C habits (or whatever unsafe-language habits you picked up).[1]
>It’s the same reason we teach history in school; knowing where we came from gives us a better perspective of where we are today.
What we learn from history is mostly "holy shit that was fucking awful thank god we stopped doing that!". Except for programmers it seems. I have no objection to learning or using C but we don't ride horse caravans around anymore.
> Even in supposedly "memory safe" languages like Java, XML (and json) parsers are often riddled with bugs and design problems that can be exploited for remote execution
The huge benefit of having a strict compiler like Rust's is that it massively raises the quality floor for every single library in the ecosystem. This is especially true if a library doesn't use unsafe (which is true of many Rust libraries including the most popular XML library https://lib.rs/crates/quick-xml).
> bad programmers will go write poorly designed rust code that's as unsafe (if not more so) as any C code, except now they won't be careful because "the language is safe".
The whole point of the language being safe is that you can't write code as bad as C code (e.g. code containing RCE vulnerabilities) without going massively out of your way by using an `unsafe` block or doing something obviously stupid like passing an unsanitised string to `exec`. It won't compile. I would absolutely trust carelessly coded safe Rust code over carefully coded C (unless that carefulness is taken to the extreme as in MISRA-C or similar).
> Even in supposedly "memory safe" languages like Java, XML (and json) parsers are often riddled with bugs and design problems that can be exploited for remote execution.
Why do we think this is OK, even for a moment? For how many decades will we accept massive vulnerabilities everywhere? We can't rewrite everything, but why isn't Chrome's "Rule of Two" a minimum professional standard for new code?
This isn't just a Rust thing. For example, why don't more languages allow me to say, "This XML parser is forbidden from talking to the network or file system"? Why could log4j load code without being granted special permission?
Because that is a lot easier said than done, all told? As we get faster and faster machines with more and more spare cycles, we can probably devote a lot to this sort of thing. Maybe.
And, really, this is no different than anything else. Just in an odd "reverse" direction. Let's say someone makes brake pads for your car that have an antenna and compute on them. How would you keep them from communicating your location to anyone else? You can work on detection, of course, but for the VAST majority of vehicles out there, this would be silly at that point.
Same goes for why you use locks that can typically be unlocked with a toothpick in bathrooms. Privacy locks are vastly different from security locks. Even though they look roughly the same on paper. And this is clearly ignoring every other thing that makes you vulnerable outside. Why do we allow satellite imagery of people in their backyard? Mainly because we don't have a realistic way of disallowing it.
I think there is an opportunity here for language design. I should be able to import a library and mark it as nothing but computation, or as network only no file system, or file system only but no network, and have the language itself enforce that guarantee. I believe it has to be at the language/compilation level as executable segments can always just trap into the OS.
If there were some syntax, then you could even specify the particular network endpoints or file system paths allowed.
It is sort of old-fashioned, a kind of dynamic scoping of access context, but it would be better than calling a shared library that starts up threads with network connections, all invisible to you.
There would be some notion of the main function's permissions, and spawning threads or green threads would also require specifying what each thread can do: a subset of what the creating thread can do.
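To make the idea concrete, here is a sketch in plain C of what capability-passing looks like without language enforcement. Everything here (the `caps` type and function names are mine) is pure convention: nothing stops a library from calling `fopen` itself, which is exactly the gap that language-level enforcement would close.

```c
#include <stdbool.h>
#include <stdio.h>

/* A capability set handed explicitly to any code that needs access. */
typedef struct { bool net; bool fs; } caps;

/* A "computation only" library function needs no caps at all. */
int left_pad_len(int len, int min) { return len < min ? min : len; }

/* A filesystem-using function must be handed the capability. */
bool save_report(caps c, const char *path, const char *text) {
    if (!c.fs) return false;            /* no filesystem capability */
    FILE *f = fopen(path, "w");
    if (!f) return false;
    fputs(text, f);
    return fclose(f) == 0;
}

/* Deriving the weaker capability set for a worker thread:
   a subset of what the creating thread holds. */
caps fs_only(caps parent) {
    return (caps){ .net = false, .fs = parent.fs };
}
```

A language-level version would make `caps` unforgeable and check it at compile time; here it is merely a discipline.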
> I should be able to import a library and mark it as nothing but computation, or as network only no file system, or file system only but no network, and have the language itself enforce that guarantee.
There is no sheriff to enforce this guarantee. There's nobody come to save you (us) from ourselves. You can try to build walls around things, but meanwhile other people are building ladders (or digging tunnels).
It gets worse though. Virtuality, whether via literal VMs or COWbuilder-style containers, means that defining what it means to access certain resources is hard to do. For the most part, this resolves nicely (i.e. you almost always have less significant access to "the real world" than you might believe), but it's not trivial to determine.
It's funny to read these threads of people saying how nice it would be to have something, or that it's impossible, when those same things have been in Haskell since forever.
So tell me, I run a Haskell program inside a VM and it accesses the root of the filesystem. What happens? It accesses the network interface, what happens?
The point is that the code, at runtime, cannot determine what context it is running in, and it may be entirely appropriate for it to do certain things, or highly inappropriate, depending on that context.
Yes, this, exactly. In our current operating systems models, the process is the most granular resource that we can confine. The OS model doesn't allow for more fine-grained resource control than that, so code with different authorization scopes should run in different processes.
Yeah, the replies all sort of show ways of wedging this in, but why not add it to the language/compiler? It would be very useful, and also interesting to see attacks against it. It would also be interesting to see how this sort of dynamic scoped feature would be implemented these days.
(In old LISP stuff, you used to be able to do like:
That sort of redirection will absolutely be possible, but that won't be the language-level thing my language will support.
In my language, there are packages. You might have a left-pad package, for example. (Okay, it's a bad example, but whatever.) The programmer/user gives permissions to packages at the package level, so you might give the left-pad no permissions at all except for allocating memory (for generating a formatted string), which means something else will need to do the output.
These permissions can include anything, like being able to use other packages, network access, filesystem access, etc. Static dependencies will be the first line of defense, but if something needs dynamic access, say access to specific files such as `/dev/urandom` or `$HOME/.config/<program>/config.txt`, there will be a dynamic way to give permission.
This dynamic way will probably serve a lot of the use case you suggest.
This will be at the compiler IR level, by the way. The build system of dependencies might otherwise be able to get around it. But with IR, you can tell the dependency's build system to generate the LLVM-like IR for its code, and its build is done and can have no more influence.
Then without using any of the dependency's code, you can then specialize the IR while inserting permission checks.
You can even use this to generate cross-platform "binaries". The "binaries" would just be a tarball of IR. Most IR would be used for any platform, but some would be platform-specific. If you include all of the platform-specific IR with the platform-independent stuff, you can then ship that tarball to any machine, which, on trying to run it for the first time, will realize it needs to be specialized for that machine. This includes inserting permission checks based on the policy for that machine.
Yes, it will be that powerful. My main concern is that it will be too powerful and thus have bad user experience or a steep learning curve. UX matters.
It is a culture thing. The damage that broken code has caused was either not big enough to shift that culture, or (more likely) everybody involved convinced themselves that software errors are both inevitable and a welcome excuse to pin things onto — just notice how often some privacy leak/hack/error is blamed on "software failure" as if it were one of god's lightning bolts nobody can protect against.
If we did electrical engineering like we do software, every gadget would come with a fire extinguisher. If we did civil engineering like we do software engineering, we would have daily bridge collapses.
At the dawn of modern civil and electrical engineering, many catastrophes did indeed happen, and they kept happening until the culture changed and had an impact on the education of future engineers. Software engineering has not reached this point at all outside of safety-critical applications.
> For example, why don't more languages allow me to say, "This XML parser is forbidden from talking to the network or file system"?
Because that's not up to the programming language to decide. In which context a particular piece of code is executed is out of scope for the design and compilation phase of your program -- that's part of the application deployment, and that deployment is usually not written in the same language as your program.
What you're asking for is basically the microkernel approach applied to applications: each microservice has its own capabilities and data access, and control is passed through RPC. This exists, and it can be implemented in any language (though more readily in some than in others), but this is not a programming language feature: it's a runtime feature that must be provided by the environment (whether JVM, Linux, Docker, SeL4 or browser) in which your code is deployed.
> Because that's not up to the programming language to decide.
There's actually a lot of really neat research in this space, including:
- "Row types" for effect systems, which allow the compiler to cleanly keep track of what code has access to what parts of the outside world.
- Capability systems, where you essentially need "handles" to access things in the outside world.
- Strongly typed unikernels, which use either of the above approaches to replace the process boundaries in a regular microkernel.
Admittedly, none of this is exactly ready for the mainstream yet. But there are plenty of ways to help address these issues at the programming language level. And I think this is worthwhile, because as the Chrome team has pointed out several times, trying to stick every parser in a seccomp-isolated process is a lot of hard work. And it rarely gets done in practice.
Given the sheer number of dependencies many projects have these days, and the growing number of "supply chain" attacks against libraries, I think this is worth all the effort going into it right now.
Maybe in 10 or 20 years, we'll see some commercially-acceptable languages that explore this space.
Even people like DJB have written buffer overrun vulns in their C code. In my opinion, it is impossible for an organization of meaningful size to write and maintain a program of meaningful complexity over a period of time in C or C++ (rather than some very strongly constrained version of C++ or a constrained and augmented version of C) without introducing memory safety errors.
I mostly agree, but I also feel it hasn't really been attempted properly, by which I mean exploiting the type checker to its fullest. For instance, write a stdlib where all functions reading input return "unsanitized" data types that must be supplied with a validation check before the sanitized value can be read out. Safer strings and arrays, possibly references instead of pointers, return explicit error values rather than errno, etc. And then require a program to only write against the safelib.
I suspect most C programmers just wouldn't like the constraints, or would be concerned with performance degradation, and that's the real problem.
Edit: another avenue is to also use Frama-C to check your code.
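A rough sketch of that "unsanitized type" idea in C. The names here are hypothetical, and in a real safelib the struct body would be hidden behind an incomplete type in the header so callers couldn't reach into it directly; the point is just that the only path from raw input to a usable string goes through a validator.

```c
#include <stdbool.h>
#include <stddef.h>

/* Raw input lands in a wrapper type; the payload can only be read
   back out via sanitize(), which requires a validation check. */
typedef struct { char buf[256]; } unsanitized_str;

typedef bool (*validator_fn)(const char *);

/* Returns the inner string only if the validator accepts it. */
const char *sanitize(const unsanitized_str *u, validator_fn ok) {
    return ok(u->buf) ? u->buf : NULL;
}

/* Example validator: accept non-empty strings of ASCII digits only. */
static bool digits_only(const char *s) {
    if (!*s) return false;
    for (; *s; s++)
        if (*s < '0' || *s > '9') return false;
    return true;
}
```

A program written only against such a library cannot forget to validate, because there is no other accessor.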
You can do this. You can ban all pointer arithmetic, use array types that have attached sizes and automatically introduce bounds checks, require various compiler extensions and annotations for lifetimes, require initialization immediately upon declaration, have the compiler introduce nullptr checks when it cannot prove a pointer is nonnull, ban std::variant and reinterpret_cast<T>, and more.
You end up with something that resembles the set of programming requirements from Rust (if you leverage lifetime annotations) or you end up with something resembling a refcounting GC (if you demand the use of shared_ptr everywhere).
This cannot really be "attempted properly" at scale. To implement this with C you need to both superset and subset the language. To implement this with C++ you need to harshly subset the language. Both communities hold backwards compatibility as a huge goal, making it impossible to move in this direction. A project like Carbon seeks to incrementally move in this direction, but required breaking off from the C++ community. Projects like Rust just rip the band-aid off entirely at the beginning.
The "don't worry, C is fine if you just don't suck at programming" folks don't tend to push for these extreme changes.
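For what it's worth, the "array types with attached sizes" piece is easy to sketch in plain C (the `int_slice` type and helper names here are mine, not any particular library); the hard part is the discipline of banning raw indexing outside the helpers, which is where a linter or review policy comes in.

```c
#include <assert.h>
#include <stddef.h>

/* A fat pointer: data plus length, with every access bounds-checked. */
typedef struct { int *data; size_t len; } int_slice;

int slice_get(int_slice s, size_t i) {
    assert(i < s.len && "index out of bounds");
    return s.data[i];
}

void slice_set(int_slice s, size_t i, int v) {
    assert(i < s.len && "index out of bounds");
    s.data[i] = v;
}

/* Sub-slices carry their own length, so the check travels with them. */
int_slice slice_tail(int_slice s, size_t from) {
    assert(from <= s.len);
    return (int_slice){ s.data + from, s.len - from };
}
```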
I'm not even thinking about compiler extensions, just standard opaque pointers and fat pointers, and macros/functions that operate on them, plus a linter that flags any references to unsafe C stdlib functions or uninitialized locals. This won't be as safe as Rust, but you'll at least still be in C. I just think we can be a lot further along the safety spectrum in C than we currently are; it's just that C programmers still use some outdated practices and idioms that could be safer.
The point is that you can get some safety guarantees. Nowhere near the degree of guarantees available in safer languages, but still better than the status quo.
Edit: one example of a pattern, whose name I can't remember, was to switch from returning pointers to data structures, to returning pointer offsets as handles. Using these handles you can then track more information about validity and handle "null pointers" more sensibly rather than it introducing undefined behaviour. In superscalar processors the offset calculation basically costs nothing, but the additional safety can be considerable. I believe I read about this pattern in game engines, let me know if you know the name of it!
You can get some safety guarantees. Many of these things are very good choices. I own a very large C++ codebase that uses this approach you mention all over the place.
But you can still see important limitations. Chrome, for example, has custom smart pointers (miracle_ptr) and still has UAFs galore.
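The pattern described upthread is commonly called "generational indices" or a "handle pool": callers hold a small index-plus-generation handle instead of a pointer, and every access checks the generation, so a stale handle is caught and returns NULL instead of becoming a use-after-free. A minimal C sketch (hypothetical fixed-size pool, names mine):

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define POOL_CAP 64

typedef struct { uint32_t index; uint32_t gen; } handle;

typedef struct {
    int      value[POOL_CAP];
    uint32_t gen[POOL_CAP];   /* bumped on free, invalidating old handles */
    bool     live[POOL_CAP];
} pool;

handle pool_alloc(pool *p, int v) {
    for (uint32_t i = 0; i < POOL_CAP; i++) {
        if (!p->live[i]) {
            p->live[i] = true;
            p->value[i] = v;
            return (handle){ .index = i, .gen = p->gen[i] };
        }
    }
    return (handle){ .index = POOL_CAP, .gen = 0 }; /* pool full */
}

void pool_free(pool *p, handle h) {
    if (h.index < POOL_CAP && p->live[h.index] && p->gen[h.index] == h.gen) {
        p->live[h.index] = false;
        p->gen[h.index]++;    /* outstanding handles are now stale */
    }
}

/* Returns NULL for stale or bogus handles instead of invoking UB. */
int *pool_get(pool *p, handle h) {
    if (h.index >= POOL_CAP || !p->live[h.index] || p->gen[h.index] != h.gen)
        return NULL;
    return &p->value[h.index];
}
```

On a superscalar machine the extra compare and indexed load are close to free, while every dangling-handle bug becomes a checkable NULL rather than memory corruption.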
"Out of 7 billion people on this planet, the number that I'd personally trust to reliaby write CVE-free C code is in the low triple digits. I'm not one of them."
My number is zero. I do not believe it is possible to build a truly safe C program of any meaningful complexity.
> Specifically, I do not think I'm smart enough to violate the Chrome team's Rule of 2
I simply don't write only C, or these days much C at all. What I write mostly is Python, and if I can't optimize that enough then I write C; or call C. Of course when you're calling somebody else's libs, all bets are off; and I don't think it matters what language the lib is written in.
The last time I needed a serde that "went to eleven" I wrote it in Rust using PyO3. I'm not saying it was good Rust. I wouldn't know. The compiler is a bitch. It's too hard to reason about the machine. Luckily, I wasn't asking it to do much except some byte hacking in a buffer.
Reasoning about the machine is becoming more problematic as exotic architectures proliferate and compilers unceasingly do increasingly weirder optimizing shit. Dunno if it matters what the language is.
Overflow panics in debug mode. And in release mode, overflow is defined to wrap (two's complement) instead of invoking undefined behavior. One of the big dangers in C is "nasal demons", after all: https://en.wikipedia.org/wiki/Undefined_behavior
But I've fuzzed a gnarly parser written in Rust, and actually found overflows. None of those overflows were exploitable (as more than a DoS), because safe Rust also bounds-checks arrays and slices.
Rust isn't perfect here. But in practice, it manages to mitigate almost everything anyways, because of boring ancient techniques like bounds checking. And the excellent fuzzing tooling makes it super easy to test parsers aggressively.
My favorite analogy for “only debug builds check this” is wearing a flotation device at the dock but throwing it overboard to sail into a storm.
Quiet overflow was sort of defensible on 4 MHz machines, but now Costco has desktops with a dozen cores, each a thousand times faster. Moore’s law has more than paid for bounds checking and overflow exceptions (or even bignums), and we should be using them by default.
It checks in debug mode, but these checks are turned off in release mode by default.
They can be turned on in release mode, but C/C++ could add `-ftrapv` too[0]...
You also have access to `checked_mul` and friends, so manually writing `i.checked_mul(j)` will add checks[1], without needing to use vendor-specific builtins, like in C/C++[2].
It has controllable behavior, explicit APIs to pick the precise behavior and there is no undefined behavior with regards to overflows which greatly helps with memory safety.
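For reference, the vendor builtins in question are GCC/Clang's `__builtin_*_overflow` family; a thin checked-multiply wrapper might look like this (a compiler extension, not standard C, so it won't build on every toolchain):

```c
#include <stdbool.h>

/* The C-side analogue of Rust's checked_mul: reports overflow via the
   return value and never invokes undefined behavior. */
bool mul_i32_checked(int a, int b, int *out) {
    return __builtin_mul_overflow(a, b, out);
}
```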
It is controllable in real C compilers too (e.g. the -fwrapv and -ftrapv GCC options), so you can make overflow silently wrap or trap. The real problem with integer overflows is that they are logic errors that are hard to check at compile time, and neither C nor Rust helps with preventing those.
I’m not sure that’s true but I also don’t have data on this. In my personal experience the issue with integer over-/underflows mostly played a role when those integers were used in branching or memory indexing. The rules around UB with overflows are really messy and those issues definitely do not exist in Rust.
The problem in C is that so much behavior is undefined, which implies "unreachable" to the compiler. This means C-style UB can time-travel, delete checks, delete whole functions, and trash the world. Rust (and most other languages) just defines overflow to wrap, which is way safer because the compiler isn't rewriting your code based on assumptions that might be false.
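A classic concrete case of that check deletion, assuming GCC or Clang at -O2 (function names mine):

```c
#include <limits.h>

/* Signed overflow is UB in C, so the optimizer may assume x + 1 never
   wraps -- and is then entitled to fold this "overflow check" to 0,
   deleting whatever branch it guards. */
int about_to_wrap_broken(int x) {
    return x + 1 < x;        /* typically compiled as: return 0; */
}

/* This version never overflows, so the compiler must keep it. */
int about_to_wrap_ok(int x) {
    return x == INT_MAX;
}
```

The first function looks like defensive programming but is dead code to the optimizer; the second expresses the same intent without performing the overflow.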
If you need C-like memory access (embedded, performance...), I highly suggest Zig, which is maturing really fast.
We use Rust and Zig in-house for our development, and while Rust is preferred for complex projects, we prefer Zig when we need super-portable code, code that works with a static memory buffer, or data structures that are hard to express in Rust (like self-referencing ones).
We still have some chips that require C because they have proprietary compilers; for those, we are very eager for the Zig-to-C compiler that should be coming some day.
I'd argue that compile-time is really nice exactly because it can be mixed with runtime. It takes some time to get a good grip of it, but in the end I realized that a lot of code can actually be made comptime.
I agree that async is peculiar, but I think it is still too early to decide if this works or not.
Compile-time evaluation is easier if you think of it as a separate language for pre-computing and generating zig code where most of the time it's used to supply the type of a parameter or construct a container like you would with a template.
You have copy-paste operators like `inline for`, `inline while`, and `switch (x) { inline else => {} }` that focus on generating code given a value known at compile-time (think of it like running a C program to print out the code within a `for` loop then inlining that in your code) and are great when you need to unroll a loop or loop over the fields of a struct (e.g when printing it). If you find those difficult then try printing the code they inline to get a feel for how it changes your program or writing what they'd inline manually.
For the more template style `ArrayList(T)` style you can follow the logic yourself by writing the struct it would result in when you substitute the parameters (again a copy paste but with slightly more going on) and any more calls with the same parameters will always return the same struct back.
Branching at compile-time is like having a smarter `#ifdef` for the most part where you use `if`, `switch`, and other control-flow to decide which code should remain in the program as long as the condition is known at compile-time. This logic can be applied to the copy-paste operators too depending on the condition.
Async is very experimental and incomplete, so it's currently difficult for most people. It can be thought of as a fancy way of building a state machine, with the `resume` keyword playing the role of a `.nextState()` call; the compiler performs a little magic at compile time so you can write code that doesn't care about this, until you get to function pointers, where the difference starts to matter. `async` and `await` should be understood in terms of `suspend` and `resume`, as they exist at a much higher level and can be a bit tricky. For a C equivalent, have a look at https://c9x.me/articles/gthreads/intro.html, which covers it well and is almost what Zig does for you with the `suspend` and `resume` keywords. Another way to get a quick overview is to compare generators in Python using `yield` with the equivalent state machine written by hand as a class.
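That lowering is easy to sketch directly in C: a resumable function is just a struct holding its persistent locals plus a switch on a saved state, which is roughly what `suspend`/`resume` desugar to. A hand-written sketch (names mine), using the old switch-into-loop trick from protothreads:

```c
/* A resumable counter: yields 0, 1, 2, then -1 when exhausted.
   The struct is the coroutine "frame"; state says where to resume. */
typedef struct {
    int state;   /* resume point */
    int i;       /* local variable that survives suspension */
} counter_t;

int counter_next(counter_t *c) {
    switch (c->state) {
    case 0:
        for (c->i = 0; c->i < 3; c->i++) {
            c->state = 1;
            return c->i;   /* "suspend", handing a value back */
    case 1:;               /* "resume" jumps back into the loop */
        }
        c->state = 2;
        /* fall through */
    case 2:
        return -1;         /* finished */
    }
    return -1;
}
```

Writing one of these by hand once makes it much easier to see what the compiler's "little magic" is actually generating.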
If you feel like trying it again, don't be afraid to reach out on the zig room on matrix. I'll be happy to help with comptime and related things as it can be a bit difficult to grasp.
Oh and don't give up! The best parts about learning is the rewarding feeling when it finally clicks after all the hard work.
Why was the "why" removed from the title? The author is writing about why they're doing something, not announcing to the world that they're doing the thing.
This comes across as rationalization and investment bias. Because someone "likes" something doesn't make it better, it makes it philosophy or religion.
You can put guards on a razor blade, but it's still going to slice your finger.
The title should be "Why I use C despite believing in memory safety (but you should use something safer that you prefer)". That better expresses the author's point and avoids the pointless comments about how hard it is to write C safely.
Basically, the author says that he feels unproductive in other languages so he's better off doing a good job in C than not finishing in other languages, and he enumerates techniques to reduce memory errors, freely admitting that other languages give that by default.
Yikes. I do not think you have thought through the implications of licensing (or purporting to license) under the union of three contradictory licenses.
I didn't read anything past that, because this is very much a "only way to win is not to play" kind of situation.
I'm currently making a language that translates directly to C.
There won't be pointers, and I think everything will be passed by value.
I'm not sure that compilers can optimize this and "move" returned data. But the goal is to greatly limit side effects and improve cache locality by encouraging the coder to use more functional habits instead of oop.
There are grey areas because I'm not sure the design is sound, but even if it's not, it can be safe and fast enough.
It is absolutely possible to develop robust, complex applications that you can trust using C. I think the best example of this is SQLite.
But, we need to recognize the size and coverage of the test suite involved with this product... I would be curious if any maintainers of SQLite have some commentary regarding the impact of unsafe programming on the extent of the testing.
Then again, SQLite is notable exactly because of how exceptional it is. Indeed, most C projects can’t afford to be developed and tested in such a way. So the problem isn’t that C can’t be made safe, practically any language can be made safe, instead it’s that C can’t be made safe within the resource constraints of typical development.
I also like writing code in C both professionally and for personal projects.
One thing I LOVE about C is the offline documentation for standard libraries. Beej’s guide to network programming is amazing but also reproducible with “man socket” and then wandering down the rabbit hole. (Ref: https://beej.us/guide/bgnet/html/split/)
With C++, the sprawling beast that it is, you have no choice but to consult some web page to get decent documentation. The same goes for Rust (please let me know if I am wrong here; I have not used Rust in several years).
Maybe it’s just a personal quirk, but when programming I love to shut off Wi-Fi and focus on whatever it is I want to get done. It is not always feasible of course, but it is definitely my preferred way to work. And C is the most amenable language to being offline.
Controversial opinion, somewhat philosophical: I use C because I still believe in freedom, and I think you should too.
If we gradually destroy all semblance of individual thought and discretion and replace that with something that essentially approaches rule-by-machine, what remains is not far from an authoritarian dystopia in societal terms. Even more so when you realise the direction Big Tech is taking. Jailbreaking and rooting are an escape hatch that they're certainly trying to close with all the "memory safety" stuff.
C also has an advantage of being simple enough that a single person can write a compiler for a nearly full subset of it. The same can't be said of many other languages.
I'm not saying I can write 100% correct code, although I certainly try (and IMHO it's not too difficult to get close.)
"Freedom is not worth having if it does not include the freedom to make mistakes."
I think equating a compiler that stops you from making runtime mistakes to an authoritarian social dystopia is ungrounded, baseless and frankly dumb. My idea of freedom is not wasting my life debugging null-ref errors on weekends because somebody else wants the freedom to pursue bad ideas.
To me the assertion has merit from the point of view that completely understanding what your application does is much easier in a small language like C or lisp/scheme, which I believe is essential to maintaining the proper master -> tool relationship with the computer. It seems straightforward to me that if you use things you don't understand, you will likely open your application to all sorts of unexpected behaviour.
Having worked on C and C++ compilers for a few years at this point: I would say that there’s a strong negative correlation between claiming that C is a simple language and actually understanding C, as specified.
C is actually a fantastically complicated language, both syntactically and at the semantic and abstract machine level.
While I said it's small, not simple, I do not think it is controversial to say that C is simplER than Rust if what you aim for is a complete understanding of your application.
I think that is controversial: Rust has more complex syntax than C does, but has a substantially simpler semantic model. Just a small example: given just `T*`, neither you nor your compiler knows what your global aliasing set is. The compiler has to do significantly more work to prove exclusive ownership (necessary for many optimizations), and you have no formal guarantee that your data doesn’t change underneath you.
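A small illustration of that aliasing point (function names mine): in C, exclusivity must be asserted manually with `restrict`, and nothing checks the promise, whereas Rust's `&mut` carries the same guarantee by construction and verified by the borrow checker.

```c
/* The compiler must assume a and b may alias, so it has to re-read
   *b after the first store through a: */
void add_twice(int *a, const int *b) {
    *a += *b;
    *a += *b;   /* reload of *b needed: the store to *a may have changed it */
}

/* With restrict, the programmer promises no aliasing (unchecked!),
   and *b can be kept in a register across both additions: */
void add_twice_restrict(int *restrict a, const int *restrict b) {
    *a += *b;
    *a += *b;
}
```

Call the `restrict` version with `a == b` and you get undefined behavior; write the same thing with two Rust `&mut` references and it simply doesn't compile.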
They’re comparable, as evidenced by both being representable within existing compiler IRs. But no, I said that C’s semantics are significantly more complex.
My view of C is as an abstraction over assembly. From this perspective I just cannot see that Rust provides simpler semantics, unless you are trying to make C do something you should probably use another language for, like lisp :)
This is what I mean: C is absolutely not a naive abstraction over assembly. The C standard is clear on this: C is defined in terms of an abstract machine, which in turn doesn’t cleanly map to any particular machine architecture.
Treating C like a “high level assembler” needs a rule like the cryptographic doom principle: it will inevitably lead you astray, and “astray” for C generally means memory unsafety.
> Treating C like a “high level assembler” needs a rule like the cryptographic doom principle: it will inevitably lead you astray, and “astray” for C generally means memory unsafety
You are right. However, I think C does a better job than Rust at being a model of an abstract machine. If we restrict the semantics to that, and the developer accepts that they need more knowledge than just C, I think the semantics are simpler than Rust's. In the same way, don't you think that Rust can create a false sense of security for those who don't completely understand what protections it provides? If we see programming languages as an HCI problem, then C by default offers more freedom, and consequently more unsafety. Anyway, I don't want to come off as hating Rust; I think it is a good language, and more importantly its community seems fantastic and I wish it success.
> dont you think that rust can create a false sense of security to those not completely understanding what protections it provides?
No: the entire point of Rust's design is that you don't need to know the abstract machine semantics to write Rust safely. The compiler enforces the semantics for you, rather than pushing them onto you.
C on its own does feel like a simpler language, though I think it's true that the semantics, taken literally, undermine that (the generated assembly can be really surprising sometimes). I do trust a given Rust program to have a specific runtime behavior much more when optimized. It might be interesting to come up with a variant of C that has genuinely simpler semantics as well (though I'm not sure how useful I'd find it).
It’s a common argument that a simpler language makes for more understandable code, yet practical programming tells a different story. From web browsers to every major C compiler, C++ is the language of choice. Similarly for GUI frameworks, GPGPU programming, etc. So what is it? Are we supposed to believe the, frankly, pompous notion that those developers/firms/institutions were all tricked? No, I think not.
If we are being honest with ourselves the truth is that even a language as nightmarishly complex as C++ is preferable to C. And whether or not you agree with that the fact is that the world marches on regardless.
Rust is a defined space. It doesn't restrict your freedom of thought or speech on your personal GitHub, you can use C or brainfuck there just fine.
Rust just says "This is an engineering project, not a philosophy paper, or science lab. It is not the time or place for experimentation, new ideas, and individual discretion, or for trusting each other or ourselves, we want reliability at all costs".
Which is exactly the attitude I want to see, when failure means I lose important notes, money, or get lost because GPS crashed, or my battery runs down because everyone was too afraid to optimize their brittle bug heaps, or a self driving car murders me, or we return to 1970s level tech due to nobody trusting software.
You can absolutely code in Rust for fun or experimentation. There's absolutely room for new ideas and individual expression in Rust. It's not a straitjacket. It's like you and the compiler are pair programming together; it's a collaborator, not a taskmaster.
> Rust just says "This is an engineering project, not a philosophy paper, or science lab. It is not the time or place for experimentation, new ideas, and individual discretion, or for trusting each other or ourselves, we want reliability at all costs".
I'm rather fascinated that you put this in these terms, because I've seen the exact opposite (specifically the philosophy paper). The few times I've seen rust tried out, it seemed to turn into a competition between devs about how clever they could be with category theory. Then an abstraction stack grows until logic errors creep in because nobody understands what the thing is actually doing anymore.
I don't want to blame rust specifically for this, because the same thing happened with scala. Not sure what it is, possibly just a lack of dev discipline. I can't help but view this category of languages (rust, scala, haskell, etc) as competitive philosophy because I have to spend several hours sitting around pondering the nature of things before I can make minor changes from my jira ticket.
The end result is that someone usually re-writes an "interim" solution in java/go/etc over a couple of sprints, and then nobody ever gets back to the long term solution written in the elegant language because they don't have time to understand it.
As a counterpoint, I wrote an "interim solution" in Rust in a few weeks that ended up replacing both the python and C++ versions that had been months in the making. I suspect the difference might be more about the skill and pragmatism of the developers involved than the languages used. Heavily functional code is not considered idiomatic in Rust.
They're all plenty skilled, but I'll give you the pragmatism one. A certain style of dev seems to have a compulsion to violate Kernighan's Law.
Eventually that abstraction hierarchy collapses on itself. I don't want to pin this purely on rust either, since I've seen it in plenty of languages (including java), but I see it happen often enough that I'm reluctant to trust my co-workers with rust at this point.
Or maybe they all can and I'll retreat into my little abstraction-minimal kingdom where the code's not very elegant, but I can figure it out, and I can onboard people in less than a month.
I feel like this is trying to solve an ethics issue using technology. It would seem more appropriate to fight for freedom by joining or supporting the people who already fight for it, than sitting back and thinking that by using C one helps the world in that regard.
I’ve seen you post a variant of this comment on multiple Rust-adjacent threads, and I still don’t understand it: Rust was by no means the first “safe” language, it wasn’t the one pushed by Big Tech (that’s Go, another good language!), and there’s no particular evidence that preventing jailbreaks is the actual, secret intention here. That last part borders on conspiracy, given that engineers are overwhelmingly anti-DRM and pro-modification.
given that engineers are overwhelmingly anti-DRM and pro-modification.
If they were, things wouldn't be the way they are today. Perhaps you're saying that they are, but can only express that position in the form of subtle exploits (which seem to be decreasing over time)? No, I think the majority of engineers are overwhelmingly complacent and obedient, and won't oppose.
As for why Rust, that's because the amount of proselytizing of that language seems to be far more than all the others.
Don't mind the downvotes, I somewhat agree with you on the lack-of-freedom argument (although not quite sure about the larger context around dystopia).
To put it in other words, I want my tools to do whatever I want them to do and not the other way around.
I want my tools to do whatever I meant to write. If I made what I consider a mistake, I want a tool to point it out. And I want memory safety, because without that, terrible things can happen that nobody ever wrote.
Pointing out mistakes is the job of the linter; the job of the compiler is to run my program ASAP and even try to fix my mistakes automatically. There is nothing more pedantic than a compiler saying "Oh, you shouldn't do this as it can potentially be a race condition".
Is there no semblance of individual thought in C++? No discretion in Python? Are you ruled-by-machine if you use Go? Is using Javascript an authoritarian dystopia?
Apparently the road to hell is paved with vague slogans...
I don't agree with the grandparent post but can offer an interpretation of what they mean. Languages like Go, Rust, Java, etc. are enormous and are designed and managed by a committee. Sometimes, they go in a direction that you might not like. For example, many people were pissed off by the compatibility break between Python 2 and 3. If you find yourself disagreeing with the designers' decisions, you have very little recourse but to stick with an old version of the language/tooling while the rest of the world moves forward, because you are in no position to understand and maintain a huge language implementation codebase. The grandparent is implying that C is so simple that if you disagree with the direction of the official C language and major compiler vendors, you can simply fork the language and reasonably write your own compiler. But of course, I would say that you then have the freedom to reinvent the same bugs over and over again, the ones the committee was trying to proactively solve for you.
> I run Clang and GCC with -Wall, -pedantic, and -Werror. This means I eliminate all warnings at every stage of development.
> I even run Clang with -Weverything
This isn't "discipline". Those warnings themselves aren't all tested and often don't work. There's no point in conforming to a broken check like -Wstrict-aliasing or one that could be arbitrarily hard to fix like -Wdeprecated.
I have a similar issue. I like the memory safety in Rust, but I don't like Rust: it limits my productivity for two reasons. 1. In order to reach memory safety it adds abstractions that don't make programming simpler, but harder. 2. It is a language that has started down the slippery slope of too many ad-hoc features. And given that I work alone, I try to write safe C code as much as I can (it is impossible to reach perfection, anyway), and avoid Rust as much as I can. What I will do more, however, is use other safe languages when C's low-level control is not needed. Go is a good option, for my tastes.
P.S. I see in realtime, reloading the post, the number of downvotes received (the score goes up and then way down with the downvotes). Another problem is the Rust community. It has a very peculiar way of being in this world.
> I don't like Rust: it limits my productivity [...] I try to write safe C code
It's definitely fair to argue that Rust is complex. But this doesn't compute.
You don't like Rust because you're less productive with it so you write... C? Rust is so much more productive compared to C it's not even funny, even with its limitations and extra complexity that are necessary for memory safety. Things that take pages of code in C can be done in a single line of code in Rust.
This starts at fairly fundamental data structures. Need a hash map? In Rust you have one available, out-of-box, with state-of-art performance. In C? Implement one yourself. And good luck getting even a fraction of the performance the Rust one has.
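To make the contrast concrete, here's a minimal sketch of that out-of-box hash map in Rust (the `word_counts` helper and the sample text are my own illustration, not from the thread):

```rust
use std::collections::HashMap;

// Word-frequency counting: a task that needs a hand-rolled table (or a
// third-party library) in C, but is one loop with the std HashMap here.
fn word_counts(text: &str) -> HashMap<&str, u32> {
    let mut counts = HashMap::new();
    for word in text.split_whitespace() {
        // entry() returns a handle to the slot, creating it if missing.
        *counts.entry(word).or_insert(0) += 1;
    }
    counts
}

fn main() {
    let counts = word_counts("the quick fox and the lazy dog and the end");
    println!("'the' appears {} times", counts["the"]); // prints "'the' appears 3 times"
}
```

No allocation strategy, hashing function, or collision handling to write; the default hasher is also hardened against hash-flooding attacks.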
Then there is dependency management and a huge ecosystem of libraries. Do you need to, say, encode an AVIF image? (Something I personally had to do.) Easy. `cargo add ravif`, and two lines of code later you're done.
I can keep on going, but I'm sure you get the idea.
I really honestly don't see how anyone can in good faith argue about productivity and say that C (out of all things!) is more productive than Rust. That Rust is more complex, sure. That you don't like it for aesthetic reasons? Fair enough. But not as productive as C? No way. Perhaps this is where the downvotes you're complaining about come from?
I bet when you say "Rust is more productive than C", what you actually mean is "the Rust stdlib and its wider library ecosystem enabled by Cargo is more productive than what the C stdlib offers", and that is no doubt true. But when you look at just the language, the OPs statement is also no doubt true.
The libraries that make C more productive are out there, they are just harder to find than in the Rust ecosystem.
IME C programmers (including myself) separate a language and its stdlib much more strictly than (for instance) the Rust, C++ or Python crowd, mainly because the C stdlib is pretty much useless for anything that's not a simple command line tool (and because of such restrictions, C programmers also often use a whole language toolbox instead of attempting to write everything in C - Python and C go very well together for instance, right tool for the job etc...).
> I bet when you say "Rust is more productive than C", what you actually mean is "the Rust stdlib and its wider library ecosystem enabled by Cargo is more productive than what the C stdlib offers", and that is no doubt true. But when you look at just the language, the OPs statement is also no doubt true.
No. I also mean that the language itself is more productive, even without the standard library and the tooling. (Although, indeed, to a lesser degree.)
Sum types. Pattern matching. No need to bother with header files. A module system. Proper generics. AST-based hygienic macros. Procedural macros. Stronger type system. RAII. A proper string type which knows its length. A proper array type which knows its length. Unicode support*. Async + await support. Support for closures. Ergonomic error handling (with the '?' operator). Proper abstraction for iterators.
Do you want me to keep going? Because I can keep going. (:
Again, this is more complex than what you have in C. And it is harder to learn. But that's not what we're talking about. We're talking about pure productivity here, so I'm assuming on one side you have someone who has mastered C, and on the other side you have someone who has mastered Rust. There's just no contest that the Rust person will get things done significantly faster than the C person, because the language itself is so much more powerful. (But of course it's a lot harder to master Rust, so the calculus might be different if you don't want to master your tools.)
* - part of Rust's core so also available in `no_std` contexts, so I don't see it as part of the stdlib but part of the language.
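A few of the items on that list compose nicely in even a tiny example; here's a sketch (the `add_strings` function is mine, purely illustrative) showing sum types, pattern matching, and the `?` operator together:

```rust
use std::num::ParseIntError;

// Result is a sum type; `?` pattern-matches on it and returns the Err
// variant early, so error propagation costs one character per call.
fn add_strings(a: &str, b: &str) -> Result<i64, ParseIntError> {
    let x: i64 = a.parse()?;
    let y: i64 = b.parse()?;
    Ok(x + y)
}

fn main() {
    // Exhaustive pattern matching: the compiler insists both variants
    // are handled.
    match add_strings("2", "40") {
        Ok(n) => println!("sum = {}", n), // prints "sum = 42"
        Err(e) => println!("parse failed: {}", e),
    }
}
```

The C equivalent would thread errno or out-parameters through every call by hand.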
Some language features from your list I agree with (e.g. Rust's "everything is an expression" is really nice), others are just batshit-crazy from my pov (the macro system for instance), others are just more or less random tack-on features (e.g. I don't really see the usefulness of async-await in a systems programming language - especially not when it potentially fragments the library ecosystem).
I would still argue that none of those language features really move the productivity needle a lot into one or the other direction. A good library ecosystem on the other hand does make a difference.
All in all, I would probably add fewer features to C than I would remove from Rust to get close to what I expect from a performance-oriented systems programming language (Zig is actually pretty close to that ideal, but that's a different discussion).
> others are just batshit-crazy from my pov (the macro system for instance)
Ah yes, I won't disagree here; macros get very complex very fast. It's one of the more undercooked parts of Rust.
> e.g. I don't really see the usefulness of async-await in a systems programming language
This really depends on the task, but honestly it is genuinely useful. Even in places you wouldn't expect, e.g. in embedded: https://github.com/embassy-rs/embassy
And AFAIK there are plans to use async Rust in the Linux kernel.
I've heard this often, but I'm not convinced. C just lacks mechanisms to accommodate something like generic data structures; all the libraries I've found feel very hacky - it's either void pointers or macro overload.
The limits you are referring to are not intrinsic to C, but mostly to its standard library. Any long-term C programmer has libraries, written by herself or by others, that limit this problem a lot. An example, starting from your own example: I need a dictionary? I take my rax library, written for Redis. I need dynamic strings? I use SDS (and limit a lot of the memory safety issues I could have). C is not well understood because it is seen as the language + its library. But the undefined behaviours and the terrible library are not C's original sins, but the terrible work of the ANSI C committee. The ANSI C committee is responsible for a lot of bad things in software in recent years.
But the problem is that C fundamentally limits reusability! I have great respect for you and redis, but your examples are telling: they deal only with strings. With Rust or C++ you can have an extremely efficient, well tested, robust generic map structure that will accept any key or value types. C simply cannot do that, because of the language's limitations, not just its stdlib.
But C could be improved by adding features that allow this, and there is a lot of accumulated CS experience around metaprogramming to design it without many pitfalls. No one says this is easy - it is really hard - but it is possible. Unfortunately, the committee plays with the old and ugly macro system and a little bit with lambdas, which for sure is better than nothing, but not enough to make C better.
> This starts at fairly fundamental data structures. Need a hash map? In Rust you have one available, out-of-box, with state-of-art performance. In C? Implement one yourself. And good luck getting even a fraction of the performance the Rust one has.
I've implemented a hash map in my code. I loved the experience, and I know more about computers and hash maps now that I've done it.
And my hash map was faster than the one in the C++ STL last I checked. I haven't benchmarked it against the Rust one, though.
> And my hash map was faster than the one in the C++ STL last I checked.
Last time I checked, benchmarking hash maps, or FWIW any other data structure, in a vacuum did not make much sense to me. Microbenchmarks cannot mimic or predict end user workloads well enough, at least in the cases where it's worth it.
That said, each implementation has its own trade-offs, and in particular the one in the C++ STL opted for pointer stability. For that reason alone, which I think is quite worthy, open-addressing conflict resolution is out of the question because it cannot give such guarantees.
With that in mind, if you're ok that a pointer to your key-value pair can all of a sudden be _invalidated_ by the mere operation of inserting or erasing yet another element from your hash-map, then of course, you can trade robustness for a more cache-friendly conflict resolution algorithm. But then again, this says nothing about the potential performance improvement unless you're able to measure it.
> I've implemented a hash map in my code. I loved the experience, and I know more about computers and hash maps now that I've done it.
Yep, everyone at least once should implement a hash map. Hash maps are so fundamental and so often used that it really pays off to understand what makes them tick.
> And my hash map was faster than the one in the C++ STL last I checked. I haven't benchmarked it against the Rust one, though.
The one in C++ is very famous for being really slow, so this is a low bar to clear. (:
Not sure if someone directly compared Rust's hash map to C++'s, but Rust's hash map is based on Google's SwissTable, so you can probably assume it performs roughly the same. Here are some graphs: https://youtu.be/ncHmEUmJZf4?t=2093
> The one in C++ is very famous for being really slow, so this is a low bar to clear. (:
There's nothing wrong with the C++ hash-map. It's a different trade-off which makes perfect sense for something that is part of the general-purpose toolbox. See my response above for more details if interested.
Rust seems nice when you see someone like Jon Gjengset writing it [1]; however, when actually writing it yourself you discover that your errors have errors, which turn up only some kind of alien gibberish when you Google them. Perhaps ChatGPT could help here - it's quite useful/pleasant for trivial quizzes - however I asked it recently how to handle a file upload using actix-web and it hallucinated some really nice methods; too bad they aren't in the code base.
Just a random recent example: call an async function for a side effect without caring what its result is, and Rust yells at you that the result must be handled ("`#[warn(unused_must_use)]` on by default"), so you'd think you could just annotate with `#[allow(unused_must_use)]`, but no way, you must store the result in a _ variable: "let _ = my_async_fn().await;". It's a great language, it's just ugly, sorry, not sorry.
You do not have to use the result of an async function to avoid getting a warning. You have to use the future returned by the underlying sync function (I.e., you have to invoke “await”), in order for the async code to actually run. That warning probably caught a real bug.
No bug caught, just a mild annoyance. In this setup I have an async function `index` returning `actix_web::HttpResponse` which is to be used for an actix-web route `.route("/", web::get().to(index))`, inside `index` as a side effect I want to call another async function `write_metadata` which returns `Result<bool, anyhow::Error>` but I don't actually care about the result. I can play with the return type of the `write_metadata` async function, making it return a custom error or even a `HttpResponse`, or I can use `let _`, anyhow, lots of work for something this trivial. As said initially: seeing a fully completed Rust project, syntax ugliness aside, it's rather interesting, building one gradually, trying things out, means plenty of pain from meaningless/context irrelevant language quirks.
That’s because Result is must_use. Nothing to do with async.
In that case, yes, let _ is the usual pattern. But it’s rare that I ever actually want to use that. Typically I either return the error upwards or just unwrap. But if you truly don’t care about whether the operation was successful you can indeed ignore the error.
There are plenty of pain points with Rust but I don’t really think this is one.
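For reference, the pattern under discussion boils down to something like this sketch (`write_metadata` here is a hypothetical sync stand-in for the actix-web side effect; the async part is irrelevant, as noted above):

```rust
// Result is marked #[must_use] in the stdlib, so silently dropping one
// triggers a warning; `let _ = ...` opts out of it explicitly.
fn write_metadata() -> Result<bool, String> {
    // hypothetical side effect whose outcome we genuinely don't care about
    Ok(true)
}

fn main() {
    // write_metadata();      // would warn: unused `Result` that must be used
    let _ = write_metadata(); // the `_` pattern discards it, warning-free
}
```

The warning exists because an ignored `Err` is one of the most common silent-failure bugs; `let _ =` is the one-token way to say the ignoring is deliberate.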
Btw, the article you linked argued against the claim that Rust’s syntax is ugly, and instead claims that Rust’s noisyness is due to its semantics, largely in service of performance (which is why production C++ code is often similarly ugly).
Nitpick: _ is not a variable, it's a wildcard pattern that means "ignore this value". It can occur in other places like tuple deconstruction and the likes.
Nitpicking the nitpick: _ is a wildcard pattern when used in a match as in [1], and indeed a placeholder when standalone, but the underscore is also a special prefix in normal code flow to allow for unused variables, as in `let _foo`. I even asked ChatGPT about it (they must add permalinks for the answers); it answered: "In Rust, the '_' symbol is a special value that can be used in a number of ways. Here are a few common uses: 1. As a placeholder for unused variables. 2. In pattern matching. 3. In iterators."
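The distinction is easy to see side by side; a tiny illustrative sketch (names are mine):

```rust
fn main() {
    let pair = (1, "ignored");
    let (x, _) = pair; // `_` is a wildcard pattern: nothing is bound here
    let _hint = 99;    // `_hint` is a real binding; the underscore prefix
                       // only silences the unused-variable warning
    assert_eq!(x, 1);
}
```

So `_` by itself never creates a variable, while `_foo` does; you can read `_foo` later, but `_` has no name to read.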
Although `go vet` is nominally a linter it's probably more accurate in relation to other languages to think of it as `-Werror`. Everything it reports should be considered a serious issue, and it runs by default during `go test`. They haven't been moved to the compiler proper out of back-compat concerns (but lately even some of those are being overridden for very pathological things like int/string conversion).
I also have mixed feelings about Rust. It gives me too many flashbacks from Scala. Complex types, a lot of generics, the same elitist attitude. I just think it will end up the same way as Scala. A few years of hype, and now we have plenty of unmaintainable codebases from wannabe functional zealots.
I like other communities experimenting with new ways. And the Rust lesson will not be forgotten even if Rust is replaced by something else. However what I do not understand, given the enormous code base written in C and the fact it will be used more in the future regardless of Rust and other alternatives, is why we don't fix C at the same time. Create a standard project, within ANSI C, to write a new, solid, modern standard library for C. Change the language behaviour so that the default compilation target is full of defined behaviours for all the common edge cases, and you can enable an "UB mode" only for speed and if you know what you are doing. And so forth.
I suspect the problem is as always, political. The standards committee is dominated by compiler vendors, who at this point are desperate to cast C in the dustbin of history, or at least put it in deep freeze, and are eager to push their other favoured languages like C#, C++ and so on. Therefore the committee has decided to please all parties by pleasing none, i.e., keep C stagnated, with only minimal changes that everyone manages to agree on, essentially relegated to embedded work and legacy code maintenance. And as long as C remains under ISO's grip, this won't change, sadly. C needs a rejuvenation and ISO is the last place to make it happen.
Absolutely true. C might have its own path of evolution to a Modern C with quite a good standard library and mature features, but the people who stand behind C++ don't want this, and some of them even behave really aggressively if languages like Rust or Zig or a Modern C (unfortunately that will never happen) try to replace C++ or take even a little piece of the pie.
C++ is not a bad language, if only because lots of amazing software is already written in it. Some people accept it, with all its myriad complex rules, and carve out their own subset; others totally disagree with this and just want a Better (Modern) C with plain and simple rules, without hidden complexity happening behind the scenes and counterintuitive features.
With Rust everything is cool, except that sometimes it doesn't allow you to write correct code and pushes you to do things in ways you don't like. But we are programmers, and some part of us is a painter :)
This would be really cool: a more modern version of C, made by the C standards committee, without UB and with a small but robust stdlib. It would have to stay compatible at the ABI level, but source-level compatibility I think we can skip.
For me Zig looks like something that could become a new C. They aim for great C binary and source compatibility (both ways - called and callable from C). It’s planned to be small and lean. And avoid preprocessor and UB as much as possible.
Both suggestions require companies to invest a lot of their resources (money, time) in something that seemingly is not particularly interesting to them. The former suggestion is somewhat realistic, but the latter I think not very much so. C++ was/is what the companies chose to put their money into.
I think your first point on limiting productivity is true only early on (or at least it was in my case) and when working alone.
I'm pretty confident that this would go away if you wrote Rust for a few weeks.
I remember when first learning C, I struggled with some things too, but then they just clicked. It's probably similar when learning any new language (especially if it adds something novel).
I've been writing Rust code since 2013, and did what was probably the first large-scale production deployment of Rust code in 2014, at OpenDNS. Wrote quite a few crates and apps since, and started to heavily advocate for it.
Here's why. I had quite a bit of experience with many functional and imperative languages before, and picking up new languages used to be a seamless experience. But Rust turned out to be extremely frustrating. I felt like I couldn't tell the compiler to do what I wanted it to do any more.
I took it as a challenge, didn't want to give up.
And when the first non-HelloWorld app actually got to compile and work (it was probably IPTrap 2, that implements a userland TCP stack), there was this feeling of accomplishment that I couldn't help myself sharing with the rest of the world.
Fast forwarding a couple years... I don't enjoy Rust any more. I still maintain Rust code, but it's a pain, not something I'm having fun doing.
First, memory safety in Rust is overrated. What it actually brings over GC'd languages is that two threads can't share the same pointer at the same time. Nothing else. If I'm not using threads, it doesn't matter. It's not any safer than PHP, Go or JS, but is far more complex. It won't prevent memory leaks or deadlocks either. To some extent, it actually encourages them (I'm often blindly adding Arc, Box, and Mutexes just to silence the borrow checker, not because they are actually needed).
Next, readability is not great. Too many sigils, and the language got more and more complex and less and less readable over time. I'm frequently having a hard time reading my own code after a while. Even more with other people's code. It often feels like people try to maximize the number of language features being used, so that the source code looks impressive, rather than keep the code basic and simple.
Finally, the ecosystem is very unstable. 3rd-party crates (even essential ones such as for error handling, due to limitations of the standard library) are constantly deprecated, abandoned, or making breaking API changes. I gave up maintaining some projects, and keep using abandoned crates (even with security issues, such as old versions of hyper) in others, because the maintenance cost is too high.
After using almost exclusively Rust, choosing Go for a new project (dnscrypt-proxy 2) was enlightening. There were probably runtime bugs that Rust could have detected at compile-time, but overall, productivity was way higher. Making a change is straightforward, I don't have to think about the implications on the borrow checker. The compiler is fast. Performance is fine, and I can spend more time optimizing the actual algorithms than fight the compiler. Go tells you at runtime, not at compile time, if locks have been forgotten. But otherwise, memory safety is comparable to Rust. The runtime gracefully catches out of bound accesses to slices, null pointer derefs, etc.
I also still enjoy writing C code. It's good to have some freedom and control. But C also has rough edges. UBs, portability, minimal standard library.
Zig fixes many issues that C had, and is the language I really enjoy the most now. It's fast, portable, easy to learn, easy to read, has proper error handling, strong typing, catches out of bound accesses to slices and does checked arithmetic, and, overall, is a joy to use. Not to mention its seamless integration with C.
It doesn't have temporal memory safety, but, like Go, Zig's simplicity helps me reduce logic bugs, and focus more on algorithms than their implementation, overall resulting in better code.
Thanks for sharing your perspective. I think, broadly speaking, that code maintenance is no fun. Thus, languages with a huge existing code base are often dreaded (Java, C++).
Rust profits from "rewrite it in Rust", since it is more fun. But so does Zig.
Still, I am worried about the ecosystem, since every time I pull a bigger crate through cargo, I have to think of npm and the sad state of the JS ecosystem. Also, I have a small rewrite-from-Python project at work and I'm also worried whether it will be easy to upgrade in 2, 3, 5 years. If I write it in Java, it certainly will...
Fully agree on readability. It is not that Rust is unreadable, but a lot can happen in the background and you have to look hard for small sigils, presence of semicolons and blocks.
I think that Rust is great option for writing new software where having any kind of automatic memory management isn't an option (or even if possible is a lost battle trying to convince others), like kernel code, drivers, type 1 hypervisors.
Otherwise there is Go, Haskell, OCaml, Java, Kotlin, Scala, Clojure, Lisp, Scheme, F#,...
> 1. In order to reach memory safety it adds abstractions that don't make programming simpler, but harder.
Another possible conclusion is that your program design is just not as simple as you thought it was, and Rust is pointing out why. This drives you to make it even simpler, which is arguably good.
If you don't like Rust, try a Lisp. You're probably looking for one that can compile fast code and has good, easy to use C FFI support. Gambit, Chicken, SBCL, CCL, and ECL are all good choices here, with Guile 3.0 also potentially a contender.
If Redis works then the proof is in the pudding, I guess. But as an average/below average programmer I’ll stick to something more modern when I need that level of control.
If you are part of a development community, you also get to read other people's code, use libraries, and so forth. When I open a C file it is sometimes written without care, but C is so self-evident that it's hard not to understand the code after some superficial reading.
I think C is misleadingly easy: it's easy to reason about, say, a single function body since there's only so many things you can even write in C.
But when you start composing things together and juggling the lifetimes of various things, the complexity balloons and it becomes a lot harder to understand "spooky action at a distance"-type things. You might argue that with good structure and discipline it's still not difficult, but the assumptions are all implicit and you have to get into the mind of whoever wrote it in the first place to understand it. And I write a lot of C and still find it tricky when I have to go look at some random C codebase!
> I write a lot of C and still find it tricky when I have to go look at some random C codebase
Absolutely. C Code is not necessarily easy to understand. Not at all.
But the C parts themselves are. In C you seldom wonder what C is doing; you wonder what the C code is doing.
If you work like OP likes, single person projects, small enough to keep almost completely in your head, I can see how C is especially charming.
You never wrangle with C, only with your own code.
I agree, but what I said has very little to do with that. What you say means that poorly written code has unexpected bugs. But you still understand what it does, easily.
This leads to the same problem as in the C++ world: a fragmentation of the library ecosystem because of wildly different opinions on what "proper C++" actually is.
One example: There's a lot of JSON parsing libraries across the whole spectrum, from using the latest C++ features, or exceptions for error handling, or various subsets of stdlib types, etc... down to essentially being "C with namespaces".
But usually the most controversial features are:
- use of exceptions
- use of RTTI and dynamic_cast
- use of stdlib types which allocate memory under the hood
- ...or even use of any stdlib types
- has boost or similar complex dependencies
- use of 'modern C++' features that explode compile times (like the range stuff)
...etc etc... when you look at C++ libraries written in the game development or embedded hemisphere, those usually use an entirely different C++ subset than the more academic "modern C++" crowd.
The deep-embedded/real-time compatible features I'd agree, that's really a different ecosystem, but that is the case in every language (assuming the language can even be used in such environments). Other than that I don't see it come up very often in practice as something that makes or breaks a library choice.
Rust has explicit subsets of its standard library:
* core, whose only dependencies are six symbols (memcpy, memcmp, memset, strlen, rust_begin_panic, and rust_eh_personality)
* alloc, which layers on top of core, and adds the ability to allocate memory
* std, which includes all of the operating system specific things
But most importantly, you can declare that your code doesn't require std and alloc with an annotation, and there's a way to tag these crates in the package manager as well. Additionally, cargo has a "features" feature, so you can say "please give me the no_std version of the library" and get what fits within your needs.
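As a small sketch of what "only needs core" looks like in practice (the `largest` function is my own illustration): code that touches neither allocation nor the OS compiles unchanged inside a `#![no_std]` crate, because `core` is always available.

```rust
// Depends only on `core`: no heap allocation, no operating system.
// The identical function body works in a #![no_std] embedded crate.
use core::cmp::max;

pub fn largest(values: &[i32]) -> Option<i32> {
    // reduce() folds the slice pairwise; Option encodes the empty case.
    values.iter().copied().reduce(max)
}

fn main() {
    assert_eq!(largest(&[3, 9, 4]), Some(9));
    assert_eq!(largest(&[]), None);
}
```

Libraries written this way are exactly the ones a deep-embedded team can pull in without auditing for hidden allocations.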
My team at work is working purely in the 'core' subset, and we're able to easily find and evaluate open source projects that will work for us, because of this.
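As a sketch of how that pattern usually looks (crate layout and feature names hypothetical, but the idiom is common): a library makes std opt-in via a cargo feature in its Cargo.toml:

```toml
# Cargo.toml of a hypothetical no_std-capable library
[features]
default = ["std"]   # std enabled by default; embedded users disable it
std = []            # gates code that needs the full standard library
alloc = []          # gates code that only needs a heap, not an OS
```

and the crate root guards itself with `#![cfg_attr(not(feature = "std"), no_std)]`, so the same codebase serves both the hosted and the deep-embedded worlds.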
Oh, I should also mention: while Rust doesn't have exceptions, there's also a way to compile your program so that panics abort instead of unwind the stack, similar to -fno-exceptions being passed when compiling C++ code. There is also a setting you can include so that, if, for some reason, you require the unwinding semantics, if someone tries to build a project with abort semantics and your program, it will not allow it and let you know why. I don't think I've ever seen this annotation on a package in the wild, because since they're not used in the regular course of programming in Rust, virtually every open source library does not use them for important semantics and so therefore works with either setting.
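For reference, the abort-on-panic setting mentioned above is a one-line profile switch in Cargo.toml:

```toml
# Make panics abort instead of unwinding the stack,
# roughly analogous to building C++ with -fno-exceptions
[profile.release]
panic = "abort"
```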
One is a cultural matter: a design ethos that no longer actively attempts to produce a language with few orthogonal features that work well together. The second is a side-effect: the more features you have in a language, the more complex new features will be, to accommodate all the past cases.
I like C as well, but I never kid myself into thinking that discipline and documentation can replace compiler restrictions. I've written large codebases in C, sometimes alone, sometimes in a team, and despite DECADES of experience and meticulous attention to detail, I STILL find the same old memory errors in my own code.
I mean this is demonstrably untrue: seL4 is written by mere mortals, in C. It's safer than Rust; there are formally verified security guarantees that include and go beyond the memory safety of Rust.
It is the case though that C is an incredibly shitty language to do this sort of static analysis on, because, for example, it's easy to do shenanigans like lexical macros that make the static analyzer's job much, much harder.
seL4 also has the whole Isabelle/HOL theorem prover to verify its properties, so it's not quite right to claim it's written in C. If you can claim seL4 is "written in C", you could equally claim that if the Rust compiler emitted C, you're writing a C program -- after all, a static type checker is just a shittier theorem prover.
This is like saying that writing <insert language with docstrings> is not actually writing in <lang> because you need the doc engine's target language (md, XML, whatever) to compile the docs.
That comparison is nonsensical and you know it. You can delete all the comments and have the same guarantees about what the code does (none). You can’t write a kernel without some formal verification tool and then claim that it adheres to the specification.
Oh, I see. You said that it is “written in C”, and then later denied the claim that it is also written with that formal verification language. If you delete those annotations it will still adhere to the specification,[1] but it’s still false that it was purely written in C—your statement that “there are formally verified security guarantees” in fact relies on the existence of that proof code.
But fine. With that kind of dishonesty I could claim that the Linux kernel is written in nothing: I have compiled it and deleted the source code. Look ma, it’s no-code kernel.
[1] Kind of a bummer though that you cannot change any of the C code any more now that the verification code is gone.
Jesus Christ. It's not like most C programs aren't already written in another language like make or cmake. By your standards, none but the most trivial hello-world a.out programs are C programs.
Macros are not such a big deal: static analyzers do not work on the source code directly, they work on some intermediate representation usually (often the LLVM IR, for Ikos and Clang SA for example) or parsed source code. Either way, the pre-processing has been done when the analysis starts.
Funky macros can still make static analysis more complex by introducing layer(s) of indirection of course.
You can make any code in any language "safe" when you throw enough resources at it. That's not an argument for safety, nor a rebuttal (unless your goal is to prove someone wrong on the internet).
It's the exact opposite of "you cannot make language X safer", only if you define that as "given enough time and resources, you could make a proof that your program is correct".
But that's not a safe language; that's a safe program. ANY programming work - regardless of language - can be made safe via such expensive means, which puts this square into the space of "argumentum ad absurdum". You could argue that brainfuck or assembler can be made safe via these means, but everyone would rightly ridicule you for it.
Maybe I'm crazy but "being a god" is much higher bar than "given enough time and resources, you could make a proof that your program is correct". Moreover, unlike your bf analogy: Sel4 exists, and is in use, and is used in safety-critical applications such as military drones.
it's also a very limited version of C, without concurrency, dynamic memory allocation, or goto, and you can't even take the address of local variables!
> C cannot be made safe because you are not a god.
Oh god yes. That should be tattooed on everyone who thinks otherwise. “Ha! Unlike all those other people, who are clearly not as attentive as myself, _I_ never make this type of mistake.”
To be fair, religiously running Valgrind, Helgrind, Asan plus a static analyzer will probably do more than most compilers to ensure you don't have memory issues. Probably more than Rust, at least if you find yourself needing some unsafe blocks in your own code or dependencies.
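For the curious, a belt-and-braces setup might look something like this (a sketch; `app.c` is a stand-in for your project, and note TSan and ASan need separate builds because they're mutually incompatible):

```shell
# Build with AddressSanitizer + UndefinedBehaviorSanitizer and run tests
clang -g -fsanitize=address,undefined -o app app.c

# ThreadSanitizer requires its own build
clang -g -fsanitize=thread -o app_tsan app.c

# Valgrind runs against an uninstrumented binary
clang -g -o app_plain app.c
valgrind --leak-check=full --track-origins=yes ./app_plain

# Static analysis pass
clang --analyze app.c
```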
This only works if you don't stray too far away from the "typical" C. But a lot of large projects written in C will have tons of extensions and modifications making tools like Valgrind irrelevant.
For instance, I worked at a company that rolled its own coroutines, in a way very similar to how they work in Go, on top of many other things they did to the language runtime. Valgrind couldn't "understand" what was happening in that runtime at all, and if you tried to run it, it would spew millions of warnings (a similar problem to what you'd have if you used the Boehm GC with your C program).
There was a way to "teach" Valgrind to understand what was going on in that runtime, and even though we tried, at some point we realized that it would take too much effort to do so.
I'd bet that OPs home-made alloca() would have a similar effect on Valgrind too.
I was only trying to defend the article author from the accusation of hubris. If all of these tools are indeed used until they show no errors at all, not even false positives (as the article claims), and if the test suite used to run them is non-trivial, then they are likely far more secure than a typical program written in Rust or even something like Go or C# or Java (which all allow and internally use raw pointer manipulation for things like interop).
Broadly speaking, it's automated generation of test inputs to try to find bugs. For example, fuzzing a parser might involve feeding it random inputs and watching for crashes, hangs, etc.
Fuzzers frequently also use instrumentation to try to figure out inputs that will hit more code paths in a more intelligent manner.
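A minimal sketch of the "dumb" end of that spectrum (the toy parser and tiny PRNG are invented for illustration; real fuzzers like AFL or libFuzzer add the coverage-guided instrumentation mentioned above):

```rust
// Toy parser under test: big-endian u16, must be exactly two bytes.
fn parse_u16(data: &[u8]) -> Result<u16, &'static str> {
    if data.len() != 2 {
        return Err("need exactly 2 bytes");
    }
    Ok(((data[0] as u16) << 8) | data[1] as u16)
}

// Tiny deterministic LCG so the sketch needs no dependencies.
fn lcg(state: &mut u64) -> u64 {
    *state = state
        .wrapping_mul(6364136223846793005)
        .wrapping_add(1442695040888963407);
    *state
}

fn main() {
    let mut state = 42u64;
    let (mut accepted, mut rejected) = (0u32, 0u32);
    for _ in 0..1000 {
        // Random length 0..=4 and random bytes: the "fuzz" input.
        let len = (lcg(&mut state) % 5) as usize;
        let input: Vec<u8> = (0..len).map(|_| lcg(&mut state) as u8).collect();
        // A panic or hang here is what the fuzzer hunts for;
        // a clean Err is the parser correctly rejecting garbage.
        match parse_u16(&input) {
            Ok(_) => accepted += 1,
            Err(_) => rejected += 1,
        }
    }
    println!("accepted={} rejected={}", accepted, rejected);
}
```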
Nothing is stopping you from running all of that on rust code though. In fact, you probably should just for sanity. Compilers have bugs. Libraries have bugs. At the lowest levels there’s unsafe somewhere anyway. Rust doesn’t remove the need for those things as good engineering practice. It just makes sure that the danger of failing to do so is minimized. Also, when there is a failure, it’ll be limited to very specific code paths (barring compiler bugs which themselves aren’t thaaat common all things considered).
Does Rust stable support the commonly-used sanitizers (ASan, TSan, UBSan, etc.) though? The last time I checked, it was still a work in progress
(https://github.com/rust-lang/rust/issues/39699).
It’s not stabilized yet but it runs fine on x86_64. I don’t really see why it not being in stable is a huge blocker. It’s a bit inconvenient, but you can always rebuild your project with nightly, no?
That's beside the point. The OP was claiming that the article author is showing hubris, and I was just pointing out that, assuming their claims about what tools they use are all true, they are in fact showing more care than any project which relies only on the compiler and runtime system for memory safety.
Most of those tools check runtime properties of your code. If your test suite is perfect, that gets you quite far. I don't know of many nontrivial projects with perfect test suites.
Without a good test suite, there are often security issues that don't depend on memory safety violations. Also, I did mention a static analyzer among the tools, which will likely be able to do analysis quite similar to the Rust borrow-checker (but with an order of magnitude more false positives, which the OP claims they are fixing anyway).
This! If you really care about security you should educate people about how to use available tools to analyze C instead of relentless Rust evangelism.
People arguing that "you can not ever write safe C" might also come to the conclusion that one "unsafe" somewhere in your Rust code potentially means that the whole code base is immediately at the same level of implicit memory safety as C.
> People arguing that "you can not ever write safe C" might also come to the conclusion that one "unsafe" somewhere in your Rust code potentially means that the whole code base is immediately at the same level of implicit memory safety as C.
No, the fact that Rust forces you to write "unsafe" to disable certain safety mechanisms makes the code that needs auditing easier to locate and reason about.
IF the code in the "unsafe" block does not actually isolate the "unsafe"-ness THEN conclusions can be drawn about the rest of the code.
Contrast that to C where the code introducing unsafeness, UB and unsoundness could be basically anywhere instead of a convenient "rg/grep unsafe" away.
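A tiny illustration of that encapsulation (the function is invented for this comment): the unsafe surface is a single greppable block behind a safe API, and the SAFETY comment states the exact invariant an auditor needs to check.

```rust
/// Returns the first byte of `s`, if any.
/// Safe wrapper: callers cannot misuse the unchecked access.
fn first_byte(s: &str) -> Option<u8> {
    if s.is_empty() {
        None
    } else {
        // SAFETY: we just checked the slice is non-empty,
        // so index 0 is in bounds.
        Some(unsafe { *s.as_bytes().get_unchecked(0) })
    }
}

fn main() {
    assert_eq!(first_byte("abc"), Some(b'a'));
    assert_eq!(first_byte(""), None);
    println!("ok");
}
```

`rg unsafe` finds the one block; everything outside it is checked by the compiler.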
I don't agree at all. Developing in Rust (or other memory-safe or mostly-memory-safe languages) is going to be orders of magnitude easier than writing things in C and then running through several dynamic and static code analyzers and quieting the mountain of false positives you'll see with those, plus the massive test suite that you need to make any use of them.
I was just pointing out that, in the case of the particularly diligent OP, he is probably getting an equivalent level of safety - for much more effort though.
To me writing C is orders of magnitude easier because I almost instantly can see in my mind the resulting assembly code generated by the C code. In comparison Rust feels very opaque (e.g. A simple writeln expands into a rather large macro and who knows what the compiler is doing with that). I'm sure with more experience you can get better but why bother.
Even with just static analysis and some sane coding conventions you can get C on the same safety level of Rust. I see Rust as a huge waste of everybody's time and resources and far too opinionated in many regards.
Your experience is obviously your experience. However, I don't think for the majority of people (and definitely not for myself) that knowing what assembly results from my code is in any way an aid in checking whether it's correct or at least secure.
Either way, x86_64 assembly code is a high level language that gets interpreted by the CPU at runtime, so not sure how relevant it is to anything low level anymore.
The author doesn't seem to disagree with your view:
> The final reason is the only one that justifies my decision: my code will be rewritten in a memory-safe language. [...] It’s my own language. It’s called Yao.
> This is the only reason I could justify that decision. All of the other reasons above are excuses. This is the real reason.
> You can bet that I do not want to accept liability for a project written in C. It will be rewritten as soon as possible.
Which is fair. The author also highlights an interesting problem: Some of us will opt to not use Rust, even when we should, simply because the language doesn't appeal to us.
I love the idea of Rust, but find the syntax to be just awful, and the entire thing just overly complex. I'm not saying that Rust isn't the way it is for good reasons, but the code is often hard to read. Some of the syntax choices are in my mind questionable and serve only to make Rust programmers appear smarter than the rest of us.
Rust shouldn't try to accommodate people like me, because it will make everything worse for everyone. Instead I should look elsewhere, maybe not to C, I'm not that clever, but to languages like Go, Nim, Crystal or Hare. It's important that we have choices, because even the best designed language, and I do consider Rust a well designed language, will fail to appeal to everyone.
Yes, that was a very good read and helped moderate my annoyance with Rust. I couldn't quite figure out whether the "unfolding" or rewriting of the examples had any negative impact on performance. If it doesn't, I fail to see the benefit of the rather convoluted code used in the examples.
I wrote a few programs in Nim to get a feel for the language; coming from a background in mostly Python, I found it pretty easy to pick up the basic language structure.
The language is just an interface between humans and the machine to make it easier for us humans - machines aren't all the same so why should humans be?
"Some of us will opt to not use Rust, even when we should"
Nobody SHOULD use Rust.
Some may use it, but nobody should.
And because of this "should" many will not use it, especially when you consider that the language is conceptually overweight, and there was already one "very popular" conceptually overweight language, Scala, which now only the old-timers remember (laughs).
You need not be so offended, it's not even about your text.
You can search for the following query: "why you should use rust".
This informational noise is very disturbing and creates a somewhat distorted perception of reality, because it trivializes the choice of a very complex instrument (which is just a small part of a huge toolset). And thanks to this noise, the obvious boundaries between the clearly different "must" and "can" are lost.
Seriously. The post should've just been the "Rust Isn’t Fun for Me" section. That's the reason, everything else is just rationalizing.
To be clear, I don't mean to disparage the author's decision. Programming can be an artistic endeavour, where personal enjoyment trumps most concerns. If programming C is what keeps you motivated, go ahead and use it! Just be aware that this may make your software risky to use and rejected in many real-world situations.
The "Rust Evangelism Strike Force" isn't actually a thing that exists. From my understanding the term was created as a self-deprecating meme by some Rust users to describe other Rust users. It's not a core policy and basically all serious Rust users don't do any of the alleged activities claimed to happen. They're too busy actually writing software.
Er, it's not a formal thing, but it's absolutely an emergent phenomenon that exists. Every single discussion on HN about either software vulnerabilities or C will have comments asserting that everything must be rewritten in rust. I would believe that it's not from "serious Rust users", but someone will show up to say it.
> The "Rust Evangelism Strike Force" isn't actually a thing that exists. (...) It's not a core policy and basically all serious Rust users don't do any of the alleged activities claimed to happen.
If that was remotely true how do you explain the Actix episode?
Some over zealous Redditors doing Redditor things. That's where it started anyway. There was no coordinated effort. Probably some people from 4chan as well.
> Some over zealous Redditors doing Redditor things.
Except that this was not a Reddit problem, was it? This was a problem created by the Rust community that showcases exactly the "Rust Evangelism Strike Force" behavior we're talking here.
Would it have made it a Mastodon problem if the Rust fundamentalists had opted to use that to drive their relentless attacks? It wouldn't, would it?
I mean, it surely wasn't a GitHub problem when these Rust evangelists started harassing the maintainer through bug reports, which was the place where it all actually took place.
By "people threw a fit" you mean the Rust community put up a relentless harassment campaign targeting the project author, even to the point where they involved his employer, and ultimately forced the maintainer to give up on FLOSS and abandon the project altogether.
I mean I saw it on Reddit, guy got panned for abusing unsafe and bunch of unsafe were patched out. That's it as far as what I am aware of.
> Rust community put up a relentless harassment campaign targeting
Don't judge community (especially one you don't know) by their worst member.
The Rust community didn't all agree and make a pact to harass/dox that guy. A few assholes did.
Trying to generalize this behavior to the entirety of the Rust community is disingenuous. Imagine if I started saying all programmers were wife killers, because some programmers killed their wives.
> Don't judge community (especially one you don't know) by their worst member.
The community was massively represented in that harassment campaign targeting Actix. We can still read the threads and the bug reports that fueled that shit show. Things got so bad that distinguished members of Rust's core team felt compelled to put on their PR hats to work on salvaging Rust's reputation and do damage limitation.
It was not bad apples. It was a hallmark of Rust's community.
I don't believe in collective guilt. If 10 or 20 or 200 assholes out of a group of 30,000 do something stupid, do you sentence the 30,000 or the assholes, even if the rest of the community has distanced itself from it? You are generalizing the actions of a vocal few to the entire group.
Also, sure Rust people care overwhelmingly about Rust crates.
Any large enough community will statistically have some percent of "bad apples". You're arguing as if those bad apples are a majority. And not a statistical footnote.
> I don't believe in collective guilt. If 10 or 20 or 200 assholes out of group of 30000 do something stupid, do you sentence the 30000 or the assholes, even if the rest of community has distanced from it?
You mentioned Reddit. Rust's subreddit had at the time over 80k subscribers. The subreddit massively piled on the maintainer of Actix. This harassment campaign was so massive that core Rust members felt obligated to write public declarations about how Rust should invest in community building and, with the Actix case as an example, whether they could actually reject contempt culture.
It was not an isolated bad apple. Rust's community is renowned for being toxic, hostile and abrasive. This topic comes up time and again even on HN. It's not possible to hide this fact.
I don't think people ending up on the wrong end of it particularly care if it's an actual organization or just an emergent "feature" of doing things in a space somewhat adjacent to Rust.
It was actually coined by the n-gate guy as a description of how some Hackernews descend upon a thread to advocate for Rust, typically proposing that something be rewritten in the language because muh memory safety.
The Rust community just picked it up as self-deprecating humor.
No. The reason is that everyone who works with Rust chooses to do so, because it is new and jobs are few and far between, as opposed to the billions of legacy code lines in e.g. C++ that nobody likes but companies need maintained.
Hating everything except a particular language is an excellent reason to use that particular language. He didn't explain why everybody should use C, he explained why he uses C.
The time spent working with C is just used as a proxy for the switching costs. He's probably overly conservative here, but I'm not seeing Stockholm anything there.
I laughed a bit when I saw “memory safety” and C in the same sentence. Your program is only as good as the programmer writing the program. Having said that, it just takes one missed check to cause a buffer overflow in C.
The only flaw in the original paper was the inclusion of Rust as a safe language. Anyone who understands what it really does knows that you can indeed write unsafe code in Rust, unlike Java for example. Rust makes it easier to write safe code, but doesn't prevent unsafe code from being written, which puts it back in the category that should be avoided by the masses.
Moreover, Rust is extremely complex, similar or worse than C++. A programmer has to learn a lot of alien concepts that they don't need when using Java, Go, or similar safe languages.
Rust is absolutely a safe language, and in more ways than just memory safety. The only way to get unsafe code in rust is to opt in to unsafety with a keyword that can easily be checked for, so that you can decide for yourself to enforce that no code in your project uses it.
And unless you are writing a few specific types of code, using unsafe usually isn't ever necessary at all.
The complexity is also very much not an issue in practice. If you rub up against complexity in C++, you might not even know it. UB could be silently ruining things. If you rub up against complexity in rust, you were either just saved from yourself, or you are hitting part of the language that is probably undergoing improvement. But you never have to worry about complexity silently making things worse.
That's my point, if Rust is as complex as C++, it is not a good choice compared to the other languages.
In fact, many people still avoid C++ and use C exactly for this same issue, C++ is too complex to use. These people who use C will justifiably have the same argument against Rust, it is too complex, and migrate when needed to something else like Go.
If a less complex language meets your requirements, by all means use it. But I still have no idea how that is relevant here. C doesn't give you safety and less complex languages don't give you performance and control. That's why comparing rust to c++ is useful.
By the way, I think the higher complexity of Rust over C is also not a big deal in practice, for much the same reason I gave for Rust vs C++. But that is also irrelevant here.