All really good things to mention, but the data races thing is a big deal and gets only a short reference, and where is the bit about the borrow checker?
The way I see it the things mentioned are nice appetizers and the data race and borrow checker are the main meal.
IME the most frustrating problems are not that you forgot to exhaust the switch statement or didn't initialise a new field, it's when you get a segfault that's hard to reproduce and you have barely a hint about what caused it.
For my first C class in uni I wrote a curses-style CLI spreadsheet tool with a csv parser, reverse cell reference table, automatic modification-based recursive recalculation, a basic formula parser, an infix notation to reverse polish notation "compiler", and a stack-based RPN evaluator (I may have gone a bit overboard on that assignment). The one multi-user unix server for all students that I wrote it on (named 'foley', some CS reference I think) had an ancient gcc compiler that compiled it just fine, but when run it would segfault. Hardly surprising, I created hundreds of segfaults, but this one would only segfault if I used -O0 and didn't add any logging statements. That's right, it ran fine with any level of optimization above 0 or if I used any logging statements, and only segfaulted on -O0. Utterly infuriating and flagrantly defied my expectations. I got help from some patient soul I found on freenode ##c (I wish I could remember their name) who didn't see any obvious issue or warning and couldn't reproduce any segfault with their much more recent compiler. -Wall+++, asan, nothing cracked it. Everything else was exhausted: it must be the compiler. But I couldn't just update the compiler package because it was a multi-user server and I had zero permissions, so I ended up bootstrapping a 10-years newer gcc through multiple intermediate versions and installing it in my profile (quotas be damned) and used that. It didn't segfault on -O0 anymore. I haven't voluntarily used C since.
This is not a good anecdote - You hit a 10 year old bug because your sysadmin was too lazy to update gcc and then formed a bad impression of the software?
Guess what, if you install a 10-year-old version of Rust, none of your code would compile at all. Your logic is so flawed it's hard to believe you are being serious.
If anything it's impressive that a 10 year old C compiler worked most of the time when most other languages are not stable on these timescales.
The oldest rustc version I target (for a tiny project, admittedly) is over 6 years old, and it's fine. When code works it also works on the latest release. When code doesn't work I get a reasonable compile-time error message, not a mysterious segfault at runtime.
It's not easy, even in a few hundred lines I had to remove many conveniences to go that far back, but it's very legible. The compiler has my back, I'm never confused about whose fault it is.
Rust is young, but I don't expect this to be any different in 2030.
That's not related to C, except in the sense that GCC is written in C. GCC was just emitting incorrect code because it had a bug in its code generator. If there's a bug in the LLVM code generator (which is written in C++), rustc might end up doing the same thing.
If you write the compiler in Rust it could still happen. Rust code (that compiles) has fewer bugs than the corresponding C, but not zero bugs.
It doesn't sound like you went overboard, it sounds like a good project. Maybe you should have written it in Perl 5.001 though.
Because it was C, I couldn't be sure it was the compiler's fault until I had really exhausted every other option. We maybe spent 100h between a few people getting to the end of it. If I had written it in safe Rust (I would have, I wasn't doing anything weird) and it segfaulted, I'd have known immediately that the compiler was the problem, because you can't make safe Rust segfault.
Or he could do the normal thing and run valgrind to instantly see exactly where the problem is. I don't even disagree with the premise, but these are straw man arguments.
Valgrind didn't exist then. (Next you're going to say ASan and UBSan. Those didn't exist either.) They (he?) could have run GDB and disassembled the code around the crashing instruction and verified that the machine code there didn't make any fucking sense. But that implies they knew assembly language well enough to recognize when their compiler was spewing nonsense, which is a couple of steps above "can read working assembly emitted by the compiler" or even "can write applications in assembly language".
Even with Valgrind it can be tough to distinguish whether your output code is wrong because of a bug in the compiler or because your code was just wrong. There was a change to GCC a few years back that introduced security holes into the FreeBSD and NetBSD kernel because it just threw away code that would only be invoked in "undefined behavior" scenarios. Signed integer overflow I think.
You are unfairly comparing ancient C with modern rust - Try comparing modern C with modern rust to avoid making a straw man argument. Like I said, I don't even disagree, I still think you are just doing everyone a disservice with this line of reasoning.
He said if he had used Rust - I am saying that if he had used C when Rust was available: 1. the bug would have been fixed; 2. he could have used Valgrind. Modern C also has something called UBSan, and another thing called Frama-C.
These tools may be inferior to what rust has, but ignoring they exist or comparing 10+ year old C with modern rust is a bad faith argument.
I agree, I don't think that a bug in GCC last millennium is a very strong argument for using Rust instead of C today, though such arguments do exist.
However, I also don't think they particularly help you to distinguish a compiler bug from an error in your understanding of the language semantics, although they sure do help a lot with everyday errors.
With respect to questions of inferiority or superiority, keep in mind that Valgrind and UBSan are only dynamic checkers; they don't help at all with errors that don't occur in your testing. Frama-C is a static checker more similar to Rust's capabilities, though much more limited, but also with cscope-like abilities for reverse-engineering existing source bases.
The great advantage these three tools (and ASan) have is that you don't have to rewrite your C in Rust in order to use them.
I agree because having a type system that directly provides the guarantee that whole classes of runtime error cannot happen provides fast feedback during development at a low cost.
I disagree because even in a push-button (+ tuning) approach, you can prove things with Frama-C that the Rust compiler cannot prove (which is why Rust has runtime bounds-checking, implementation-defined behavior for integer overflow, and so on). But also because you can prove much more advanced properties than "just" the absence of runtime errors.
I don't see any inconsistency. Universities still have multi-user unix servers for students in 2022 - at Bordeaux there's a few 64/128 thread ones with as many gigabytes of RAM, which leaves a lot of room for doing work during class. (And most likely they would be running some Debian LTS like Jessie or Stretch)
Sure, I'm using a multi-user Unix server right now in another window. The key point was that there was just one server for all the undergraduates. (And that they had to worry about a private install of GCC using up their disk quota.)
Yeah, you can't build modern GCC inside the 1GB quota, but you can probably build it in /tmp or /var/tmp or /scratch or something. The actual built compiler is under 50 MB in GCC 8. And if you're really cramped on quota you can gzexe it, at the cost of painfully slow startup times.
20 years ago nearly all universities could afford more than a single Unix box for all the undergraduates to share. At that point it was mainstream to buy a cheap US$300 ATX box and run Linux on it, so you could have one per student or one per dozen students, rather than one for the whole undergraduate student body.
The Rio Receiver came out 21 years ago. That was a Linux box sold as a consumer electronics stereo component; it grabbed MP3s over Ethernet to play them. I don't remember how much it cost but it was less than US$300.
But apparently the original author tried using asan? That seems like an inconsistency in the story.
Valgrind only outputs errors on codepaths that actually ran. Sure, a deterministic segfault it would catch, but let's say you have a race condition and only get a segfault 1 time out of 100, and it may not even happen under Valgrind?
Yes, Perl may have been a better choice for the project's sake, but alas the course was apparently a trial-by-~fire~segfault of C + string processing. Loved PCRE though... hmm, I wonder if you could do it with just a regex...
I will be very surprised if you can write a working spreadsheet in a PCRE regex. Perl itself permits you to embed arbitrary Perl code in a "regex", though.
There are small parts of GCC that are written in C++, but at the time when universities had "one multi-user unix server for all students" it was 100% C.
I had pegged it as between 20 and 30 years ago, but someone pointed out that the author said they tried using asan. 40 years ago most universities didn't have Unix boxes accessible to random undergraduates at all.
In addition, most of these checks are provided in other languages or linters for those languages. E.g. C++ has had RAII before Rust has existed. C++ (with -Werror), TypeScript and most FP languages have exhaustive switch checking. Clang can catch many (but not all) initialization errors. The places where Rust adds value (data races, memory safety, explicit unsafe) are not discussed in the article.
Rust also adds value in providing so many of the checks where most languages only have a subset, and in being incredibly consistent about having safe defaults.
Not untrue, but the post doesn't mention prior work, so it's implying that Rust is one of the only languages that has these features, whereas even C++ (notorious for being unsafe) has a majority of the features (in Clang). Clang can even catch many data race issues with the proper annotations. Maybe it's ignorance, but given how posts proselytizing Rust tend to get popular on HN, it makes me suspicious that the omission was intentional.
> In addition, most of these checks are provided in other languages or linters for those languages.
Linters are inferior to things the compiler catches for the same reason test cases are inferior: They are not enforced. If the compiler doesn't allow something you can say with confidence you will never see it in the wild. With linters, compiler switches (-Werror) or test cases you'll have no idea.
I would like to see more about atypical language protections.
GCC has many little-known flags that protect against very specific things.
Kotlin is the first popular language to support structured concurrency, eliminating many classes of concurrency errors.
It's basically "RAII" for threads, coroutines.
tailrec and DeepRecursiveFunction allow for making any recursive function stack-overflow safe.
I've seen people who state their love for Rust and then fail to explain the difference between passing an argument by value, reference or mutable reference.
I mean, those are two separate things. Depending on how you code, Rust can be similarly high-level to Python, it can be much easier to design with types than in C++, and it has great package management with cargo.
You can find plenty of reasons to love Rust, without even getting to the technical details.
Not really. Once you start something more complicated than hello world and, for example, try to mutate things, it gets 100x more complicated than Python.
In fact, Rust is MORE high-level than Python and most languages. That is exactly what the borrow checker and all the other things are.
What it is not is "simple" like Python.
I now code fully in Rust and it is the most productive language of my 20+ years doing this. My other picks: Python, Delphi, FoxPro.
But it is not productive in exactly the same way Python/Delphi are. It has other axes. So for some tasks Python is unmatched and is my main pick for scripting or brainstorming stuff. But Rust is the best at being overall productive.
Where it is not (today) is where there has been a lack of time/resources to get there (like GUIs; but honestly, nothing compares to FoxPro/Delphi)...
Amen. Every so often I boot up Lazarus and lament what could have been. Even the best web-dev frontend tools and frameworks do not come close.
I wonder if there is even a market for those sorts of tools today, though. I wasn't there for the decline of Delphi (and FoxPro, et al), so I've no real reference as to why they declined.
I think all the stuff about nocode/lowcode/Jupyter/etc. says the market exists, but it is going down an overly weird route, as if nobody knows what these tools can do.
No, this low-high level axis is not really “permeable”. You can maybe get as productive in rust as in some managed language at the first time you write the program. But later maintenance will simply drop your productivity as any refactor will likely alter your whole memory/ownership model and cause much more refactor recursively. Sure, you may say that “but it will be correct in the end while in python you have to hunt down the newly created bugs” and you might be right, but it is not a small price to pay. Managed languages let you get away with many many unanswered questions that you have to meticulously answer in case of low level ones, and I think it is a fair tradeoff depending on use-case. But as much as I love Rust, I won’t choose it for some database-mangling web service.
> But later maintenance will simply drop your productivity as any refactor
This is a very weird take: refactoring in Rust is one of its strengths as a language (a major one!), where even VERY deep changes can be done in economical time.
Trying the same in Python is nearly impossible.
And I mean it: I fully rewrote my whole app (a fairly complex niche eCommerce platform) to async and remade everything along the way.
And I have A LOT of experience building, rewriting, and porting code; Rust is the fastest and friendliest to refactor, both large and small, of anything in all my years.
However, I concede that getting up to speed with Rust has its challenges, and my first attempts at this (when working on my hobby language, https://tablam.org) were miserable for lack of understanding, so that could be a factor.
That's a valid criticism. While I can be much more productive in .NET with the help of the language, the compiler, the IDE, IntelliSense and Copilot, looking at the latest TechEmpower benchmarks, Rust is still 1.6x faster.
I would use Rust for the web if I had to squeeze out every bit of performance. Most of the time I don't have such a need.
This is what I mean, I ported some Python code almost 1:1.
If you program in a functional way anyway, i.e. having data go through a series of filters, it is quite easy. But not all ways of programming are so easy to port to Rust.
> try to mutate things, it gets 100x more complicated than Python.
I don't know about 100x, using Box and mutexes it is pretty much as complicated or maybe 2x. If you want it to be efficient and leverage all of Rust then sure, but that wasn't what they were meaning.
One tiny program does not a realistic comparison make, of course. There’s tons of situations where either language can make things more complex than this example. But for basic tasks it’s not particularly complicated.
For reference I'd say the "current async framework" is either Starlette [1] (low level) or FastAPI [2] (high level) for web stuff, and trio [3] for more stupidproof general async than twisted. IMHO of course!
I'm not sour about people enjoying things. Would you not feel peculiar about people who are really enthusiastic about a sous vide machine and use it to heat water for their tea? And when asked about their favourite cut of steak they like to sous vide, they'd stare blankly and proclaim that they are vegan? There's nothing wrong with using tools any which way you want, but the raison d'être for Rust is fearless concurrency, and the distinction between borrowing and moving, with the addition of the Send and Sync marker traits, is kind of the whole point of the language. I agree that it could still be a good language without these, but if it wasn't for these features, I doubt there'd be nearly as much enthusiasm behind it.
Rust is slightly unusual in that most users/potential users of it tend to be very vocal about it. It would be nice if their knowledge matched their enthusiasm.
like that's how you get the knowledge, you're enthusiastic about a thing and then spend a bunch of time with it and then you build up knowledge. it's healthier in that order.
> For all the other scenarios of distributed computing or OS IPC, it does very little.
It might be worth clarifying that catching data races isn't just about catching race-conditions-which-are-probably-bugs. Data races are full-blown undefined behavior in the C/C++/Rust memory models, which can (not necessarily frequently but under the right conditions) corrupt memory just as badly as a use-after-free or an out-of-bounds write.
Agreed, however when talking about how great Rust deals with those, we should not forget it is only one use case among many others where Rust currently doesn't do any better than the competition.
Especially if we take into consideration that going back to processes is the only way to prevent certain exploits.
> It may sound stupid, but you can't have unhandled exceptions if you don't have exceptions...
> panic!() exists in Rust, but that's not how recoverable errors are handled.
This is the worst argument in the whole article, and this is the worst part of the language. Everyone says it's not like exceptions, but in fact it is much worse. Panic is stringly typed and you can catch_unwind it, just like with try/catch in any other language. And the actual worst part of it, you will never know if a panic can occur in any of the underlying functions until it is too late. Developers be damned if they want to choose different behaviour other than crashing the whole program.
Either double down on using the standard error handling everywhere, or put something like "throws panic" in the function signature (a la Java checked exceptions). Many parts of the language have strict checks for everything; why does panic have to be the outlier?
It's not like exceptions because it's not used like exceptions. You only use panic if you want to crash the whole program. If you don't want to crash the whole program, you don't use panic. You do not want to crash the whole program if the user data failed to validate, so you do not panic in that case. If a library panics on invalid user data, that's a pretty serious bug.
I've been programming in Rust since it came out, and a couple of those years professionally, and I don't think I've ever seen anyone use catch_unwind. Maybe once in a test case?
To be concrete, let's talk about an example of a panic. Say you want to access the 3rd element of a vector. There are two cases:
1. You're not sure whether the vector actually has three elements on not. In this case, you call `my_vector.get(2)`, which returns an Option, and you handle the case where it's present and the case where it's not. This is standard error handling.
2. You are sure that the vector has at least three elements. Perhaps you just checked its length for some other reason, or you are careful to maintain this invariant, or you just constructed this vector by pushing 5 elements onto it. In this case, you would typically use `my_vector[2]`, which panics if the vector is too short.
For #2, the thing to notice is that this function literally never panics, under any input whatsoever if it is written correctly. Should that fact really clutter up its type signature, either by forcing it to return a Result type or by forcing it to have a "throws panic" marker?
EDIT: This is for a function that uses a possibly-panicking operation, `my_vector[2]`. There are also the functions that define a potentially panicking operation, like the vector indexing function itself. You could put a marker in the type signature of those, that would be reasonable. Though it would only be for users; the compiler wouldn't care.
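To make the two cases concrete, here's a minimal sketch (the values are made up for illustration):

    fn main() {
        let my_vector = vec![10, 20, 30];

        // Case 1: not sure the element exists, so handle both outcomes.
        match my_vector.get(2) {
            Some(value) => println!("third element: {value}"),
            None => println!("vector is too short"),
        }

        // Case 2: we just constructed the vector with three elements
        // ourselves, so indexing cannot panic here.
        let third = my_vector[2];
        println!("third element: {third}");
    }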
Wasn't this argument used all the time by the Go community? I.e., only use panic when you intend the program to halt, and handle all potential problems with the error type.
I think Rob Pike even said it's easy to see where a program fails, in one of his talks?
But to me the superb thing about exceptions, is that error handling can be done where it makes sense. I.e. we can try{ problem-code }catch(problem){ handle problem } in a single location. Otherwise we end up peppering the entire code base with a ton of error checking far down the call stack, where we really cannot do much about the problem anyway (unless we are writing command line tools where error handling is just writing the problem to stderr).
Exceptions give us a nice way to let problems bubble up to the surface, while also stating what the problem was, and where it occurred. That is great IMO.
> Exceptions give us a nice way to let problems bubble up to the surface, while also stating what the problem was, and where it occurred.
Work is ongoing on some of this, but there are popular libraries in Rust (like `anyhow`) that let you attach backtraces to regular errors, add context, etc. Propagating errors to callers is handled with the standard `?` operator, which means "short-circuit and return this if it was an error, otherwise give me the successful result". This has the benefit of making early exits explicit, without interrupting the visual flow of straight-line code.
The simple explanation is that Rust has an equivalent of Java-style exceptions of "throw here, handle elsewhere", but has a different syntax for this. Instead of try/catch, there's a `?` operator to return ("rethrow") the error to an outer scope. It's a better fit for Rust's use of a generic Result type, but overall its usage is similar to the checked exceptions in Java.
Because Rust uses the type-safe explicit Result/? approach for all non-bug failures in the program, the implicit panic (that behaves similarly to RuntimeException in Java) is reserved for assertion failures and crashes only.
`catch_unwind` is not guaranteed to work in Rust. There's a setting to disable it and always hard abort() the whole process on every panic. Rust is serious with panics being for programmer's bugs only, and not trivialities like "file not found".
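A rough sketch of that `?` style (the function and error type here are hypothetical):

    use std::fs;
    use std::io;

    // `?` short-circuits: on Err it returns early from this function,
    // otherwise it unwraps the Ok value and execution continues.
    fn read_config(path: &str) -> Result<String, io::Error> {
        let contents = fs::read_to_string(path)?;
        Ok(contents.trim().to_string())
    }

The caller sees an ordinary Result and can match on it, bubble it further up with another `?`, or attach context with a library like `anyhow`.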
Thanks for the explanation. I guess most languages need this feature, i.e. fail anywhere below this call, then return the error type + where it occurred + a stack trace. I've even heard of people doing stuff like that in C, where they store the stack trace in a list they populate and return errors as part of the same struct, in order to have something similar to exceptions.
I have to say I really enjoyed Rust when it was in its infancy (version 0.1 - 0.2 or thereabouts), but have since fallen off. It used to be so simple and so clean, and unlike anything else. Today it's just way too complex for me :-)
> But to me the superb thing about exceptions, is that error handling can be done where it makes sense.
Not necessarily. With exceptions, it is easy to be the cause of an error and just throw, expecting something up the stack to handle it. Which of course has no idea how to, since it didn't control the cause in the first place.
Forcing error handling as near as possible to where the error can happen prevents this.
Actually, up the stack is usually the only place that knows how to handle the error. For instance sometimes dumping to stderr is the right thing to do, other times logging it, other times displaying a generic crash GUI, sometimes display a customized UI. There may also be times when the exception can be handled in a better way, with fallbacks for example.
The Rust/Go approach always makes me laugh. Normally in engineering or anything where reliability matters, panicking is understood to be a bad thing to do and people go through extensive training to ensure they don't do it. Somehow these language communities decided that panicking and giving up on the spot is a smart behaviour.
Panic is idiomatic error handling. Take something as basic as indexing into a list. Get it wrong and Rust will panic.
Sometimes exiting the program is the right thing to do.
Yes, but it's very rare that the code where something went wrong is in the position to decide that. The survival of the entire process is not a decision to delegate to every possible line of code or library author.
Consider a very common case where I benefit from exceptions every day - my IDE. IntelliJ hosts a bazillion plugins, of varying quality. These plugins like to do very complex analysis on sometimes broken and incoherent snippets of code, that may be in the process of being edited. In other words it's a nightmare to correctly predict everything that can go wrong.
Not surprisingly, sometimes these plugins crash. And you know what? It doesn't matter. A lot of the code is just providing optional quality-of-life features like static analysis. If one of them goes wrong, IntelliJ looks at the exception and figures out which plugin is likely to blame, it examines the type of error and maybe gathers editor context, it can report it easily to a central server that then groups and aggregates exceptions based on stack traces. Meanwhile as a user, it doesn't bother me because it's fine to just not have that analysis show up in the editor.
If every time an IDE plugin encountered an unexpected situation it aborted the entire process it'd be insane. The plugin ecosystem could never scale that way. People would be afraid of installing/upgrading plugins and that in turn would discourage people from writing them or adding features to them.
In reality nothing does that because, well, why would you when you have good exceptions? But even so, Java has a way to block that using the SecurityManager. Now that they are deprecating the SecurityManager, "how do I stop code calling System.exit" is one of the use cases they're planning replacements for.
I'm sure there would still be ways to bring the entire process to a halt (for example, spawning thousands of threads with infinite loops). My point is that just because a bad developer wrote bad code doesn't mean that a tradeoff chosen for a language design is necessarily bad.
In reality it's very hard to accidentally write an infinite loop that spawns threads. There's no idiom that would lead to such a pattern and I can't recall ever encountering such a bug in the wild.
Yes, in theory, there are all sorts of ways you can still trash the process with bad code. But in practice, the sorts of bugs that programmers really make in GC-d memory-safe languages are the ones that don't. So, exception based error handling really does come in very useful and Rust probably got it wrong here.
> The survival of the entire process is not a decision to delegate to every possible line of code or library author.
Stated like that, who can really disagree?
I remember when I was writing a bunch of Go when the language was still very new (2009 - 2011). One of the most popular use cases for the language was making websites. All sorts of unexpected problems caused the entire website to go down, due to unexpected panics here and there. The suggested solution from the Go team was to just restart the web-server whenever it was killed by a panic. Surely that cannot be the best way to do it..
>Panic is idiomatic error handling. Take something as basic as indexing into a list. Get it wrong and Rust will panic.
This is not really true. If you are indexing into something that may fail you use the `get` method which returns an `Option` if the index is out of bounds. The index operator is just a shorthand for `v.get(i).unwrap()` pretty much.
Yes, but the problem is that very often a programmer "knows" an index operation can't fail because they haven't thought of a case where it is a different size, or code gets refactored and assumptions are invalidated, etc.
The panic mentality comes from people who have spent most of their life writing C++, in which if anything goes wrong like an out of bounds index, memory might be corrupted in arbitrary ways, and in which you don't have a GC to clean up after you. Writing exception safe code is much easier in type safe GCd languages, and many programming errors end up being recoverable.
> it is easy to be a cause of error and just throw the exception, then expect up the stack to handle it.
Agree, this can happen. Perhaps the bad attempt at fixing this in Java, for instance (checked exceptions), made people dislike exceptions even more. The caller "has to handle" the exceptions or re-throw them, of course. Even though RuntimeExceptions can come from anywhere at any time, so the "guard" provided by checked exceptions just made a complete mess of things. People are lulled into thinking that methods without the 'throws BlaBlaException' signature are safe, and so on.
I guess no language is 100% on everything, but I've always felt that exceptions are one thing I really like; especially when a language manages to do them correctly.
I don't understand your #2 (or your whole point). It's exactly the case for exceptions and how exceptions happen. "You won't get an exception if your code is written correctly and the inputs of your program match your programmer expectations", yeah, but maybe two years down the line someone refactors the code which was resizing the vector before, and now you have the most run-of-the-mill exception. I just hope that someone making an app with your library has a way to catch the panic so that the software doesn't crash, but instead shows a helpful error dialog to the user and makes a backup of its data before softly exiting; otherwise we're really back at the pre-1980 state of the art of software design.
> maybe two years down the line someone refactors the code which was resizing the vector before, and now you have the most run-of-the-mill exception
In other words: there’s a bug in the code, and that bug has now caused an unrecoverable error, panicking the thread. Now the thread has died (or maybe caught the panic to present a friendly error message). Either way, the user is now aware of the bug, and disaster has been avoided.
Of all situations where your app might want to create a backup of its state, why would you choose to do so precisely while unwinding a crashed thread, where all assumptions, bets and invariants are already off?
And what would the helpful error dialog even say? „A problem has occurred and the app will now shut down“? From the user’s point of view, is that really an actionable or helpful error dialog?
> And what would the helpful error dialog even say? „A problem has occurred and the app will now shut down“? From the user’s point of view, is that really an actionable or helpful error dialog?
Yes, literally. This is already much better than anything that gets the spinning ball of death of macOS going. You can even continue if you are running an event-driven app where the error may have happened as part of an event handler (and thus limited to a very specific part of the software).
To give my own experience: I develop https://ossia.io and use this catch-all method. In 7 years I've gotten numerous thanks from the users for it not crashing but being able to carry forward in case some issue crops up in a sub-sub-sub-module. Not a single time I remember this to cause some memory corruption later.
(backing up state is done up to the previous user action but while in my case it works, it's not always practical)
So in this space, you might well feel confident that catch_unwind() is appropriate, although I still think the thread solution is more elegant.
I suspect in reality most of the problems this would catch in OSSIA wouldn't end up as panics in a hypothetical "Rust OSSIA" because of the different attitudes to exception throwing/ panic vs "normal" error flow in these languages and libraries - unless you got really happy slapping "unwrap()" on things when you shouldn't, but sure, it would solve this problem.
As to memory corruption - the problem isn't strictly "memory corruption" but unstable system state. If my underlying cause is that somebody's dubious Leslie simulator blows up when I frob the gain control on it too quickly, restoring exactly the state in which it blew up last time doesn't help me on its own. I need some way to say OK, that was crazy, no Leslie simulator until I save the project and then we can take it gently, which again is somewhere the thread solution is nicer.
>To be concrete, let's talk about an example of a panic. Say you want to access the 3rd element of a vector. There are two cases:
Reality is not that simple; if you worked in this industry you would know. For example, I was building a web scraper years ago and the WebView would crash, since it is C/C++. Instead of doing its job and showing a web page (or a broken web page), it crashed my entire program. The solution was to split my program into a parent program and a child program so this bug does not bring my entire thing down, and I can catch the crash, record the bad URL that caused it, and try again or just skip it.
I would hate to use Rust libraries that would crash my entire program if they for some reason are bugged. In my experience I have found bugs in many popular libraries. So in Rust, if I import, say, a library to resize an image, and the image is corrupted and the library is shit, it will crash my entire program? I would prefer a higher-level language where I can try/catch the image resize function, and if shit goes wrong I can show the user a relevant message, or fall back to some other resizing method.
> I would prefer a higher-level language where I can try/catch
What you're describing is the `catch_unwind` mechanism that Rust does have. Because panics are implemented with unwinding (by default), you can catch them. But it's not the normal error handling mechanism; it's the "oh god an assert just failed, or we just OOMed or something, who knows, most bets are off" mechanism. If you have a main loop that's sufficiently isolated from individual tasks, such that you think you can do something useful with the fact that one of your tasks just vanished in a puff of smoke, then catching a panic coming out of a task might be a reasonable thing to do. That often makes sense in server code, where your main loop might want to keep trucking, or at least gracefully shut down other connections. But for most library code, the right thing to do is to allow most panics to propagate and crash the caller.
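A minimal sketch of that mechanism (the out-of-bounds index is just a stand-in for "some bug deep in a task"):

    use std::panic;

    fn main() {
        // Treat a panicking task as "this task vanished" without
        // taking down the whole process.
        let result = panic::catch_unwind(|| {
            let v: Vec<i32> = Vec::new();
            v[0] // out-of-bounds index: panics
        });
        match result {
            Ok(value) => println!("task returned {value}"),
            Err(_) => eprintln!("task panicked; continuing with other work"),
        }
    }

With the caveat mentioned elsewhere in the thread: this does nothing if the program was built with panic = "abort".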
So for example in JS a correct regex can throw an exception on some input, so in the places where this can happen we can use a try/catch. What do you do in Rust: do you check the return result, and on top of that do you try to catch a panic just in case the regex library is bugged? So you have to implement two error handling methods everywhere to be 100% safe? If yes, it seems uglier to have to implement two ways of catching errors.
Yes, it has happened to me many times to hit bugs when working in the real world: bugs in image libraries, bugs in regex libraries, bugs in PDF libraries, bugs in HTML/XML parsers. So from my experience working with C/C++ and higher-level languages, I prefer the higher-level languages: fewer bugs, almost no complete crashes, and better error reports from the exceptions. I never had the time to try Rust, but I am not tempted so far.
> What do you do in Rust: do you check the return result, and on top of that do you try to catch a panic just in case the regex library is bugged?
Nope, we just check the return result, because libraries usually don't crash and have well-defined error cases. Having a decent type system helps catch all the possible outcomes. In a few years coding Rust, I have never had a single crash due to a library panic, only from explicit unwraps I applied in my own codebase.
Panics are not intended for errors, but for unrecoverable failures. For example, in the Rust std a failing memory allocation will crash your whole program, which is in most cases what you want. For the remaining cases, there are fallible alternatives.
For example: String::reserve vs String::try_reserve or HashMap::insert vs HashMap::try_insert.
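A sketch with the stable String::try_reserve (the sizes are arbitrary):

    fn main() {
        let mut s = String::new();
        // Infallible: on allocation failure the process is aborted.
        s.reserve(1024);

        // Fallible: allocation failure surfaces as a Result instead.
        let mut big = String::new();
        if let Err(e) = big.try_reserve(1024) {
            eprintln!("allocation failed: {e}");
        }
    }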
There is no 100% safe anywhere. Does your JS code handle out of memory errors with try-catch? No, it will abort as if nothing happened at all.
Sure, there are bugs in every code but unexpectedly panicking is considered a bug in a library so in my not too extensive experience with rust libs, these are not the norm at all. So simply writing code where you yourself don’t panic should give you quite a high chance of not hitting this case ever.
>Sure, there are bugs in every code but unexpectedly panicking is considered a bug
Yes, so would you like, say, Firefox to just crash when one of its many dependencies crashes?
You are suggesting (though I am not sure I understand correctly) that only memory errors cause panics? So what if the library reads a file and unexpectedly shit happens with the file; will it crash the program because the developer maybe forgot to return a special error code in this case?
All the filesystem APIs return Results rather than panicking, since like you said it's expected for those to fail sometimes and for the program to handle those failures. It's possible for a library to convert those Results into panics by calling .unwrap() on them, but that would usually be considered a bad design (ok for tests and tiny programs though). So I think you have an important point here, which is that if your application is calling into a library that you worry might have some bad design decisions in it, you do have to worry about it bringing down your process. And maybe it could make sense in some rare cases to try to isolate that library with catch_unwind. But I think most Rust programmers would prefer to just fix the dependency. The fact that you can visibly spot a lot of these conversions in the code is helpful for auditing.
I'm not super up to speed on JS, but I might draw an analogy to Python. Handling a result in Rust is similar to catching an exception of a known type in Python, a very common thing to do. On the other hand, catch_unwind is (loosely) similar to writing a bare except clause that catches every conceivable exception. You can do that, and sometimes it's correct to do that, but in most cases it's a bad idea. You don't want to accidentally suppress errors that indicate a bug in your program.
Thanks. From my experience with desktop apps in a managed language, I always added a global catch for crashes that were not caught or couldn't be caught, and there I wrote the details to a log file. Then I had a menu entry for submitting a bug report: a popup would open and the user had the option to include the log file with the exception information and details like operating system, runtime version, etc. The only thing that brought down this app in the higher-level language (it was an Adobe AIR app in ActionScript 3) was the freaking WebView, because it was a wrapper over WebKit, and that was C++.
These days, doing backend dev, I am forced to move stuff into a different process, but for most stuff I prefer to use binaries rather than libraries. For example, for resizing an image, instead of using the built-in image library that crashes sometimes and brings the script down, I install ImageMagick on the server, write a script for resizing an image, then call that script and check its output; sometimes I had to use the timeout Linux program to kill it if it got stuck on some input file.
If I were to create that image resize library in Rust, I would attempt to catch everything, including panics, and return it as an error result (so only system crashes would be uncaught).
IO errors are generally handled by returning a `Result` type, that contains the details of the problem on error. You wouldn't use `panic` for IO errors. `panic`s are meant for dealing with broken invariants/assertion failures because of a bug in the program.
You need to run it in a separate process. Rust does not have good enough fault isolation features to safely assume a buggy image processor won’t break your app.
* Entering an infinite loop can bring down everything. A separate thread might not, but since Rust provides no way to kill a thread without it cooperating, there is no way to stop a stuck thread without bringing down the whole process.
* Stack overflow is an instant abort, not a panic.
* Double panic, where panicking calls a destructor that itself panics, is an instant abort.
Question: if you are a library/program author, why would you intentionally use a panic and not clean up and return an error? Maybe I misunderstood, and in fact good developers never trigger panics unless there is no way to avoid it, like if they could not prevent it with more checks, or it is impossible to clean up because they already fucked up, wrote garbage into the process memory, and the safest thing is to kill the process.
I think there are a few cases where Rust likes to panic, but different people probably have different opinions here:
- Extremely common operations with dedicated syntax, where introducing error handling would be burdensome. Things like array indexing or arithmetic overflow. In these cases, you usually want an alternative, fallible way to do the same operation.
- Cases where most callers will probably convert the error into a panic anyway. One example of this might be .split_at() on slices, which is bounds-checked just like an array access. Most callers would probably just .unwrap() the out-of-bounds case, and callers who don't want it to panic can easily check before the call, so it's more ergonomic to panic.
- Cases where the only plausible reason for failure is a bug in the caller. For example, the .borrow() and .borrow_mut() methods on RefCell will panic if a write overlaps with another read or write. The caller is almost always expected to statically guarantee that that doesn't happen, usually by making all borrows short-lived. (And here again there are fallible alternatives available.)
An interesting example of something that doesn't panic, but which probably should, is taking a mutex. The standard mutex in Rust includes a "poisoning" mechanism, which almost every caller just .unwrap()s. I think the majority opinion these days is that poisoning should just be removed, but given that it's around I think most people wish it just panicked instead of returning a Result.
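To illustrate the RefCell case above, a small sketch of the panicking operation and its fallible alternative:

    use std::cell::RefCell;

    fn main() {
        let cell = RefCell::new(vec![1, 2, 3]);

        let reader = cell.borrow(); // shared borrow, still alive below
        // cell.borrow_mut();       // would panic: already borrowed

        // The fallible alternative reports the conflict instead:
        assert!(cell.try_borrow_mut().is_err());
        drop(reader);

        cell.borrow_mut().push(4); // fine: no overlapping borrow
    }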
> is it impossible to cleanup because they already fucked up, wrote garbage in the process memory and safest thing is to kill the process
That’s essentially it, yes. My code should never actually panic. If it does, it means the state of the process has become deeply diseased, and attempting to “clean up” is likely to just make things worse. Of course, if it’s safe Rust, then it still won’t write past the end of a buffer or anything disastrous like that, but buggy code is still buggy code and there’s lots of stuff Rust won’t stop you from doing.
One of the more extreme things I’ve done in production Rust code was add a “watchdog thread”. It has a channel that takes unit and receives on a timeout, and the thread doing the actual work is expected to send it a message once a minute. If it doesn’t receive a message within a minute, it hard aborts the process. The default setup is run under a service manager like systemd to make sure it gets restarted, and that failures are actually logged somewhere.
This is meant to solve the problem that safe Rust is a Turing complete language, so is subject to the halting problem. The type checker can prove that you won’t read past the end of a buffer, but it cannot prove that your code will ever actually finish running. Which means, if you have a project like a web scraper that needs high uptime, you need to prevent it from getting stuck somehow.
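A rough sketch of that watchdog (my reconstruction of the pattern, not the exact production code; the timeouts are assumptions):

    use std::process;
    use std::sync::mpsc;
    use std::thread;
    use std::time::Duration;

    // The worker must check in once a minute or the process hard-aborts,
    // leaving a service manager like systemd to restart and log it.
    fn spawn_watchdog() -> mpsc::Sender<()> {
        let (tx, rx) = mpsc::channel::<()>();
        thread::spawn(move || loop {
            match rx.recv_timeout(Duration::from_secs(60)) {
                Ok(()) => continue, // heartbeat received
                Err(mpsc::RecvTimeoutError::Timeout) => {
                    eprintln!("watchdog: no heartbeat, aborting");
                    process::abort();
                }
                Err(mpsc::RecvTimeoutError::Disconnected) => return,
            }
        });
        tx
    }

    fn main() {
        let heartbeat = spawn_watchdog();
        loop {
            // ... do one unit of work ...
            heartbeat.send(()).ok(); // check in with the watchdog
            thread::sleep(Duration::from_secs(10));
        }
    }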
I agree, so am I wrong, or is the issue a community culture thing, where some developers panic too eagerly? Say I make a library for resizing images and I have one public function, resizeImage(options). A good developer would think "maybe my code has a bug and some function from the standard library would panic; I should ensure I catch this so my public function never panics (even if there is no memory, disk or whatever, as an author I should try not to intentionally panic)", where a bad Rust developer thinks "this will never panic unless I made a bug, and if I made a bug I am happy to panic and crash some sucker's program so I get the bug report and fix the bug".
There are always bugs (logic bugs where Rust can't protect you), so why not have a clean interface?
IO errors like running out of disk space would be handled by returning a `Result` type, not by panicking. Often, Rust code/libraries panic on out-of-memory errors because recovering from that isn't a priority for most application code. But if you are writing lower-level or high-reliability code and you do want to handle out-of-memory errors, the Rust standard library (and many third-party libraries) offer alternative fallible memory allocation APIs that return `Result` instead of panicking on out-of-memory.
A good developer will crash the program, as soon as possible, if and only if it has a bug. If you want to write a program that never crashes, then you need to write a program with no bugs.
The reason you don’t want buggy code to limp along after it detects a bug, is that crashing isn’t the worst possible thing.
The worst possible thing is getting stuck in an infinite loop or a deadlock.
> I would hate to use Rust libraries that would crash my entire program if they for some reason are bugged.
You would, but for other programs with other requirements it would actually be beneficial. There's no single right answer, and you should pick the library that follows your particular requirements for that particular program.
You can turn panics into aborts with `panic = "abort"` in Cargo.toml, in which case nobody has to pay for being exception safe (though, to get the full benefits of this, you may have to rebuild the stdlib? I'm not entirely sure here).
I'm talking about the cost paid by the library author for the additional burden of writing exception safe code. Whether you use this downstream doesn't matter, the cost is already paid (in fact arguments like "no one catches" make it worse since the cost is paid and no one benefits).
The only place I've seen people using catch_unwind was in the Sentry library, to catch panics. Needless to say, it was never used, even before we removed all unwraps from the code.
> If a library panics on invalid user data, that's a pretty serious bug.
I swear, sometimes it seems like Rust people are from another planet. What do you think "unwrap" does? It's not used in every library, but certainly in many of them.
There’s no need to talk in a condescending manner.
What they said was correct: an "unwrap" outside of test/prototyping is considered a serious bug. The Rust-loving strawmen you're creating never claimed that every line of Rust ever written is perfect and bug-free.
So you essentially want me to write error handling code nonstop, constantly, all across my functions. Practically after every 5 lines of code there is going to be an unwrap() where I'm not allowed to call unwrap(), so I have to know the details of the implementation, know the error code, deal with the error code, return early from the function, and then gracefully handle it all. Meanwhile in a language that has exceptions I just put a try/catch around all the code I think works fine but maybe not, and I deal with it in a single location in a way where I don't have to care about what the precise error code might be.
Error code programming really seems to be objectively worse for everyone except the compiler writer. Somehow people let themselves get convinced that this is better when it's objectively not.
Rust has syntactic sugar to help you coalesce error handling into returning a single Result. You'd have to check and make sure the library you use doesn't call unwrap willy nilly. As crazy as that sounds it is actually common practice in the Rust community, there's tools that reveal use of unwrap and unsafe in your dependencies.
In the end you don't use Rust because it's so easy and nice to use (unless you come from C/C++). You come to Rust because you want meticulous control over performance, and you don't want to sacrifice safety to attain that.
If that's not why you're using it, I agree you're probably better off choosing Java, it's plenty fast and comfortable to use, especially if you pick modern tooling.
> You'd have to check and make sure the library you use doesn't call unwrap willy nilly.
That statement really resonates with me.
If you use a library, you’re responsible for what it does, just like how you’re responsible for your own code.
That’s what the ? syntactic sugar is meant to solve. It will return at the point with an Option null, or the error variant of Result if the preceding expression’s error can be converted to it.
something.map_err(…)? is quite readable in my opinion, and that is the worst case, when the called method's error type differs from the one your method returns (for a called method with an Option return type you'd reach for ok_or instead). Otherwise it is just a single ?.
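A small hypothetical sketch of both conversions at the `?` site:

    // Converting an Option and a mismatched error type into the
    // caller's Result with `ok_or_else` / `map_err` plus `?`.
    fn first_port(args: &[String]) -> Result<u16, String> {
        let raw = args.first().ok_or_else(|| "missing argument".to_string())?;
        let port = raw.parse::<u16>().map_err(|e| format!("bad port: {e}"))?;
        Ok(port)
    }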
Sure, I do believe that exceptions are superior, but we have to understand that Rust is a low-level language, period. It is very expressive considering its nature, but it will never be as productive as a managed language in my opinion; we have this distinction for a very good reason. If you want maximal control over what happens "behind the scenes", you lose some of the automation that could improve productivity.
How is try/catch any better than the Err/Ok pattern? Code that doesn't handle error cases shouldn't even pass code review. This is exactly why Rust guides programmers down certain paths, to ensure all cases are always handled. If you really don't want to check the Err/Ok on each call, you are free to use '?' to pass that burden to higher functions.
They didn’t know about `?`. My guess is that they read the first page about error handling, where it talks about unwrap and match. They didn’t get to the second page, where `?` is introduced.
Remarkable that people with such little knowledge feel comfortable talking so much.
No, you should not unwrap unless you know it is safe to do so. You should also add a comment why it is safe to unwrap, if it is not obvious.
Many programmers are writing code for sunny weather only, with error handling being something you might add as an afterthought if your code starts to feel a little too brittle.
In my eyes error handling is just as important to do correctly as getting the core of the functionality done, because error handling is a core functionality of any program, especially if we speak of libraries others are meant to use.
Error handling is what differentiates engineering from coding.
What OP meant is that proponents of Rust are often a bit out of touch with reality: Go to github and find a random Rust repo which doesn't use unwrap excessively. And is thus full of serious bugs, according to your wording.
> Go to github and find one Rust repo which doesn't use unwrap excessively.
Consider serde-json, a widely used library to serialise and deserialise json. You asked me to find “one Rust repo”. Ok here it is - https://github.com/serde-rs/json/search?q=unwrap&type=. Of the 22 uses of unwrap, nearly all are in test code or in comments. Of the remaining 3 or 4, they seem safe to me. But maybe they’re not. Could you think of some json input that could trigger a panic from one of those unwraps?
I’ll put my money where my mouth is. I’ll donate $100 to a charity of your choice if you can find that.
But if you can’t, at least have the honesty to admit that you misspoke when you said not even a single repo without “excessive” use of unwraps exists.
Not every use of unwrap is a bug. For example a regex library returns Result on regex construction because the passed regex could be invalid. But if you construct the regex yourself, from a hard coded string, you know it is correct. Then you just use unwrap and it is ok.
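A sketch of that distinction, assuming the popular regex crate:

    use regex::Regex;

    fn main() {
        // Hard-coded pattern we know is valid: this unwrap cannot fail.
        let date = Regex::new(r"^\d{4}-\d{2}-\d{2}$").unwrap();
        assert!(date.is_match("2022-01-31"));

        // A user-supplied pattern is a real error case: handle the Err.
        match Regex::new("(unclosed") {
            Ok(_) => println!("valid pattern"),
            Err(e) => eprintln!("invalid pattern: {e}"),
        }
    }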
The assertion remains true though. Unwrap should only be used if you are prototyping or you are 100% sure it will never actually panic.
It's just like the IndexOutOfBounds exception in Java. Many functions can theoretically throw it, but most libraries and programs do not catch it because usually if it is thrown it means that something happened that the programmer did not expect and therefore the program should crash.
The problem would not be that it is commonly used, the problem would be that it is abused. And I don't see that happening currently.
The assertion that "If a library panics on user data, that's a pretty serious bug" remains true.
If a library is panicking on invalid user data, it is because they are abusing panic, which is a serious bug. Or they just didn't realize that their code could panic, which is also a serious bug.
It panics just like my `my_vector[2]` example does. What did you think `my_vector[2]` did? Libraries use `my_vector[2]` too. I don't get why we're changing topics from one commonly used panicking operation to another.
Unwrap is supposed to be used when the developer knows that the error can't happen.
Or when, if that error does happen, there is no recovery anyway, and the best thing to do is to abort the program.
To be a bit fair, checked exceptions in java also have their 'bypass' system, since Errors are not checked. So you can't be sure whether someone will decide to throw an error in the middle of library code. You still have to catch-all. I'm not saying it's better.
I haven't seen a way to do exceptions better than fully-checked exceptions, but then you have to be ready to have buffer/integer over/underflow exceptions everywhere, or have a fine prover for the absence of runtime errors to 'allow' you not to have them in your signature.
Otherwise having discriminated records (or option types if you prefer) for return and error-handling seems more down to earth, if a bit painful to write.
Frankly, I love Java's checked and un-checked exceptions differentiation even if the standard library is confused about it.
Make logical exceptions (depending on purpose of interface) into checked-exceptions. Make system exceptions into un-checked exceptions. Document in javadoc with `@throws`
A higher level module can wrap and re-throw into the appropriate exception if needed.
Error handling can be done in the desired place instead of scattered across the code.
Yeah, I also believe that Java is the closest to the best error handling I am aware of. Unfortunately though, it is inheritance based which is a bummer here. It would be perfect with sum types though.
> Everyone says it's not like exceptions, but in fact it is much worse. Panic is stringly typed and you can catch_unwind it
I'm not sure which argument you are trying to make but panics are not stringly typed unless you panic with a string. You can use panic_any(MyPayload) and then it panics with that instead.
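For example (a minimal sketch; MyPayload is a made-up type):

    use std::panic::{self, panic_any};

    #[derive(Debug)]
    struct MyPayload {
        code: u32,
    }

    fn main() {
        let result = panic::catch_unwind(|| {
            panic_any(MyPayload { code: 42 });
        });
        // The payload comes back as a Box<dyn Any>, which can be downcast.
        if let Err(payload) = result {
            if let Some(p) = payload.downcast_ref::<MyPayload>() {
                println!("caught typed panic: code {}", p.code);
            }
        }
    }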
Needs a companion blog, "perfectly safe code the rust compiler will nag you about". And the contortions rust programmers go through to avoid that.
Rust is really impressive in a lot of ways. Type classes and pattern matching are a great fit for systems programming.
But they're fixated on the idea that everything possible should be a static analysis error, language ergonomics or usability be damned. I'd much rather these be warnings, because no static analysis on earth is going to stop you from actually needing tests to see if your code works.
> ... no static analysis on earth is going to stop you from actually needing tests to see if your code works.
Correct. However, the intent of Rust is that the code will not fail [1] due to some silly machine thing below your current level of abstraction.
As an example, I'd point to one thing that Rust does that solves a lot of problems: the Vector. This gives you:
- a buffer to load as needed
- bounded memory usage ( no more than 2x what's actually needed, and the ability to tailor it)
- automatic resizing as needed
IOW, eliminates 'C programmers disease' (eg: #define BUFFERSIZE 1024 // big enough for anything :-) )
I am sick and tired of writing either the tedious resizing stuff by hand, or using a linked list [2] (which in today's world isn't performant: a self-resizing array will deal with locality-of-reference issues much better).
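A small sketch of what that buys you (the sizes are arbitrary):

    fn main() {
        // No #define BUFFERSIZE guessing: the Vec grows as needed.
        let mut buf: Vec<u8> = Vec::with_capacity(64); // tailored start size
        for i in 0..1000u32 {
            buf.push((i % 256) as u8); // reallocates automatically when full
        }
        println!("len = {}, capacity = {}", buf.len(), buf.capacity());
        buf.shrink_to_fit(); // reclaim the slack if memory matters
    }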
Disclaimer: I've only peripherally used Rust professionally (ie., corporate innovation sessions, self-contained utilities). I have done enough 'fitness exercise' stuff that I'm confident to comment.
[1] "Fail" as in, do something random like start a cryptominer at root-level-privilege. "Fail to compile" is fine, "panic at runtime" is not fine but not worst case.
[2] Opinion: the reason why linked-lists are an example of something hard to do in Rust is because Vectors make them irrelevant, so why bother?
Vectors by themselves don't make linked lists irrelevant, but they remove 90% of the use cases, and most of the remaining ones would be better served by a hashtable, or some sort of fancy tree. Linked lists (especially as C programmers typically implement them) are absolutely awful for performance.
Something I find interesting about Rust is that we can do those safe patterns, as long as we're willing to lose some performance.
The way I think of it: Rust forces us to choose between flexibility and zero-cost memory safety.
If we choose zero-cost memory safety (in other words, we don't use Rc or unsafe or large Cells) we can't do things like dependency injection, basic observers, backreferences, many kinds of custom RAII, etc. But we do get speed.
On the other hand, if we allow e.g. Rc into our codebases, we can do these patterns just fine, though there is a performance hit.
The final challenge in learning Rust (IMO) is to figure out when Rc is better, and when we can afford the complexity cost of zero-cost memory safety. I've seen a lot of Rust projects move mountains to avoid Rc, and ironically end up adding more run-time overhead and complexity.
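For the flexible end of that trade-off, a rough sketch of a basic observer with Rc/Weak (the `Button`/`Listener` names are invented):

    use std::cell::RefCell;
    use std::rc::{Rc, Weak};

    struct Listener {
        name: String,
    }

    struct Button {
        // Weak back-references avoid keeping listeners alive forever.
        listeners: RefCell<Vec<Weak<Listener>>>,
    }

    impl Button {
        fn click(&self) {
            for l in self.listeners.borrow().iter() {
                if let Some(l) = l.upgrade() {
                    // the flexibility is paid for in refcount traffic
                    println!("notifying {}", l.name);
                }
            }
        }
    }

    fn main() {
        let button = Button { listeners: RefCell::new(Vec::new()) };
        let listener = Rc::new(Listener { name: "logger".into() });
        button.listeners.borrow_mut().push(Rc::downgrade(&listener));
        button.click();
    }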
> On the other hand, if we allow e.g. Rc into our codebases, we can do these patterns just fine, though there is a performance hit.
Interesting... I was using one of the Project Euler problems as an exercise for learning Rust; I found that Rc actually improved performance (guess: by eliminating a lot of copies/moves as things went in and out of scope).
If you try to code Rust like JavaScript by using Rc everywhere, you will quickly run into panics when you try to borrow from a RefCell that a caller has also borrowed from. The limitations of &mut cannot be trivially worked around.
The "trivial" equivalent to JavaScript is not RefCell, but plain old Cell, applied at the level of individual fields you want to write to. No panics and no (relative) overhead there.
RefCell exists more as a bridge back to the world of &mut, since so much of the ecosystem uses that approach. But if you actually have a good reason to write JavaScript-like data structures (e.g. actual interop with a JavaScript-like language) then it just kind of gets in the way.
I don't understand what you have in mind. Here's some basic JS, iterating over elements of a Node:
for (let child of getElementById('parent').children)
func(child)
The equivalent in Rust is "risky" because func might also call getElementById('parent') and then that would panic.
I think you are saying that Cell solves this by temporarily moving all children into a local, iterating over that, and then adding them back as children? That seems pretty sketchy to me: you have these weird transient states where Nodes are temporarily removed from the DOM.
No, I'm talking about matching JavaScript's memory layout and memory management, indirection and sharing and all.
The children field is itself a reference to a collection. Reading the field creates a new reference to the same collection, and that is what the for loop holds. If func grabs parent that's fine, it's just one more reference (in the Rc sense).
That collection also just holds references, and the for loop copies them out too. The trickiest bit here is the iteration state itself- you wouldn't be able to use e.g. Rust's typical slice iterator and still retain the ability to mutate the collection, but JavaScript doesn't do that either. Its iterators just have to be implemented taking the possibility of external mutation into account.
The fundamental tradeoff here is that JavaScript simply puts everything behind a reference, all the time, while Rust allows you to work with objects directly. This is what makes sharing+mutation feel so simple in JavaScript- anything you can do to change the "shape" of a data structure is really only changing a reference, and leaving the old version around in case anyone else was still using it.
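A rough sketch of that layout, as I understand the idea (the `Node` shape is my own invention):

    use std::cell::Cell;
    use std::rc::Rc;

    // Everything behind a reference: the field holds a reference to a
    // collection of references, JavaScript-style.
    struct Node {
        children: Cell<Rc<Vec<Rc<Node>>>>,
    }

    // Reading the field just creates one more reference to the same collection.
    fn children_of(node: &Node) -> Rc<Vec<Rc<Node>>> {
        let kids = node.children.take(); // momentarily swap the Rc out...
        let copy = kids.clone();         // ...add one more reference...
        node.children.set(kids);         // ...and put the original back
        copy
    }

    fn main() {
        let leaf = Rc::new(Node { children: Cell::new(Rc::new(Vec::new())) });
        let root = Node { children: Cell::new(Rc::new(vec![leaf])) };
        assert_eq!(children_of(&root).len(), 1);
    }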
Thank you, that clarifies the idea for me and it's an interesting approach. It seems like an "all in" design that you might apply to e.g. an interpreter.
Of course in practice we don't abandon Rust's slice iterators to achieve basic observers or backreferences, and I've definitely tripped over "someone else is already borrowing this thing" runtime errors.
Of course, Cell has its own overhead, which is why using large Cells is often worse than just using Rc. It's also very situational; we often have types which don't work with Cell, and it often doesn't make sense to copy objects in many use cases.
It's one of those things that works well in theory, but in practice can fall short.
That's totally beside the point here: you can do a mechanical (thus "trivial") translation from JavaScript-like code to Rc+Cell, without ever hitting that overhead.
That is, JavaScript-like code simply doesn't do large assignments in the first place. It doesn't directly hold move-only objects that are trickier to use with Cell.
If you find yourself hitting those cases, you have left the realm of JavaScript-like code, and entered the realm of manual memory management.
> The limitations of &mut cannot be trivially worked around.
> The "trivial" equivalent to JavaScript is not RefCell, but plain old Cell, applied at the level of individual fields you want to write to. No panics and no (relative) overhead there.
I thought you were implying that we could trivially work around Rust's problems by just using Cell everywhere, my comment was in response to that. It seems now that you were talking about something else, so nevermind =)
And the other extreme, avoiding Rc entirely, railroads one into a certain architecture which is good for only some use cases and not others.
I like to think the community will someday learn when to use Rc. It had better, lest low-overhead languages with more flexibility like Cone [0] overtake Rust.
Do you have any examples of the “perfectly safe code the Rust compiler will nag you about”? Not trying to start language wars, just genuinely curious as someone who writes Rust on occasion.
It's not very good at knowing when something doesn't alias, for example.
E.g. if you do this, the compiler doesn't realize it's safe to do because the array locations are different so I'm not borrowing the specific location mutably more than once. Instead it nags me that I have to split the array into non-overlapping slices.
    if i != j {
        swap_items(&mut arr[i], &mut arr[j]);
    }
A contrived example and obviously the same can be achieved in many other ways, most of which the compiler would be happier about - but that's often the case with Rust: a seemingly safe thing isn't quite safe enough for the compiler so you have to do it differently. And that's the main problem of ergonomics in the borrow checker imo.
This is helped enormously by helpful error messages, and there is great progress on fixing little paper cuts and improving the borrow checker so that more valid programs are accepted. But it doesn't contain a massive AI or theorem prover, so there will always be situations where you'll need unsafe despite the code not actually being unsafe, or where you'll have to do something a bit more contrived than you might have expected.
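For the record, one shape of the workaround the compiler points you toward, sketched under the assumption that i < j (for a plain swap, the standard library already encapsulates this as `arr.swap(i, j)`):

    fn swap_checked(arr: &mut [u32], i: usize, j: usize) {
        if i < j {
            // split into two provably disjoint &mut slices
            let (left, right) = arr.split_at_mut(j);
            std::mem::swap(&mut left[i], &mut right[0]);
        }
    }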
This is a good example. Another common situation is a struct that has both `foo` and `bar` members and then exposes both `.get_foo_mut()` and `.get_bar_mut()`. (Again this is a trivial example, and maybe in the real world they do more work before returning those references.) The problem is that it's illegal to call either of those while the return value from the other is still alive. Even though we know they don't alias each other, and we could totally accomplish the same thing if the members were public, there's no way for the method signatures to express what parts of the struct they don't touch.
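A minimal made-up version of that situation:

    struct Widget {
        foo: String,
        bar: String,
    }

    impl Widget {
        fn get_foo_mut(&mut self) -> &mut String { &mut self.foo }
        fn get_bar_mut(&mut self) -> &mut String { &mut self.bar }
    }

    fn demo(w: &mut Widget) {
        let f = w.get_foo_mut();
        // let b = w.get_bar_mut(); // ERROR: cannot borrow `*w` mutably twice
        f.push('x');

        // Direct field access is fine; the compiler sees the disjointness.
        let (f, b) = (&mut w.foo, &mut w.bar);
        f.push('y');
        b.push('z');
    }

    fn main() {
        let mut w = Widget { foo: String::new(), bar: String::new() };
        demo(&mut w);
    }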
The compiler is seeing the `id` and `f` references as overlapping for the entire arm, even though the use of `id` and `f` are not interleaved. Bearing in mind that I don't actually know how the compiler works here, but I don't think this is a borrow checker limitation in and of itself, rather what I think is happening is that in the match expression the compiler is creating both `id` and `f` directly from `val`, creating the overlapping borrow.
The reason I believe that is that this equivalent code results in the same error[1]:
    fn do_foo(val: &mut Outer) {
        let f = val.get_inner();
        let id = val.get_inner().get_a();
        if *id == 3 {
            *f = Inner::B(25);
        }
    }
Whereas if you create the `id` reference from `f` instead of from `val`, the compiler accepts it because `f` is not used between `id`'s creation and death[2]:
    fn do_foo(val: &mut Outer) {
        let f = val.get_inner();
        let id = f.get_a();
        if *id == 3 {
            *f = Inner::B(25);
        }
    }
My go-to example is the non-lexical lifetimes stuff, like when you're somewhere in a match and you don't get to return a reference to a thing because there's another reference that's clearly not relevant anymore at the top of the match.
Though it's worth mentioning that Rust gained support for many non-lexical lifetime patterns a few years ago, and has designs on supporting more in the future.
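The classic instance of the remaining gap, sketched from memory (this deliberately does not compile today; it's the kind of pattern newer borrow-checker work such as the Polonius project aims to accept):

    use std::collections::HashMap;

    fn get_or_insert(map: &mut HashMap<u32, String>) -> &String {
        match map.get(&0) {
            Some(v) => v,
            None => {
                // ERROR today: the borrow from `map.get` is treated as live
                // across the whole match because the result is returned.
                map.insert(0, String::new());
                map.get(&0).unwrap()
            }
        }
    }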
> no static analysis on earth is going to stop you from actually needing tests to see if your code works.
I might have been convinced if mathematical proofs were not expressed in code. If a proof can exhaustively cover the problem space, then there's no need for further testing.
Fair point, but would you deploy un-tested code to production based on a static proof? AFAIK the issue with this stuff is that the kinds of problems proofs are able to practically cover are very small.
Static analysis is an incredibly useful tool, not denying it. I am saying we should avoid worshipping at its altar. Any useful program is incredibly complicated and needs some kind of run time testing.
I don't feel like static analysis and unit testing are very different things. Given that my unit tests run roughly at compile time (ideally they would run during compile), it really isn't different than static analysis. Especially unit tests where all the inputs are mocked and expected outputs are provided.
When I wrote Ruby/Javascript, I remember needing to write unit tests that verified types of input/output variables. These kinda tests were undifferentiated boilerplate that needed to exist but didn't need to be written by me. Especially since there were many tests I'd forget to add or would intentionally not add because the test file was getting obnoxiously large.
Factoring out those boilerplate tests into the compiler (when using Java, Rust, or TypeScript) was very valuable to me, but didn't change the fact that they are basically automatically generated unit tests. The borrow checker in Rust is a similar factoring out of automatically generated unit tests, which I wouldn't have typically written.
Continuing to push more undifferentiated unit testing into the compiler/libs/proofs helps make sure I only need to write domain specific unit tests.
Why do you feel static analysis and compile time unit testing are different? or do you mean, domain agnostic testing is ok, but really we need domain specific testing?
> When I wrote Ruby/Javascript, I remember needing to write unit tests that verified types of input/output variables.
When I write in dynamically typed language I never write tests like that. I'm more familiar with static than dynamic, but are experts really doing that kind of thing?
Sure, sometimes I even deploy code without a proof nor tests ;)
Jokes aside, if my problem is small enough to be proved (for whatever value of small), I do not see any more value to be extracted from tests. Who bothers checking that 3+2 = 2+3 after proving that x+y = y+x?
I wish static analysis were always sufficient. Curry-Howard doesn't obviate the need for tests. Anything affected by the physical implementation of the computer still needs to be tested, for example. That includes corruption safety for databases, timing side channels in cryptography, all high-integrity code, and so on. Proofs can also be buggy themselves, as can the verifiers.
But yes, formal methods make testing a lot easier and taken far enough, can suffice on their own.
> Anything affected by the physical implementation of the computer still needs to be tested, for example. That includes corruption safety for databases
I think that comes under "taken far enough". If you can model the corruption in your proof, you're good. I'm less confident about timings. But you're right that testing is still useful for bugs on other levels. After all, going high enough, humans can have buggy requirements, and no proof will catch that. Tests might.
I took computer language theory in college literally 20 years ago, had a sandwich with chips during every class. I clicked on that link and could taste the sandwich.
There's a very good chance that in whatever language you're using you're not testing remotely close to what you'd need to in order to reach parity with that of the rust compiler. It's very Dunning-Kruger of a developer to believe that they are achieving such a level of edge case testing.
I hope Rust comes out as the winner in the current crop of possible C/C++ replacements.
It seems like most of the others make a lot of compromises to keep things simple and open.
The bugs are just going to keep happening if we don't do something. And if we don't stop making critical security errors, there could really be a movement to return to analog tech, big enough to set us back 10 years.
So many people don't trust computers, and a lot of emerging tech like self driving cars and IoT in the home relies on trust.
Rust, as mentioned; it was created as a direct replacement to use at Mozilla, among other things.
Zig, which is significantly simpler than C++; good but I'm not sure if it has comparable expressive power (maybe it does, IDK).
D, which has been around for some time, but hasn't made inroads comparable to Rust.
Ada, which has been around for even longer, and has enough mindshare in certain industries, but sadly not enough generally.
GC-based languages, from Go to Haskell, apparently can't be considered true replacements, even though they are fine for large areas previously dominated by C++.
Nim, if ARC/ORC reference counting is acceptable (which considering I'm using it in embedded firmware right now, it honestly should be, but I can understand why for certain cases it can't be). But it has the same small user-base/lack of inroads as some of the others on your list
Oh for sure. Though manual memory management does mean giving up a _lot_ of the standard library, and ARC is a brilliant in-between.
Though you're 100% correct, its flexibility is what makes it so great in my experience.
Honestly one of the nicest things about it for the embedded work I'm doing is knowing that just about everything I'm writing is boringly stack allocated (minus seqs, strings, etc), exactly like we would've done in C ourselves anyway. But when we need to reach for dynamic memory, then ARC has been brilliant. And having RAII-like destructors I've written around C peripheral libraries has been really nice for reducing errors and making it quick to develop in.
D: has been around for a while but never caught up; doesn't even look “new” enough.
Nim: also quite old, but gained a bit of momentum in recent years and even reached version 1.0 in 2019, after 11 years of development. It doesn't look like it has gained much more since then.
Zig: unstable and experimental; it used to be developed by a single person and recently gained a small community of contributors. May be usable in 5 years but not earlier.
Ada: has important marketshare in some industries, but even in said industries its future is uncertain because not many people want to work with it. For instance, at one of the biggest train manufacturers in the world, Ada has been replaced by C in some sectors because hiring Ada people was too hard.
Doesn’t Rust place too many limits on what programs you can express to be a viable replacement for C? The premise of C is that you can more or less write any program. Not in the Turing complete sense that is always true, but in the sense that you can make any sequence of memory modifications and control flow decisions and the compiler will produce the corresponding machine code. With Rust, the compiler will only accept a subset of possible and safe programs, the subset that fit in the patterns that the Rust borrow checker can prove are safe.
Rust has unsafe blocks, which let you dereference raw pointers. Also, remember that even in C, if you do something that's Undefined Behavior, the compiler doesn't have to emit instructions that do what you wanted.
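For anyone unfamiliar, the escape hatch looks like this (a trivial sketch):

    fn main() {
        let x: u32 = 42;
        let p = &x as *const u32; // creating a raw pointer is safe...
        let y = unsafe { *p };    // ...dereferencing one requires `unsafe`
        assert_eq!(y, 42);
    }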
If 99% of the time I'm helped by those checks and there is an escape hatch for the remaining 1%, then it's a clear win to have that limit in as many places as possible.
Why would you ever want to write something that isn't provable, when you could write something that is?
You always have unsafe blocks for tiny bits of performance critical hardware interaction stuff, but in general, I don't see why we need to be able to access the whole program space in 99% of cases.
Proven by Rust’s borrow checker. That’s not the same as the world of all provably safe programs, which is probably infinitely larger than what Rust will allow.
>I hope Rust comes out as the winner in the current crop of possible C/C++ replacements
What makes you think C and C++ will be replaced?
I see no systems programming language that's simpler than C and no systems programming language as flexible as C++.
Being that C++ is huge already and its developers like to add features after features, it's almost guaranteed that you can find a subset of C++ that suits the way you like to work.
Your comment seems to ignore all the evidence showing that approx. 70% of security-related issues come from those memory-unsafe languages. Two independent large companies, Google and Microsoft, found that same statistic on two entirely different and large code bases.
You can ignore those findings as you like, but that is why so many people are interested in Rust.
All of those are existing software products; the sunk cost in them is so high that overcoming it is monumental. Firefox continues its journey. The better question is what they should do with new software.
The analogy I see is an industry similar to the auto industry as it decides what to do with their ICE based platforms as consumers are going after all electric.
People tend to forget that we use those languages, because the gatekeepers of certain platforms only offer them as supported option, with nice tooling and libraries.
Naturally one can bring others into the game, when they are willing to replace the platform owners in the whole stack beyond a bare bones compiler.
I don't see any reason a language needs to be simple, as long as the abstractions are close enough to zero cost. Since Rust can run on even 8 bit microcontrollers, it should be able to replace C in a lot of places.
There might be some set of tools out there that will make C++ as safe as Rust (it seems like some are talking about borrow checkers but finding it near impossible?), but Rust is designed for safety from the ground up.
Plus, even if your linter catches 100% of bugs, your libraries probably won't use those standards, and may be full of bugs.
Not that you'd ever do a JS-level amount of reuse in C, since the lack of standardized package management sometimes makes it not even worth it to use a library, almost like they actively want to discourage reuse.
Rust has a problem with some things being community that should be stdlib, but standard practice in C++ seems to be to reinvent way more wheels than other languages.
People always think the next language, framework, paradigm, etc is going to solve all their problems. It's a pipe dream. You can write garbage in any language.
Some people don’t understand that reducing bugs and eliminating entire classes of bugs is progress. Their “gotcha” is that it’s still possible to write logic bugs, and therefore what’s the point of any language improvements?
Libraries are where unsafe belongs. Having unsafe be an explicit feature is immeasurably better compared to every line in the entire program potentially being unsafe.
Opting out of safety is fine, but safety must be opt-out to be effective, not opt-in. A standard, enforced way to tell readers where the dragons lie is super helpful.
Right, note that I said safe Rust, which is the vast vast majority of Rust out there. This is a major improvement over, say, C, where any line of code could be potentially unsafe.
> And if we don't stop making critical security errors, there could really be a movement to return to analog tech, big enough to set us back 10 years
I'm very curious about the world you live in. Security of IoT is literally the last concern of pretty much everyone I know. The ambient sentiment is "Alexa/Siri/Google is already listening to everything we do anyways".
Privacy isn't anywhere near my list of concerns (at least not for myself personally), but the people who do care, care a LOT.
And even beyond privacy, those people don't like the idea of dependence, not learning paper arithmetic and handwriting, etc.
There are also the rare insane ecofascist types who think technology is unsustainable, and that we need to drop it all (and then just let people die until the low tech becomes sustainable...).
A lot of people have a certain amount of respect for Kaczynski in some communities.
Lots of people hate vegetarianism without making any claims at all about the health, they just say it's unnatural, and that's enough reason.
I think there's enough people who really would enjoy a computer-free world to make it a trend to boycott this stuff.
I've always thought that Apple is actually largely an anti-tech company.
While in some cases they do have some excellent high tech under the hood (M1 and their zeroconf network protocols), their aesthetics are almost Bauhaus-level minimal, and the feature set is often randomly reduced (like the one-port laptops).
Their UI is very gestural, involving muscle memory and a feeling of "flowingness" not "proceduralness".
There's a slight sense of "You're smart and capable, you don't need to be a tech person and hide behind a computer, you don't need everything to be perfectly compatible, it's fine if some features aren't perfect" in their restrictive app policies and unique connections.
It's not quite the "Star Trek dream", where no matter what happens, tech is a point of trust that you expect will adapt to get you out of any jam. They want it to be an appliance or luxury piece of furniture. A multi-purpose one, sure, but you're supposed to feel like you'd be just fine without your iPhone.
Sure, everyone has a Siri... but it seems like about half love them, and the other half wish it was easier to just live off grid.
The reason people don't trust "computers" has nothing to do with bugs in the code lmao. It has to do with the fact that they inescapably track their every move and fuck with their minds to completely inscrutable ends.
That shit is basically working as intended, a stricter compiler is not the solution to it.
Does anyone else feel like RAII is poorly named? It seems to me that the largest benefits of RAII that people talk about aren't about resource acquisition or initialization, but releasing the resource. Taking the phrase "Resource Acquisition Is Initialization" at face value just sounds like it describes object constructors.
It comes from C++ where memory being "initialized" to a particular type has a very specific meaning in the language: when a constructor is called on a memory location, it becomes initialized to the type of that constructor, and the reverse when the destructor is called.
"Resource Acquisition Is Initialization". ie. You cannot separate the acquisition of the resource from the initialization of the type (and by implication, you cannot separate the release of the resource from the destruction of the type).
The language already required that initialization/deinitialization is correctly paired for all programs, so RAII was a way to take that guarantee and use it to manage resources as well. And since for local variables it is the compiler itself that upholds the guarantee, the compiler can also ensure resources are acquired/freed correctly.
Without that specific context, the name is poor, but C++ popularized the concept and its name, and once a specific technical name for something exists it's a good idea to continue to use it so as to avoid ambiguity when communicating, even after the originating context is lost.
"RAII" describes what you have to do to make resource releasing work correctly in C++ when there are exceptions all over the place. If you first acquire a resource and then later initialize an object with it, or assign it to an object, there's a time window when the resource has been acquired but is not yet owned by the object, during which an exception will cause it to leak. If, for example, the object's operator= raises an exception before safely storing the resource, you lose.
Alternative names like "constructor acquires, destructor releases" might be better, but I think "RAII" was sort of devised as a slogan to shout at people so they would fix their code.
RAII was invented before C++ considered exceptions.
Back in MS-DOS/Windows 3.x, and Mac OS, CFront/UNIX days, RAII was used for file handles, network connections, other sort of OS handles, and naturally data structures like dynamic strings and arrays.
When exceptions came into scene, RAII was already established as pattern.
I think you're using the term "RAII" in a much looser sense, and we didn't call it that at the time (though I don't have a convenient way to demonstrate this now that Google Groups has nerfed their search). C++ code from that period (and Google C++ code to this day) violates RAII regularly by acquiring resources in non-initialization contexts.
RAII without exceptions is pretty inconvenient: acquiring resources can almost always fail (the only exception I can think of is locks) and so you need some kind of return value, which is precisely what you don't have in a constructor. So you need a flag value in the initialized object which you later have to check, which means that all your resourcy objects are "nullable", in the sense that the type system can't guarantee you that they're in a meaningful state, and you need a dynamic check.
Maybe you mean that people commonly used destructors for cleanup before C++ had exceptions, and I certainly agree with that. But using destructors for cleanup is not the same thing as RAII.
Without RAII, without exceptions, without destructor cleanup, perfectly OK:
int fd = open(fname, O_RDONLY);
if (fd < 0) return FAIL;
...
close(fd);
return OK;
Without RAII, without exceptions, with destructor cleanup, perfectly OK:
int fd = open(fname, O_RDONLY);
if (fd < 0) return FAIL;
File file = fd;
...
return OK;
Without RAII, with exceptions, terrible buggy dangerous code:
int fd = open(fname, O_RDONLY);
if (fd < 0) return FAIL;
File file = fd;
...
return OK;
Note that this is exactly the same as the previous "perfectly OK" example!
You can try to fix this up with catch(...) { close(fd); throw; } but the correctness of such convolutions depends intimately on the precise contours of how exceptions can happen, if at all, in the File constructor.
With RAII, without exceptions, terrible buggy dangerous code:
File file(fname, O_RDONLY);
...
return OK;
Correct version:
File file(fname, O_RDONLY);
if (!file.ok) return FAIL;
...
return OK;
With RAII, with exceptions, perfectly OK:
File file(fname, O_RDONLY);
...
return OK;
Note that this is exactly the same as the "terrible buggy dangerous code" above. The difference is only that now the File constructor signals failure by throwing an exception instead of setting a flag.
This is also the most efficient of all the versions on the happy path because it doesn't waste memory on success flags or conditional jumps (in this function and in its caller) on checking them. It's also the most concise and, arguably, the least error-prone. (Google policy disagrees.)
C++ isn't exactly known for having good acronyms. CRTP ("curiously recurring template pattern") at least tells you that it has to do with templates. SFINAE ("substitution failure is not an error") tells you nothing about the context, nor how it would be used, despite being fundamental to metaprogramming in C++.
> just sounds like it describes object constructors.
I think I've said this verbatim, so you have my vote. I'm curious what other names people might give it.
Wikipedia gives:
> Other names for this idiom include Constructor Acquires, Destructor Releases (CADRe) and one particular style of use is called Scope-based Resource Management (SBRM).
I like your version though, it emphasizes both sides directly. You got a resource? Well, you have to think about how to release it too. Maybe just a general term like: Resource Acquisition And Release? Let's be RAAD about being sure we release all resources :)
This is pedantic of me, but I think it matters in terms of what Rust actually provides:
Rust does not provide any resource leak guarantees. The fact that resources tend to be freed when their owning scope closes is a “fallout” consequence of ownership semantics, but Rust itself does not guarantee that dropping an object necessarily closes or disposes any underlying system resources. You can prove this to yourself by writing a thin wrapper over the nix crate’s POSIX APIs: you can leak a file descriptor by forgetting to add a Drop implementation for it.
Similarly, Rust won’t guarantee that all allocated memory is freed. `Box::leak` has well-defined lifetime semantics: it turns a `T` into a `&'static T` by removing its drop handler and leaking the underlying pointer. And this isn’t a problem, because it doesn’t compromise either spatial or temporal memory safety!
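The leak in question is a one-liner, and entirely safe (a minimal sketch):

    fn main() {
        let b = Box::new(42u32);
        // The allocation is never freed, but every access stays valid:
        // leaking trades memory for a &'static borrow.
        let r: &'static u32 = Box::leak(b);
        assert_eq!(*r, 42);
    }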
It’s true that it’s not a guarantee. But I feel like this is one of those cases where it could be an issue in theory, but pretty much never is in practice.
It's been a while since I wrote Go, but my worry used to be that linters wouldn't be able to follow cases that were just slightly more complicated than average. Like if I open one file, yes, the linter will be able to tell if I don't close it. But if I put a bunch of files in a long-lived map, the linter isn't going to know that any function calling delete() on that map needs to be closing the files too. But in Rust or C++, this case is handled smoothly by destructors. (With maybe a little caveat about whether it would be better to catch any errors that might arise when closing the file.)
It’s nice when you don’t have to worry about what linter you should be using because everyone uses the same static analysis to check for this sort of thing and it lives in the compiler.
And, as we saw in several previous threads: I don't think Clippy lints are an adequate substitute for correctness checks in the Rust compiler itself, and plenty of people with more actual impact on Rust agree with me about that. In practice Clippy lints do get "promoted" in this way, though not always as quickly as I'd like.
However, Clippy has lots of style lints which I'm sure some people would be very annoyed to see in the compiler itself. Do you want a Clippy lint telling you that you should do X, and not Y, just because it's considered stylistically better and even though it has literally no impact on the compiled result?
So it makes sense to me that Clippy exists as a linter; the problem is the abuse of linters for "actually, programs in this language are dumpster fires unless you obey these lints", and so far Rust is avoiding that.
Sure, I'd like the _ = lock() lint to be an error for example, since you almost certainly didn't mean that, and if you did that's the least useful way to express it.
But time is limited, perhaps I'll get to it at the weekend if nobody else has already done so.
Certainly. Rust has a well-thought-out standard library, and sticking to it will (generally) guarantee that the connection between resource acquisition and memory safety is maintained.
That being said: it can be a problem in practice, particularly in sandboxed or otherwise constrained environments. Leaking a file descriptor isn’t a problem when you have tens of thousands, but it can be one when you’ve constrained the process to just a dozen.
I mean, it's called Box::leak(). It says what it does on the tin. Why was my file descriptor leaked? Oh, I specifically leaked it. As to third party libraries, well, it's a quality of implementation issue. The third party library might forget to flush caches, muddle up file formats, or just crash.
This use of clear naming is one of the things I value in the Rust standard library. Both Rust and C++ provide both a stable sort, and also an unstable sort which may be faster if you know that's suitable. But Rust calls its unstable sort unstable_sort() and so if you don't even know what the difference is you're going to pick "sort" and get no surprises. In C++ "sort" is the unstable sort and if you didn't already know about sort stability then I guess you'll find out the hard way, sucks to be you.
You'll find absolutely no argument from me about these things -- when Rust can't guarantee a particular thing (and there are many, many things it reasonably can't and won't guarantee!), it does an excellent job of keeping its APIs descriptive.
The sole point was one of formal guarantees: this post is about bugs the compiler catches, and the first example is one that Rust will not catch. It won't catch it because, in many cases, it's not a bug at all!
I've definitely run into accidental reference cycles with `Arc` in concurrent Rust code in production before (due to weak references essentially baton-passing the singular strong reference count to each other before getting dropped). This is a bit of a different kind of memory leak, and even having run into it I'd strongly prefer to write any concurrent code in Rust over any other language I've tried given the choice, but I'd be hesitant to say that Rust guarantees that nothing will leak, if only to keep people from getting complacent.
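A stripped-down version of how such a cycle leaks (my own minimal example, not the production code):

    use std::sync::{Arc, Mutex};

    struct Node {
        next: Mutex<Option<Arc<Node>>>,
    }

    fn main() {
        let a = Arc::new(Node { next: Mutex::new(None) });
        let b = Arc::new(Node { next: Mutex::new(Some(a.clone())) });
        *a.next.lock().unwrap() = Some(b);
        // a -> b -> a: when the locals go out of scope, each strong count
        // stays at 1, so neither Node is ever dropped.
    }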
Are you suggesting if rust removed "safe in practice" features (only keeping theoretically safe features) it would lead to less exploitable software? If so I strongly disagree with you. Every language is rife with features that can be used in unsafe ways but in practice increase security.
I read this more as a “let’s be precise about what’s actually guaranteed” and not an exhortation to avoid Rust.
Rust is my favorite compiled language, and that’s why I’d like conversations about Rust to be grounded in formal guarantees and not in incidental properties.
I agree it's not something that should be relied upon, nor is it elegant, I'm just pointing out that in the average case, failing to close a file is not a resource leak in the usual sense. It's not like forgetting to close a file descriptor in C.
To answer each of your questions:
1. Generally no, but if your process exits then you definitely aren't leaking a file descriptor. The great finalizer in the Linux kernel gets them then.
2. Yes absolutely, but that also isn't a leak. If you're opening a lot of files, you probably have to handle open failing as well anyway.
1. This can lead to programming errors if, e.g., a write is buffered in Golang and Close() flushes the buffer. Then you might not correctly write the file. (I know that if you really cared, you should use fsync, but lost writes could happen in e.g. logging where you don't want the overhead of fsync but you would also like to see all log output, especially on program crash.)
2. I think this is a bigger deal than you are making it out to be. If open fails, how would you handle it without just exiting? I can't see a way of forcing finalizers to run. If you're distributing your Go binary to users, you may not have permissions to increase the allowed number of file descriptors. So your program no longer functions correctly.
Example: A program that processes files in parallel. At any given time it might have 2 * num_cores files open, well below the default descriptor limit on most systems. If I rely on finalizers running, then I might have to exit if the time to process each file is sufficiently short. There is no way to fix this without instructing the user to increase their fd limit. This is bad. Alternatively, if I explicitly closed files, I would never exit.
At the very least, that does not work for rolling back transactions with the database/sql (https://pkg.go.dev/database/sql) package, although it may work for other cases. We've had numerous production bugs result from this.
I’m not a Go programmer, but I assume it’s there for the same reason every other GC’d language has the ability to close resources manually: sometimes you just want to do it earlier (or more explicitly) than the GC would.
Also in Rust, you have combinations of existing and upcoming language features that make certain conceptually simple designs either outright impossible to implement, or only possible to implement a subset of due to certain barely documented compiler quirks, or impossible to determine whether it is actually possible to implement them without sinking hours of time into dead-end prototyping attempts.
Like, not "Rust doesn't do this," but "there's a 50% chance that Rust can do this, but it might turn out to be impossible after all, and absolutely no one can tell which is which, because the language is just barely not expressive enough, or the compiler doesn't analyze this particular case, and might not until both the borrow checker and the type system receive complete rewrites."
And this, in turn, kind of explains why the Rust standard library and especially the Rust compiler are swimming not just in unsafe code blocks, but code that uses features so unstable and private that you can't even enable them with flags on nightly builds.
    use std::fs::File;

    fn load() -> std::io::Result<()> {
        let wordlist_file = File::open("wordlist.txt")?;
        // do something... no need to close wordlist_file:
        // it is closed when the variable goes out of scope
        Ok(())
    }
What happens if there is an error when closing the file that I have written to?
The error will be ignored, per the docs for File [0]:
> Files are automatically closed when they go out of scope. Errors detected on closing are ignored by the implementation of Drop. Use the method sync_all if these errors must be manually handled.
If you close your files without waiting for fsync to return first, do you really care if the data has hit the disk? If fsync didn't fail, but close fails, what can you do then? Calling close() doesn't imply anything about flushing buffers or syncing data to disk or anything like that. It's just a signal to the OS that your process is done with this particular resource.
Also, even if close() fails, you can no longer use the file descriptor anyway, since it might be already reused. The only thing you should do with the error from close() is log it to diagnose why it happened and prevent it next time.
I agree it's a bit unfortunate. The rub here would be that `Drop` would become fallible, and if it is fallible, then … how does it fail, exactly? (What happens to the error?)
There's exceptions, but the downsides to such systems are pretty extensively covered.
Nonetheless, the point here is that RAII offers a deterministic close compared to other approaches, at least, even if the write's success isn't covered. You can get that, too, with,
wordlist_file.flush()?;
or
wordlist_file.sync_all()?;
depending on desires.
(And again, I agree that requiring the programmer to remember code in order to obtain safe behavior is not desirable. But this problem manifests in pretty much any other language, and typically in worse manners.)
Yeah, it's not an ideal situation. A lot of the time you just read from the file and then you don't really care about the success of close(), though. Maybe that should be a whole set of different types...
As far as I know, Vale is the only language that can statically ensure we handle that error with its Higher RAII [0], a form of linear typing.
Basically, File's drop() returns a Result, and the compiler enforces that we use it.
I hear linear types might also be coming to Haskell soon, which is pretty exciting. Such a thing is unfortunately impossible in Rust (though many languages can detect it at run-time).
There's just not a language feature to prevent you from letting an object go out of scope without "using" it.
There is `Drop`, which automatically runs when an object goes out of scope, but you can skip that by leaking the object, e.g. with an Rc cycle.
There is also `#[must_use]`, which warns when you don't do anything with an object, but you can explicitly silence that with `let _ = the_object` or `let _bla = the_object`, and it doesn't complain if you exit that scope early (either via normal control flow or panic).
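A quick sketch of `#[must_use]` and its escape hatch (the `Guard` type is made up):

    #[must_use = "this guard must be checked or explicitly discarded"]
    struct Guard;

    fn acquire() -> Guard {
        Guard
    }

    fn main() {
        acquire();         // warning: unused `Guard` that must be used
        let _ = acquire(); // warning silenced; the value is still dropped here
    }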
The reason Rust has `Drop` and `#[must_use]` but not linear types (short of "nobody's bothered yet") is that they aren't really all that pleasant to use and they impose a lot of constraints on the APIs you build with them. You would have to prevent leaks of linear objects (which means, among other things, no reference counting or moving between threads!). You would have to track panics in the type system and/or forbid calling most functions while a linear object is live. More detail here: https://gankra.github.io/blah/linear-rust/
The current design means the default way to use a `File` won't handle errors on close, but you can opt into that by making an explicit call yourself. In exchange, you can do a lot of things with `File` objects that you couldn't with a linear `File`. (My suspicion is that Vale either doesn't use linear types for files, doesn't actually enforce linearity at compile time, or makes files painful to use with generic code.)
I've been using higher RAII with Vale for quite a while, and it doesn't have the problems Rust seems to have.
It's actually not difficult to design APIs to properly use handles and other RAII'd objects. Either don't drop generic types (for example, the HashMap doesn't drop any user types), or require a drop generic bound or concept function [0].
If one wants to make it even easier, just take in a "consume" lambda which will take arguments. Then, the caller can specify what to do with these objects. That's how Vale's drop_into function for arrays works, for example.
In my opinion, Rust made the same mistake with `drop` that Java made with `toString` and `hashCode`. We shouldn't needlessly couple functionality with objects, it just causes problems, as is seen with Rust's lack of linear typing. But that's just my opinion ;)
The trouble with Vale's "concept function" and the C++ Concepts it is mimicking is that they're duck typed. We know syntactically that we can do this, but there's no reason to believe it's semantically justified.
This is why you'll notice Rust's traits don't have names like "Iterable" or "MaybeComparable" but "Iterator" and "PartialOrd". They have semantics. Rust's Iterator is not merely any thing which happens to have a next() method but specifically an iterator in which that next() method gets to the next item.
Over in Concepts land they have two "fixes" for this, but they're both pretty unsatisfactory. One fix is, you document what the concept means, and then you just tell the programmer it's their fault when the syntax matches (so it compiles) but the semantics don't (so it's broken). The other is, just tell people not to use Concepts for simple things, if it's complicated enough then likely the syntax will only match when semantically aligned. Unfortunately of course lots of powerful er, concepts, have exactly one match point.
This is funniest when in juxtaposition e.g. Stroustrup first showing off concepts with a simple example like your Fireable concept back in 2018 or so, mixed with Stroustrup in 2020 explaining that it's crucial never to do anything like that because it's too fragile...
> If it helps, think of it this way: a concept function is a 1-method trait on an implicit parameter.
No, that just underscores the problem, it's just duck typing. This "1-method trait on an implicit parameter" doesn't deliver semantics.
Let's look at an actual Rust trait, Eq. https://doc.rust-lang.org/std/cmp/trait.Eq.html and notice that the implementation of this trait is empty. For a duck typing system there is nothing to check, syntactically Eq doesn't do anything.
But of course semantically Eq is rich with meaning. You can't go around using a HashMap with keys that don't exhibit Equivalence, what's that going to do? Nothing good.
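That emptiness is visible in code: the impl is a bare semantic claim (the example type here is mine):

    #[derive(Hash)]
    struct UserId(u64);

    impl PartialEq for UserId {
        fn eq(&self, other: &Self) -> bool {
            self.0 == other.0
        }
    }

    // Nothing for a duck-typing system to check: implementing Eq only promises
    // that the PartialEq above is a full equivalence relation, which is what
    // lets HashMap use UserId as a key meaningfully.
    impl Eq for UserId {}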
I see now what you mean, you're talking about structural interfaces, like Go and TypeScript have. Yes, concept functions are those applied to generics.
You may not like structural interfaces, but they're much better at decoupling data from how it's used. People who have used both Typescript and Rust often wish more languages worked with structural interfaces like Typescript does.
It's easy to see why, looking at the examples in the article.
I believe in decoupling, and think a good language will have as little coupling as possible. To each their own though!
Because panics in Rust can force-drop data as part of unwinding the stack. If Rust supported panic-free code easily, it could also add linear data which would eliminate the default Drop and replace it with custom, user-managed operations.
Off topic: what type of error can occur when closing a file? Is it somehow possible that the kernel denies your request, and forces your handle to stay open?
> it is quite possible that errors on a previous write(2) operation are reported only on the final close() ... Failing to check the return value when closing a file may lead to silent loss of data.
> the behavior that occurs on Linux ... the file descriptor is guaranteed to be closed.
So yeah, it's always closed on linux, but POSIX doesn't guarantee that for EINTR specifically, and there are sometimes meaningful errors.
I was referring to checking because "you should check" when you don't actually care enough to structure your program around not losing the data. It's easy to check and emit a diagnostic; it's harder to make sure you have collected the data you've written to a file descriptor and have something sensible to do with it if the close fails.
In summary, man close(2) gives us the potential errors.
This is the output for Ubuntu 20.04 LTS:
EBADF fd isn't a valid open file descriptor.
EINTR The close() call was interrupted by a signal; see signal(7).
EIO An I/O error occurred.
ENOSPC, EDQUOT
On NFS, these errors are not normally reported against
the first write which exceeds the available storage space,
but instead against a subsequent write(2), fsync(2), or close().
Other comments have mentioned _how_, I figured I'd mention a couple of scenarios that this could create.
Programs that fork/exec will typically close all their open file descriptors, so if a close fails (like an EINTR) it's possible a child could inherit an open file descriptor for a file that they otherwise wouldn't have access to.
File descriptors are also a finite resource so if you're opening and closing thousands of files you could run out.
Both situations are unlikely, but I'm sure somebody has had their day ruined by these.
While Swift looks more polished and much less verbose on the outside, Rust has better 'fundamentals' and community. Swift to Wasm is only an external experiment, not part of core Swift. Pretty much everything outside iOS is only an external experiment, not part of core Swift.
If these issues were fixed, would the Swift ARC compiler be preferable to the explicit Rust semantics, or are there other reasons ARC is less favorable than the borrow checker?
There has been effort to be able to use Swift for the backend but it’s not taking off. I like the language for sure. Maybe people prefer to go all in performance by taking Rust?
Android has a lot of Rust now (the bluetooth stack was rewritten in Rust), npm is being rewritten in Rust, Servo is written in Rust, Dropbox's sync engine, Amazon AWS Firecracker, Discord is replacing Go with Rust, Google's new OS Fuchsia has a lot of Rust. There are also tons of companies looking to hire Rust programmers, e.g. Apple, Intel, Microsoft... and there is work to allow for development of drivers in Rust in the Linux kernel.
Compare this to Haskell. Where is it being used? What is it replacing?
Looks like an obvious troll, but it's because you didn't take the time to look at job offers. It's really starting to pick up. Even here in Paris several startups are picking it as their main language. And I am not talking about blockchain startups.
The key difference however being that finalization of an object is guaranteed when it falls out of scope, and not at the whims of when the garbage collector decides to run.
It would be nonsensical to leave releasing an acquired lock up to the garbage collector on its own schedule. Such a situation could very easily cause a deadlock: a program may be waiting on a lock that the garbage collector must release, but while the program is waiting on the lock, no additional garbage is being generated that might trigger a garbage collector run, nor is there any guarantee that, even if such a run completes, the objects you wanted finalized would be.
You do realize you have settings for when you want the garbage collector to do its thing, yes? What you're talking about is a "lazy" garbage collector, but there are other degrees of when to run. You can set it to collect aggressively, immediately after it detects an object is no longer needed, the equivalent of "Object.Free" in other languages.
I think this is a common reaction to <insert thing> being evangelized. I can’t say it’s a good approach to deciding whether or not something is worthwhile or good or w.e, but maybe it’s better this way because it means that when the thing being evangelized doesn’t pan out, not everyone is caught up in the aftermath. That being said, I don’t have anything against Rust or the community, and if it’s a good tool for <job> then it will succeed with or without me or you. Personally I’m interested in how it’s progressing.
Edit: as with many things it’s not right to say that the loudest voices in the room are the community. It could be that this is the behavior you are seeing because somehow it’s managing to get people worked up or engaged in some other way on social media you use, and you find that annoying. If you don’t want to learn the language and engage with the community then you don’t have to, but calling the whole community annoying because people are excited about what they are doing and what they’ve learned and the possibilities they see is… ehhh
1. If the tool is a good fit, why bother with annoying evangelists?
2. I agree, Rust has by far the most (false and misleading) advertising about usage, politics and evangelism going on of all languages around. It annoys me as well. Especially the passive-aggressive bullying of Go.
3. Large parts of the Rust community are not acting that way but are friendly and give balanced advice.
4. Focus on the good parts: The language is a good fit for many applications and large parts of the community are great - /r/rust is a good place for example.
I get your point, the community annoys me too, but I was a true believer for a while too and I understand they are just trying to share what made them happy. I root against it sometimes because it feels like it's getting so complicated. But mostly I root for it because I have written C++ for a living since before it had an official standard, and while the newer standards are better, the compiler refuses to protect me from people who do truly horrific yet subtle things.