I am utterly thankful for new experience reports on Rust, especially for ones th...

marijn · on May 14, 2015

Another thing that should probably be clarified is that the author diagnosed the problem with the `let y = x; let z = x;` code incorrectly (assuming that Rust is creating 'bindings'), which (not having actively programmed Rust for a while) greatly alarmed me because it sounds like a terrible idea. What is in fact happening is that `x`'s value is _moved_ into `y`, which is a lot easier to think about.
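
A minimal sketch of the move described above, assuming `x` holds a non-`Copy` type such as `String` (the original article's type is not shown here). The `consume` helper is hypothetical and only illustrates that passing a value to a function moves it as well.

```rust
// Hypothetical helper for illustration: takes ownership of a String
// and returns its length.
fn consume(s: String) -> usize {
    s.len()
}

fn main() {
    let x = String::from("hello");
    let y = x; // the String is moved out of `x` into `y`
    // let z = x; // would not compile: `x` was moved on the previous line

    let z = y.clone(); // an explicit clone keeps both `y` and `z` usable
    println!("{} {}", y, z);

    // Types that implement `Copy` (such as integers) are copied, not moved,
    // so the same pattern is accepted for them.
    let a = 5;
    let b = a;
    let c = a;
    println!("{} {} {}", a, b, c);

    // Passing `y` by value moves it into `consume`.
    println!("{}", consume(y));
}
```

Uncommenting the `let z = x;` line makes the compiler reject the program, which is the behaviour the article ran into.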

leoc · on May 14, 2015

> but messing with stack frames in a systems language is generally a no-no.

Could you expand on this? Optimising away a stack frame that lies on the border of some security barrier would obviously be Bad News, but what other specific problems are there? Conversely, it seems there are some possible benefits to TCO in a systems language: I'm thinking of those secure-C coding standards which (apparently) tend to ban recursion for fear of stack overflow.

pcwalton · on May 14, 2015

LLVM does do sibling call optimization, which allows for TCO in many common cases, including all cases of a function tail calling itself (but note that RAII makes the definition of tail position subtler than it may seem at first glance).
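
One way to see the RAII caveat, as a hedged sketch: the `Guard` type and the `countdown` and `run` functions below are made up for illustration. The recursive call is the last expression in the function, but the guard's destructor still has to run after it returns, so the call is not truly in tail position.

```rust
use std::cell::RefCell;

// Hypothetical guard type: records a message when it is dropped.
struct Guard<'a> {
    log: &'a RefCell<Vec<String>>,
    depth: u32,
}

impl<'a> Drop for Guard<'a> {
    fn drop(&mut self) {
        self.log.borrow_mut().push(format!("drop {}", self.depth));
    }
}

// The recursive call looks like a tail call, but `_guard` must be dropped
// after it returns, so work remains in this frame once the callee finishes.
fn countdown(depth: u32, log: &RefCell<Vec<String>>) {
    let _guard = Guard { log, depth };
    if depth == 0 {
        log.borrow_mut().push("bottom".to_string());
        return;
    }
    countdown(depth - 1, log)
}

// Runs the recursion and returns the recorded event order.
fn run(depth: u32) -> Vec<String> {
    let log = RefCell::new(Vec::new());
    countdown(depth, &log);
    log.into_inner()
}

fn main() {
    for line in run(2) {
        println!("{}", line);
    }
}
```

The log shows "bottom" first and the drops afterwards, innermost frame first, which means every frame had to stay alive until its callee returned.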

leoc · on May 15, 2015

> (but note that RAII makes the definition of tail position subtler than it may seem at first glance)

Like http://www.nhplace.com/kent/PFAQ/unwind-protect-vs-continuat... this?

kyllo · on May 14, 2015

> Purity isn't as big of a deal in Rust as it is in other languages. We used to have it, but it wasn't very useful.

Right, purity aka referential transparency is basically required when you have lazy evaluation by default (as in Haskell).

Since Rust is a strictly evaluated language, it's easy to reason about the order statements and expressions will be executed in and when side effects will happen, so purity is not generally necessary.

Peaker · on May 14, 2015

Purity is not just about laziness.

It helps with things like concurrency, parallelism, equational reasoning, refactoring, understanding APIs, and more.

surrealize · on May 14, 2015

w.r.t. concurrency:

As you know, the difficulty with concurrency is with shared mutable state. Purity simplifies concurrency by restricting mutability; Rust simplifies concurrency by restricting sharing.

That makes purity less important for Rust.
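
A small sketch of what "restricting sharing" can look like in practice, using only the standard library. The `sum_on_thread` function is a hypothetical example: ownership of the vector moves into the spawned thread, so the spawning thread cannot keep a second, mutable handle on it.

```rust
use std::thread;

// Sums a vector on another thread. `move` transfers ownership of `data`
// into the closure, so the spawning thread can no longer touch it.
fn sum_on_thread(data: Vec<u64>) -> u64 {
    let handle = thread::spawn(move || data.iter().sum::<u64>());
    // println!("{:?}", data); // would not compile: `data` was moved into the thread
    handle.join().expect("worker thread panicked")
}

fn main() {
    println!("{}", sum_on_thread(vec![1, 2, 3]));
}
```

The data stays mutable inside the worker if it needs to be; what the compiler rules out is two threads holding it at once without synchronization.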

kyllo · on May 15, 2015

Purity is also not the same as immutability. Clojure has immutable data and impure functions, and it also offers good, safe concurrency features. Purity helps with other things as the parent comment mentioned, but it's not necessarily tied to immutability. But I maintain that purity is necessary in the presence of laziness, because it's terribly confusing to reason about side effects when you're building up thunks everywhere.

Peaker · on May 15, 2015

Can you sprinkle parallelism annotations and have guarantees about semantics not changing while gaining parallel execution?

Also, note concurrency and parallelism are two of many interesting benefits of purity.

I'll also add unit tests and property testing which are much nicer with purity.

simonz05 · on May 15, 2015

>> It runs about five times slower than the equivalent program

> I'd be interested in hearing more about how these were benchmarked. On my machine, they both run in roughly the same time, with a degree of variance that makes them roughly equivalent. Some runs, the iterator version is faster. It's common to forget to turn on optimizations, which _seriously_ impact Rust's runtimes, LLVM can do wonders here. Generally speaking, if iterators are slower than a loop, that's a bug.

In my novice benchmark I got results similar to the OP's.

  running 6 tests
  test for_range_100   ... bench:        89 ns/iter (+/- 2)
  test for_range_1000  ... bench:       929 ns/iter (+/- 98)
  test for_range_10000 ... bench:      8815 ns/iter (+/- 414)
  test for_while_100   ... bench:        36 ns/iter (+/- 3)
  test for_while_1000  ... bench:       294 ns/iter (+/- 27)
  test for_while_10000 ... bench:      2768 ns/iter (+/- 268)

  test result: ok. 0 passed; 0 failed; 0 ignored; 6 measured

https://gist.github.com/simonz05/afd76c549d6c8afb8081

Jweb_Guru · on May 15, 2015

That is not the same test as the OP's. It's not even the same test between the two different algorithms, since you hide different information from LLVM under different circumstances. If you have to put `test::black_box` everywhere to get anything but zeroes, the only thing you can really conclude is that LLVM is better at optimizing than you are at writing microbenchmarks (I'll agree it can be frustrating at times).
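
For illustration only, here is a sketch of how both loop shapes could be given identical treatment. It is not the code from the gist: the two summing functions and `time_it` are hypothetical, and it uses the stable `std::hint::black_box` with a single timed call instead of the nightly `test::black_box` and the `#[bench]` harness discussed in the thread. A single `Instant` measurement is far too noisy to draw conclusions from; the point is only that both variants hide the same information from the optimizer.

```rust
use std::hint::black_box;
use std::time::Instant;

// Hypothetical workload: sum 0..n with a `for` loop over a range.
fn sum_for_range(n: u64) -> u64 {
    let mut total = 0;
    for i in 0..n {
        total += i;
    }
    total
}

// The same workload written as a `while` loop.
fn sum_while(n: u64) -> u64 {
    let mut total = 0;
    let mut i = 0;
    while i < n {
        total += i;
        i += 1;
    }
    total
}

// Times one call. `black_box` on the input hides the constant from the
// optimizer; on the output it keeps the result from being discarded.
fn time_it(name: &str, f: fn(u64) -> u64, n: u64) {
    let start = Instant::now();
    let result = black_box(f(black_box(n)));
    println!("{}: {} in {:?}", name, result, start.elapsed());
}

fn main() {
    time_it("for_range", sum_for_range, 10_000);
    time_it("while", sum_while, 10_000);
}
```

Build with optimizations (for example `cargo run --release`) before comparing anything, since unoptimized builds tell you little about either loop.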

masklinn · on May 14, 2015

> This is, in fact, the default. You can use the attributes to inform the optimizer of your wishes, if you want more control.

Isn't the default that the optimiser will do whatever the hell it wants, and the attributes simply skew the optimiser's factors in one direction or another? I think what the author means here is that the caller function should be able to define whether the callee should be inlined or not.

> The bindgen tool can help here.

Would be really useful to have an implicit bindgen thing. Maybe a compiler plugin using e.g. Clang's C parser? That way there's no need to maintain the binding. I'd say I'd like a header generator more than a reader though.

dbaupp · on May 15, 2015

Bindgen has a compiler plugin too: https://github.com/crabtw/rust-bindgen#macro-usage

Also, FYI, https://github.com/rust-lang/rust/issues/10530 covers taking a Rust lib and generating a C header file.

steveklabnik · on May 14, 2015

Maybe I misunderstood what the parent wants, but you're right that the optimizer can do as it pleases, and you can use annotations to help it make the right decision.

An 'implicit' tool may in fact be cool. It's not perfect, and so needs tweaking in many cases, so the current state is pretty good, but for easier cases and/or when you don't care, I can see such a thing being useful.

masklinn · on May 15, 2015

> Maybe I misunderstood what the parent wants, but you're right that the optimizer can do as it pleases, and you can use annotations to help it make the right decision.

I understand TFAA's request to be a callsite annotation, which currently does not exist, e.g.

    inline foo()

to force inlining or

    noinline bar()

to prevent it, probably with the first one erroring out if the call is not inlinable.

Jweb_Guru · on May 15, 2015

I believe `#[inline(always)]` and `#[inline(never)]` both work like this.

kbenson · on May 15, 2015

According to steveklabnik here[1], those are on the definition, not the callsite, which is the distinction here. Although from other info here, it sounds like having it on the definition is a prerequisite in some cases if you wanted to somehow specify it for the callsite, as it needs to be serialized in the crate metadata to be inline-able, and that's controlled somewhat by whether it was defined as inline capable.

1: https://news.ycombinator.com/item?id=9548248

mkishi · on May 14, 2015

But those annotations are for the function definition instead of the call site, or are there call site annotations as well? I think the author's idea is analogous to the user-defined (instead of type-defined) move/copy semantics.

steveklabnik · on May 14, 2015

They're on the function definition, yes. There aren't ones at the call site.
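
A short sketch of what that looks like; the function names are invented for illustration. The attributes sit on the definitions, and every call site gets the same treatment. Both are hints to the optimizer, not guarantees.

```rust
// Strong hint that calls to this function should be inlined.
#[inline(always)]
fn add_one(x: u32) -> u32 {
    x + 1
}

// Hint that calls to this function should not be inlined.
#[inline(never)]
fn times_two(x: u32) -> u32 {
    x * 2
}

fn main() {
    // The caller has no syntax to override either attribute per call.
    println!("{}", times_two(add_one(3)));
}
```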

kbenson · on May 15, 2015

As someone who just looked this up to confirm this and reply to someone else here, the docs[1] are fairly ambiguous in this respect. As someone soliciting feedback on the docs, that's a good spot to clarify. :)

P.S. I considered submitting a PR, but I don't know enough about rust yet to accurately phrase it.

1: https://doc.rust-lang.org/reference.html#inline-attributes

bronson · on May 15, 2015

Then I'm left wondering how you interpreted the sentence you quoted and replied to...?

> I’ve always thought it should be up to the caller to say which functions they’d like inlined,

steveklabnik · on May 15, 2015

I am ridiculously tired, after burning the candle from both ends for the last few weeks to get this release shipped. In the last four days, I've written almost 1700 lines of docs. I make mistakes sometimes :)

cmrx64 · on May 15, 2015

Bindgen uses libclang. It's also quite slow.