Fixing for loops in Go 1.22

jjwiseman · on Sept 19, 2023

I know there are much earlier examples, but the earliest warning about this behavior I could find in 60 seconds of searching is from the comp.lang.lisp FAQ, posted more than 30 years ago, in 1992:

    Mar 21, 1992, 1:00:47 AM
    Last-Modified: Tue Feb 25 17:34:30 1992 by Mark Kantrowitz
    ;;; ****************************************************************
    ;;; Answers to Frequently Asked Questions about Lisp ***************
    ;;; ****************************************************************
    ;;; Written by Mark Kantrowitz and Barry Margolin
    ;;; lisp-faq-3.text -- 16886 bytes
    
    [...]

    ----------------------------------------------------------------
    [3-9] Closures don't seem to work properly when referring to the
    iteration variable in DOLIST, DOTIMES and DO.
    
    DOTIMES, DOLIST, and DO all use assignment instead of binding to
    update the value of the iteration variables. So something like
    
    (dotimes (n 10)
      (push #'(lambda () (incf n))
            *counters*))
    
    will produce 10 closures over the same value of the variable N.
    ----------------------------------------------------------------

sixthDot · on Sept 20, 2023

D too https://issues.dlang.org/show_bug.cgi?id=2043.

That's actually expected when capturing by reference.

junke · on Sept 20, 2023

In the standard it is not specified if such loops mutate or rebind, and you have to assume it doesn't rebind if you capture variables. I do think however that once you learn how it works it stops being a problem (in any case I can select the form, macroexpand it and it shows how it's implemented)

varjag · on Sept 20, 2023

In theory sure. In practice it's easy enough to make this mistake mindlessly. I had this happen to me after many years of practice just this year (in an elaborate extended LOOP form which has same semantics).

AaronFriel · on Sept 19, 2023

The C# language team encountered this as well, after introducing lightweight closures in C# 4.0 it quickly became apparent that this was a footgun. Users almost always used loop variables incorrectly, and C# 5.0 made the breaking change.

Eric Lippert has a wonderful blog on the "why" from their perspective: https://ericlippert.com/2009/11/12/closing-over-the-loop-var...

I had a bit of trouble finding the original C# 5 announcement; that's hopefully not been lost in the (several?) blog migrations on the Microsoft domain since 2012.

nerdponx · on Sept 19, 2023

Meanwhile Python has received this same feature request many times over the years, and the answer is always that it would break existing code for little major benefit https://discuss.python.org/t/make-lambdas-proper-closures/10...

Given how much of an uproar there was over changing the string type in the Python 2 -> 3 transition, I can't imagine this change would ever end up in Python before a 4.0.

Cue someone arguing about how bad Python is because it won't fix these things, and then arguing about how bad Python is because their scripts from 2003 stopped working...

travisd · on Sept 19, 2023

It's worth noting that it's much less of a problem in Python due to the lack of ergonomic closures/lambdas. You have to construct rather esoteric looking code for it to be a problem.

    add_n = []
    for n in range(10):
        add_n.append(lambda x: x + n)
    add_n[9](10)  # 19
    add_n[0](10)  # 19

This isn't to say it's *not* a footgun (and it has bit me in Python before), but it's much worse in Go due to the idiomatic use of goroutines in a loop:

    for i := 0; i < 10; i++ {
        go func() { fmt.Printf("num: %d\n", i) }()
    }

eru · on Sept 20, 2023

In Python you are much more likely to hit that problem not with closures constructed with an explicit 'lambda', but with generator-comprehension expressions.

    (((i, j) for i in "abc") for j in range(3))

The values of the above depends on in which order you evaluate the whole thing.

(Do take what I wrote with a grain of salt. Either the above is already a problem, or perhaps you also need to mix in list-comprehension expressions, too, to surface the bug.)

nerdponx · on Sept 20, 2023

Yeah, this one is weird:

  gs1 = (((i, j) for i in "abc") for j in range(3))
  gs2 = [((i, j) for i in "abc") for j in range(3)]

  print(list(map(list, gs1)))
  print(list(map(list, gs2)))

Results:

  [[('a', 0), ('b', 0), ('c', 0)], [('a', 1), ('b', 1), ('c', 1)], [('a', 2), ('b', 2), ('c', 2)]]
  [[('a', 2), ('b', 2), ('c', 2)], [('a', 2), ('b', 2), ('c', 2)], [('a', 2), ('b', 2), ('c', 2)]]

That's a nice "wat" right there. I believe the explanation is that in gs2, the range() is iterated through immediately, so j is always set to 2 before you have a chance to access any of the inner generators. Whereas in gs1 the range() is still being iterated over as you access each inner generator, so when you access the first generator j=1, then j=2, etc.

Equivalents:

  def make_gs1():
      for j in range(2):
          yield ((i, j) for i in "abc")

  def make_gs2():
      gs = []
      for j in range(2):
          gs.append(((i, j) for i in "abc"))
      return gs

Late binding applies in both cases of course, but in the first case it doesn't matter, whereas in the latter case it matters.

I think early binding would produce the same result in both cases.

_ZeD_ · on Sept 20, 2023

or you could just be eager and use lists:

    >>> [[(i, j) for i in "abc"] for j in range(3)]
    [[('a', 0), ('b', 0), ('c', 0)], [('a', 1), ('b', 1), ('c', 1)], [('a', 2), ('b', 2), ('c', 2)]]

nerdponx · on Sept 20, 2023

Right, creating generators in a loop is not usually something you want to do, but it's meant to demonstrate the complexity that arises from late binding rather than demonstrate something you would actually want to do in a real program.

Spivak · on Sept 20, 2023

Ignoring the strange nature of this code in the first place the more pythonic way to do it would be

    from functools import partial
    from operators import add

    add_n = [partial(add, n)) for n in range(10)]

    assert add_n[5](4) == 9

Look ma, no closures.

eru · on Sept 20, 2023

'partial' creates a closure for you.

Spivak · on Sept 20, 2023

Unless you're talking philosophically how classes and closures are actually isomorphic then no, it doesn't. None of the variables in the outer scope are captured in the class instance.

https://github.com/python/cpython/blob/main/Lib/functools.py...

Here's a simplified version of that code that demonstrates the pattern.

    class partial:
      def __init__(self, func, *args, **kwargs):
        self.func = func
        self.args = args
        self.kwargs = kwargs

      def __call__(self, *args, **kwargs):
        return self.func(*self.args, *args, **(self.kwargs | kwargs))

      p = partial(add, 5)  # -> an instance of partial with self.args = (5,)
      res = p(4)  # -> calls __call__ which merges the args and calls add(5, 4)

eru · on Sept 20, 2023

I was talking 'philosophically' in that sense. The partial object does create a new scope that binds a few of those variables.

But you are also right that the mechanisms in Python are different (on some suitable mid-level of abstraction) for those two.

hinkley · on Sept 19, 2023

Everyone else solved this problem by using list comprehensions instead. Rob has surely heard of those.

lmm · on Sept 19, 2023

Of the two comprehension syntaxes in Haskell, Python picked the wrong one. Do notation (or, equivalently, Scala-style for/yield) feels much more consistent and easy to use - in particular the clauses are in the same order as a regular for loop, rather than the middle-endian order used by list comprehensions.

eru · on Sept 20, 2023

Haskell has both do-notation and list comprehension.

Comprehension in both Python and Haskell (for both lists and other structures) use the same order in both language, as far as I remember.

lmm · on Sept 20, 2023

> Haskell has both do-notation and list comprehension.

Right, and do-notation is the one everyone uses, because it's better. Python picked the wrong one.

> Comprehension in both Python and Haskell (for both lists and other structures) use the same order in both language, as far as I remember.

It may be the same order as Haskell but it's a terrible confusing order. In particular if you want to go from a nested list comprehension to a flat one (or vice versa) then you have to completely rearrange the order it's written in, whereas if you go from nested do-blocks to flat do-blocks then it all makes sense.

eru · on Sept 20, 2023

I see what you mean, but I don't find the order that confusing in neither Haskell or Python.

However, I can imagine a feature that we could add to Python to fix this: make it possible for statements to have a value. Perhaps something like this:

    my_generator = \
      for i in "abc":
        for b in range(3):
          print("foo")
          yield (i, b)

or perhaps have the last statement in block be its value (just like Rust or Ruby or Haskell do with the last statement in a block), and make the value of a for-loop be a generator of the individual values:

    my_list = list(
      for i in "abc":
        for b in range(3):
          (i, b))

Though there's a bit of confusion here whether the latter examples should be a flat structure or a nested one. You could probably use a similiar mechanism as the existing 'yield from' to explicitly ask for the flat version, and otherwise get the nested one:

    my_list = list(
      for i in "abc":
        yield from for b in range(3):
          (i, b))

Making Python statements have values looks to me like the more generally useful change than tweaking comprehensions. You'd probably not need comprehension at all in that case. Especially since you can already write loop header and body on a single line like

    for i in range(10): print(i)

if they are short enough.

nerdponx · on Sept 20, 2023

  for i in range(10): print(i)

But what would that return? [None] * 10?

The limited whitespace-based syntax limits the potential for fun inline statement things, but it also completely dodges the question of what any particular statement should evaluate to when used as an expression.

eru · on Sept 21, 2023

> But what would that return? [None] * 10?

Yes, I guess something like that. That was just meant as an example of how existing Python allows you to write loops on one line. It's not a good example for a meaningful comprehension in our alternative made-up Python dialect.

> The limited whitespace-based syntax limits the potential for fun inline statement things, [...]

Python already mostly allows you to use parens to override the indentation. They would just need to generalise that a bit. Btw, Haskell already does that:

Officially, Haskell has a syntax with curly braces and semicolons; and they define the indentation based syntax as syntactic sugar that desugars to ; and {}. But almost everyone uses indentation based syntax. The exception are perhaps code generators and when posting on a website that messes with indentation.

(And, because it's Haskell, the {}; syntax is just another layer of syntactic sugar for 'weird-operator'-based based syntax like >>=.)

camgunz · on Sept 20, 2023

When I was starting in Python years ago I had to turn my brain inside out to learn how to write list comprehensions. Sometimes I wonder what it's like to be a normal person with a normal non-programmer brain, having forgotten it entirely these last many years.

nerdponx · on Sept 20, 2023

But Python doesn't have any concept of a monad, so what would do-notation even be in Python? And who is the "everyone" using do-notation? I don't see any analogous syntax in Lua, Javascript, Ruby, or Perl.

In Python there is a nice tower of abstractions for iteration, but nothing more general than that, so it makes perfect sense IMO to use the syntax that directly evokes iteration.

The existing syntax is meant to mirror the syntax of a nested for loop. I agree that maybe it's confusing, but if you want to go from a multi-for comprehension to an actual nested for loop, then you don't have to invert the order.

lmm · on Sept 20, 2023

> But Python doesn't have any concept of a monad, so what would do-notation even be in Python?

It could work on the same things that Python's current list comprehensions work on. I'm just suggesting a different syntax. Comprehensions in Haskell originally worked for all monads too.

> And who is the "everyone" using do-notation? I don't see any analogous syntax in Lua, Javascript, Ruby, or Perl.

I meant that within Haskell, everyone uses do notation rather than comprehensions.

> The existing syntax is meant to mirror the syntax of a nested for loop. I agree that maybe it's confusing, but if you want to go from a multi-for comprehension to an actual nested for loop, then you don't have to invert the order.

You have to invert half of it, which I find more confusing than having to completely invert it. do-notation style syntax (e.g. Scala-style for/yield) would keep the order completely aligned.

eru · on Sept 21, 2023

> Comprehensions in Haskell originally worked for all monads too.

And that behaviour is still accessible via a compiler extension.

nerdponx · on Sept 21, 2023

Idris 2 still has both "monad comprehensions" and an applicative equivalent called "idiom brackets".

https://idris2.readthedocs.io/en/latest/tutorial/interfaces....

rofrol · on Sept 22, 2023

https://wiki.haskell.org/Do_notation_considered_harmful

panzi · on Sept 19, 2023

How does list comprehension change anything here? This has the same problem:

    add_n = [lambda x: x + n for n in range(10)]
    add_n[9](10)  # 19
    add_n[0](10)  # 19

billyjmc · on Sept 20, 2023

I’m not sure what they mean by list comprehensions, either, but for completeness’s sake, I must point out that this is solvable by adding `n` as a keyword argument defaulting to `n`:

    add_n = [lambda x, n=n: x + n for n in range(10)]
    add_n[9](10)  # 19
    add_n[0](10)  # 10

drekipus · on Sept 20, 2023

This is the way

Also pylsp warns you about this

simiones · on Sept 20, 2023

I don't think anyone is puzzled by the Go snippet being wrong.

The bigger problem in Go is the for with range loop:

  pointersToV := make([]*val, len(values))
  for i, v := range values {
    go func() { fmt.Printf("num: %v\n", v) } () //race condition
    pointersToV[i] = &v //will contain len(values) copies of a pointer to the last item in values
  }

This is the one they are changing.

Edit: it looks like they're actually changing both of these, which is more unexpected to me. I think the C# behavior makes more sense, where only the foreach loop has a new binding of the variable in each iteration, but the normal for loop has a single lifetime.

o11c · on Sept 20, 2023

It's actually worse in Python since there's no support for variable lifetimes within a function, so the `v2` workaround is still broken. (the default-argument workaround "works" but is scary)

This makes it clear: the underlying problem is NOT about for loops - it's closures that are broken.

nerdponx · on Sept 20, 2023

It's not broken, it's a different design. Maybe worse in a lot of cases, but it's not broken. It's working as intended.

eru · on Sept 20, 2023

You could say the design is broken, but the implementation is working as intended by the design.

jshen · on Sept 19, 2023

Doesn’t go vet complain about your code? I’m not at my computer right now so can’t check.

_ikke_ · on Sept 20, 2023

> Tools have been written to identify these mistakes, but it is hard to analyze whether references to a variable outlive its iteration or not. These tools must choose between false negatives and false positives. The loopclosure analyzer used by go vet and gopls opts for false negatives, only reporting when it is sure there is a problem but missing others.

So it will warn in certain situations, but not all of them

simiones · on Sept 20, 2023

Why would it? It's perfectly correct code, it's just not doing what you'd expect.

It might complain about the race condition, to be fair, but the same issue can be reproduced without goroutines and it would be completely correct code per the semantics.

jshen · on Sept 20, 2023

In many languages "if x = 3" is perfectly valid code, but almost certainly not what the person intended "if x == 3". It's very smart to warn someone in a scenario like this.

simiones · on Sept 21, 2023

I don't really write C too much, but I thought `if err = functionWithErrorReturn() { handleError(err) }` was a somewhat common idiom.

defrost · on Sept 21, 2023

It's a common enough idiom from "stone age" bare bones K&R C, absolutely.

It's also one of the great foot-guns of C programming as there are so many other almost but not that idioms and it's never clear on casual inspection whether the result of an assignment was meant to be examined or the result of a comparison.

With the evolution of C and C sanity tools that rightfully flag such statements for double checking and the desire to not have spurious flagging, etc. it's more common in later C code to see (say)

    if ((err = someFunction()) != NOERROR) { errorHandle(err) }

that optimises down to the same intermediate code where NOERROR is 0, sure, but it makes it very clear what is going on, an intended assignment and then an intended comparison.

As with all idoms the general practice in the larger codebase and house code standard rules apply - there are other ways of doing similar things.

Frotag · on Sept 20, 2023

I've run into this once. IIRC the workaround was to add a n=n arg to the lambda

sneak · on Sept 19, 2023

Somehow, Go managed to not break old code and also fix the problem.

I think this is a good case of Python not fixing things, given that a fix exists that solves both problems.

pcl · on Sept 19, 2023

> To ensure backwards compatibility with existing code, the new semantics will only apply in packages contained in modules that declare go 1.22 or later in their go.mod files.

IshKebab · on Sept 19, 2023

Python could very easily have a similar mechanism. Hell even CMake manages to do this right, and they got "if" wrong.

The Python devs sometimes seem stubbornly attached to bugs. Another one: to reliably get Python 3 on Linux and Mac you have to run `python3`. But on Windows there's no `python3.exe`.

Will they add one? Hell no. It might confuse people or something.

Except... if you install Python from the Microsoft Store it does have `python3.exe`.

rfoo · on Sept 20, 2023

> Except... if you install Python from the Microsoft Store it does have `python3.exe`.

It's worse. If you don't install Python from the Microsoft Store there will still be a `python3.exe`. But running it just opens Microsoft Store.

Imagine how confused one could be when someone typed `python3 a.py` over a SSH session and nothing happened.

wrboyce · on Sept 19, 2023

I’ve not run “python3” in years on my Mac, and I’m almost certain I never type it into Linux machines either; either I’m losing my mind, or there are some ludicrous takes in this thread.

rat9988 · on Sept 19, 2023

You are surely losing your mind then. Python3 isn't something esoteric.

wrboyce · on Sept 19, 2023

Entirely possible, but my point was I just type “python” and Python 3 happens. Do modern OS even come with Python 2 anymore?

I’m not claiming any mystery about Python, just disputing how the modern version is invoked.

jlokier · on Sept 20, 2023

Just tried "python" and "python3" on various Linux distros, which output respectively:

On an Ubuntu 20.04 desktop VM:

  python  => Python 2.7.18 (default, Jul  1 2022, 12:27:04)
  python3 => Python 3.8.10 (default, May 26 2023, 14:05:08)

On an Ubuntu 19.04 server:

  python  => -bash: python: command not found
  python3 => Python 3.7.5 (default, Apr 19 2020, 20:18:17)

On an Ubuntu 20.10 server:

  python  => -bash: python: command not found
  python3 => Python 3.8.10 (default, Jun  2 2021, 10:49:15)

I no longer have access to some RHEL7 and RHEL8 machines used for work recently, but if I recall correctly they do this by default:

Red Hat Enterprise Linux 7:

  python  => Some version of Python 2
  python3 => Some version of Python 3

Red Hat Enterprise Linux 8:

  python  => -bash: python: command not found # (use "python2" for Python 2)
  python3 => Some version of Python 3

You can change the default behaviour of unversioned "python" to version 2 or 3 on all the above systems, I think, so if you're running a Linux distro when "python" gets you Python 3, that configuration might have been done already.

MacOS 10.15 (Catalina) does something interesting:

  python  => WARNING: Python 2.7 is not recommended.
             This version is included in macOS for compatibility with legacy software.
             Future versions of macOS will not include Python 2.7.
             Instead, it is recommended that you transition to using 'python3' from within Terminal.
             Python 2.7.16 (default, Jun  5 2020, 22:59:21)
  python3 => Python 3.8.2 (default, Jul 14 2020, 05:39:05)

Liquid_Fire · on Sept 23, 2023

To be fair, few of these would qualify as "modern". Ubuntu 19.04 and 20.10, macOS 10.15 are all out of support, and RHEL 7 is almost ten years old and nearing the end of its support.

lmm · on Sept 20, 2023

> I just type “python” and Python 3 happens.

That was the old way. Python now recommends against installing Python3 in a way that does that, and most modern *nix don't.

erik_seaberg · on Sept 19, 2023

13.5.2 has /usr/bin/python3 (it’s 3.9.6) but not python2 or just python. Not sure when they changed, and YMMV with Homebrew.

wrboyce · on Sept 20, 2023

I suspect my confusion stemmed from mostly invoking `ipython` which doesn't include the 3 suffix (ok, part of the confusion may've been pub-related too :D).

turboponyy · on Sept 20, 2023

Depending on the package manager / distribution, 'python' might be symlinked to either Python 2 or Python 3. If you don't have Python 3 installed, it might very well point to Python 2. These days it will almost certainly prefer Python 3, but I am also in the habit of actually typing 'python3' instead of 'python' because of what I assume are issues I've had in the past.

beeburrt · on Sept 19, 2023

> to reliably get Python 3 on Linux and Mac you have to run `python3`.

This is not true on my Fedora 38 system, same with current Kali linux. Although, it is the case with Ubuntu 22.04.3.

wnoise · on Sept 19, 2023

_reliably_, as in on the vast majority of machines.

billyjmc · on Sept 20, 2023

Is that really python’s fault? It seems like it’s the distro making a design decision.

wnoise · on Sept 20, 2023

Well, no, not python's fault -- clearly the distros', and they probably should be blamed. But a PEP saying python2 and python3 should invoke the correct interpreter would help motivate the distributions.

(This is isomorphic to the usual victim-blaming discussion. Fault and blame vs some ability to make a difference; it's a shame that correctly pointing out a better strategy is both used to attack victims and attacked for attacking victims in the cases when that wasn't intended.)

wnoise · on Sept 22, 2023

In fact there is a PEP: https://peps.python.org/pep-0394/

IshKebab · on Sept 20, 2023

What do you mean? Fedora 38 doesn't have `python3`? Are you sure?

nerdponx · on Sept 19, 2023

Right, and Go has the luxury of being a compiler that generates reasonably portable binaries, while Python requires the presence of an interpreter on the system at run time.

lloeki · on Sept 20, 2023

> Python requires the presence of an interpreter on the system at run time.

A runtime interpreter does not prevent Perl to do similar things via `use 5.13`

Python has `from future` with similar abilities, it would absolutely be possible to do the same as Perl and Go and fix what needs to be fixed without breaking old code. One could design a `import 3.22` and `from 3.22 import unbroken_for` and achieve the same thing.

josephg · on Sept 20, 2023

The same trick would work with python just as well. There’s nothing about Python’s status as an interpreter which would stop them from adding a python semantic version somewhere in each python program - either in a comment at the top of each source file or in an adjacent config file. The comment could specify the version of python’s semantics to use, which would allow people to opt in to new syntax (or opt out of changes which may break the code in the future).

Eg # py 3.4

Cthulhu_ · on Sept 20, 2023

Yeah, it would just mean that the interpreter - just like the Go compiler - would need to have the equivalent of "if version > 3.4 do this, else do that". Which is fine for a while, but I can imagine it adds a lot of complexity and edge cases to the interpreter / compiler.

Which makes me think that a Go 2.0 or Python 4 will mainly be about removing branches and edge cases from the compiler more than making backwards-incompatible language changes.

josephg · on Sept 20, 2023

This is the direction multiple languages are moving in. Go and Rust both have something like this. (In Rust they're called "editions"). I think its inevitable that compilers get larger over time. I hope most of the time they aren't too onerous to maintain - there aren't that many spec breaking changes between versions. But if need be, I could also imagine the compiler eventually deprecating old versions if it becomes too much of an issue.

Arguably C & C++ compilers do the same thing via -std=c99 and similar flags at compile time.

Anyway, nothing about this is special or different with python. I bet the transition to python 3 would have been much smoother if scripts could have opted in (or opted out) of the new syntax without breaking compatibility.

baq · on Sept 20, 2023

Python probably could change this with a from __future__ import, i.e. in the same way.

wrboyce · on Sept 19, 2023

By letting you specify a language version requirement? Not exactly backwards compatible (because it is explicitly not, as per the article).

Python doesn’t make breaking changes in non-major versions, so as mentioned by the upthread comment the appropriate place for this change would be in Python 4.

Given the above, I’m really not sure what point you think you’re making in that final paragraph.

carbotaniuman · on Sept 19, 2023

This seems weird to given the number of breakages and standard library changes I seem to run into every version.

wrboyce · on Sept 19, 2023

Really? I find that surprising. I don’t write as much code as I used to but I’ve been writing Python for a long time and the only standard library breakages that come to mind were during the infamous 2 -> 3 days.

What sort of problems are have you faced upgrading minor versions?

orbisvicis · on Sept 19, 2023

The docs are full of remarks like "removed in 3.0 and reintroduced in 3.4" or "deprecated in 3.10", etc. A big one is the removal of the loop parameter in asyncio, but a lot of asyncio internals are (still?) undergoing significant changes, as getting the shutdown behavior correct is surprisingly difficult. Personally it's never cause me any issues - I'm always on board with the changes.

nerdponx · on Sept 20, 2023

Asyncio was explicitly marked as provisional for years and most of the incompatible changes happened during that time. Same goes for typing. The rest of the language is very very stable.

nayuki · on Sept 20, 2023

One of the few that I remember is fractions.gcd() -> math.gcd().

See https://docs.python.org/3.0/library/fractions.html#fractions... , https://docs.python.org/3.5/library/fractions.html#fractions... , https://docs.python.org/3.9/library/fractions.html (gone), https://docs.python.org/3.5/library/math.html#math.gcd

AlphaSite · on Sept 20, 2023

They do and have made relatively small ones, e.g. promoting __future__ features to default, etc.

kazinator · on Sept 19, 2023

If the change /doesn't/ break old code, it's also poorly justified.

It means code doesn't care about the issue being addressed.

The feature is only justified if it changes existing code, such that bugs you didn't even know about are fixed.

I.e. people read about the issue, investigate their code bases and go, oh hey, we actually have a latent bug here which the change will fix.

jjnoakes · on Sept 20, 2023

There's a second motivation in my opinion. Code might work today without the change, but it could be because the author originally wrote buggy code, caught it in testing, and had to waste time tracking it down and understanding nuances that don't need to be there. Once they figured that out, they implemented an ugly workaround (adding an extra function parameter to a goroutine or shadowing the loop variable with n := n).

Good language designers want to avoid both wasting developer's time and requiring ugly workarounds. Making a change that does both, especially if it doesn't break old code, is great imo.

kazinator · on Sept 20, 2023

Whichever way you implement the semantics of the loop variable, the developer has to understand the nuances, don't you think? And those nuances have to be there; all you can do is replace them with other nuances.

If a fresh variable i is bound for each iteration, then an i++ statement in the body will not have the intended effect of generating an extra increment that skips an iteration.

If you want the other semantics, whichever one that is, the workaround is ugly.

jjnoakes · on Sept 21, 2023

I think you can choose nuances that minimize unexpected behavior in practice, and I think the go team did a good job here.

esrauch · on Sept 20, 2023

New code written today will use the new version and have the correct behavior from day 1.

Old code that is maintained will eventually be upgraded, which yes does come with work sometimes where you realize your code works on version X but not version X+10 and you do a combination of tests and reading patch notes to see what changed.

kazinator · on Sept 20, 2023

There is no "correct" behavior here; either one is a valid choice that can be documented and that programs can rely on and exploit.

Code doesn't care about when it's written, only what you run it on, and with what compatibility options.

E.g. one possibility is that ten-year-old code that wrongly assumed the opposite behavior, and has a bug, will start to work correctly on the altered implementation.

masklinn · on Sept 20, 2023

Python is in a larger bind because it only has function scoping and variable declaration is implicit. It does not have sub-function scopes.

So does not really have a good way to fix the issue, even by using a different keyword as JS did.

OTOH default parameters being evaluated at function definition make mitigating it relatively simple.

codeflo · on Sept 20, 2023

Yeah, block scoping is one of those "weird CS ideas" that I'm sure at some point early in Python's design was deemed too complicated for the intended audience, but is also quite a natural way to prevent some human errors. JavaScript made the same mistake and later fixed it (let/const).

Cthulhu_ · on Sept 20, 2023

I'm not a computer scientist so I can't rule whether function scope was a mistake, and can't see how block scoping would be considered too complicated, I personally think it fits much better with my mental model. Then again, Python doesn't have blocks in the traditional sense of the word IIRC, in C style languages the accolades are a pretty clear delineator.

Parts of my previous job were terrible because it had JS functions thousands of lines of code long where variables were constantly reused (and often had to be unset before another block of code). That said, that wasn't the fault of function scope per se, but of a bad but very productive developer.

masklinn · on Sept 20, 2023

TBF you can have block scoping in an indentation-based language, though it probably help to merge the too, as in Haskell: `let…in` will define variables in the `let` clause, and those variables are only accessible in the `in` clause (similarly case…of)

kzrdude · on Sept 20, 2023

I love python, but it's one of the biggest annoyances. Local variables like in Lua make a lot of sense.

arnsholt · on Sept 20, 2023

Python does actually have a single instance of sub-function scopes: When you say `try: ... except Exception as e: ...` the `e` variable is deleted at the end of the `except` clause. I think this is because the exception object, via the traceback, refers to the handling function's invocation record, which in turn contains a map of all the function's local variables. So if the variable worked like normal variables in Python it'd create a reference cycle and make the Python GC sad. So if you need that behaviour, you need to reassign the exception to a new name [0].

0: https://docs.python.org/3/reference/compound_stmts.html#the-...

orbisvicis · on Sept 19, 2023

Is it a bug? I've always depended on late-binding closures and I think even recently in a for loop, not that I'm going to go digging. You can do neat things with multiple functions sharing the same closure. If you don't want the behavior bind the variable to a new name in a new scope. From the post I get the sense that this is more problematic for languages with pointers.

lmm · on Sept 19, 2023

IMO it's a misdesign in the same way as e.g. JavaScript's "this". Most languages figured out 40 or so years ago that scoping should be lexical.

orbisvicis · on Sept 20, 2023

The scope is lexical, the lookup is dynamic. What you want is for each loop iteration to create a new scope, which I would categorize as "not lexical".

lmm · on Sept 20, 2023

By that argument a recursive function shouldn't create a new scope every time it recurses, and a language that fails Knuth's 1964 benchmark of reasonable scoping (the "man or boy test") would be fine. The loop body is lexically a block and like any other block it should have its own scope every time it runs.

orbisvicis · on Sept 20, 2023

Except that the for loop does not create a new scope and is not a block:

https://docs.python.org/3/reference/executionmodel.html#stru...

Dylan16807 · on Sept 20, 2023

Also, loop bodies already did have their own scope each iteration.

I wouldn't say either behavior is non-lexical. The only thing changing is which lexical scope these variables go into.

lmm · on Sept 20, 2023

If the loop "variable" (and IMO thinking of it as a variable is halfway to making the mistake) is in a single scope whose lifetime is all passes through the loop body, that's literally non-lexical; there is no block in the program text that corresponds to that scope. Lexically there's the containing function and the loop body, there's no intermediate scope nestled between them.

Dylan16807 · on Sept 20, 2023

> and IMO thinking of it as a variable is halfway to making the mistake

I used plural for a reason.

> there is no block in the program text that corresponds to that scope.

The scope starts at the for. There is a bunch of state that is tied to the loop, and if you rewrote it as a less magic kind of loop you'd need to explicitly mark a scope here.

What's non-lexical about it? You could replace "for" with "{ for" to see that a scope of "all passes through the loop body" does not require anything dynamic.

And surely whether a scope is implicit or explicit doesn't change whether a scope is lexical. In C I can write "if (1) int x=2;" and that x is scoped to an implicit block that ends at the semicolon.

Would you say an if with a declaration in it is non-lexical, because both the true block and the else block can access the variable? I would just say the if has a scope, and there are two scopes inside it, all lexical. And the same of a for loop having an outer and inner scope.

simiones · on Sept 20, 2023

The problem isn't with closures, the closure semantics are perfectly fine.

The problem is in the implementation of for-range loops, where the clear expectation is that the loop variable is scoped to each loop iteration, not to the whole loop scope (otherwise said, that the loop variable is re-bound to a new value in each loop iteration). The mental mode approximately everyone has for a loop like this:

  for _, v := range values {
    //do stuff with v
  }

is that it is equivalent to the following loop:

  for i := range values {
    v := values[i]
    //do stuff with v
  }

In Go 1.22 and later, that is exactly what the semantics will be.

In Go 1.21 or earlier, the semantics are closer to this (ignoring the empty list case for brevity):

  for i := 0, v := values[0]; i < len(values); i++, v=values[i] {
    //do stuff with v
  }

And note that this mis-design has appeared in virtually every language that has loops and closures, and has been either fixed (C# 5.0, Go 1.22) or it keeps being a footgun that people complain about (Python, Common Lisp, C++).

4bpp · on Sept 20, 2023

I don't know, my feeling is that the issue really is with how closure capture was interpreted when imperative languages started implementing lambdas. What was happening in Go seems to either amount to default capture by reference rather than value, or to the loop counters in question being unmarked reference types. The former strikes me as unintuitive given that before lambdas, reference-taking in imperative languages was universally marked (ex. &a); the latter strikes me as unintuitive because with some ugly exceptions (Java), reference types should be marked in usage (ex. *a + *b instead of a+b). Compare to C++ lambdas, where reference captures must be announced in the [] preamble with the & sigil associated with reference-taking.

(In functional languages, this problem did not arise, since most variables are immutable and those that are not are syntactically marked and decidedly second-class. In particular, you would probably not implement a loop using a mutable counter or iterator.)

simiones · on Sept 20, 2023

Even if Go allowed both capture-by-value and capture-by-reference, this issue would have arisen when using capture-by-reference.

For example, in the following C++:

  auto v = std::vector<int>{1, 2, 3};
  auto prints = std::vector<std::function<void()>>();
  auto incrs = std::vector<std::function<void()>>();
  for (auto x : v) {
    prints.push_back([&x]()->void {std::cout<<x<<", "; })
    incrs.push_back([&x]()->void {++x;});
  }
  for (auto f : incrs) {
    f();
  }
  for (auto f : prints) {
    f();
  } //expected to print 2, 3, 4; actually prints 6, 6, 6

I would also note that this problem very much arises in functional languages - it exists in the same way in Common Lisp and Scheme, and I believe it very much applies to OCaml as well (though I'm not sure how their loops work).

Tried it out, OCaml does the expected thing:

  open List
  let funs = ref [  ] ;;
  for i = 1 to 3 do
    funs := (fun () -> print_int i) :: !funs
  done ;;

  List.iter (fun f -> f()) !funs ;; //prints 321

4bpp · on Sept 20, 2023

> this issue would have arisen when using capture-by-reference

I understand - but in those languages capture-by-reference has to be an explicit choice (by writing the &) rather than the default, which highlights the actual behaviour. The problem with the old Go solution was that it would apparently behave as capture by reference without any explicit syntactic marker that it is so, and without a more natural alternative that captures by value, in a context where from other languages you would expect that the capture would happen by value.

> Common Lisp and Scheme

I have to admit I haven't worked in either outside of a tutorial setting, but my understanding is that they are quite well-known for having design choices in variable scoping that are unusual and frowned upon in modern language design

> Ocaml

Your example shows that it captures by value as I said, right? For it to work as the old Go examples, i would have to be a ref cell whose contents are updated between iterations, which is not how the semantics of for work. If it did, you'd have to use the loop counter as !i.

tsimionescu · on Sept 20, 2023

In Go 1.22 as well, closures still capture-by-reference. The change is that there is now a new loop variable in each loop iteration, just like in OCaml. But two closures that refer the same loop variable (that are created in the same iteration, that is) will still see the changes each makes to that variable.

And what I was trying to show with my example was that this kind of behavior would be observable in OCaml as well, if it were to be implemented like that.

orbisvicis · on Sept 20, 2023

I think that's a C-centric assumption which is moot as Python's "for" does not create any new scopes. Just reading Knuth's man-or-boy test I was struck by the alien nature of the ALGOL 60 execution model, even though to Python it can be considered a distant ancestor.

https://en.m.wikipedia.org/wiki/Man_or_boy_test

jerf · on Sept 19, 2023

jaredpar on the C# team offered the very first comment on the Github issue for this proposal: https://github.com/golang/go/discussions/56010

I think it played a large part in helping get past the default-deny that any language change proposal should have. The other big one for me was the scan done over the open source code base and the balance of bugs fixed versus created.

em-bee · on Sept 19, 2023

as soon as i saw mention of c# going through the same thing, i realized that this was discussed before: https://news.ycombinator.com/item?id=33160236

hinkley · on Sept 19, 2023

Java also had this problem with anonymous classes. The solution is usually to introduce a functor. Being pass-by-value, it captures the state of the variables at its call time, which helps remove some ambiguity in your code.

If you try to do something weird with variable capture, then any collections you accumulate data into (eg, for turning an array into a map), will behave differently than any declared variables.

Go is trying to thread the needle by only having loop counters work this way. But that still means that some variables act weird (it's just a variable that tends to act weird anyway). And I wonder what happens when you define multiple loop variables, which people often do when there will be custom scanning of the inputs.

simiones · on Sept 20, 2023

Java has never had this problem with variables (either in a for loop or free-floating ones), since Java has never had support for closures.

There is one somewhat similar problem in Java that you're maybe thinking of: anonymous classes that reference fields of the current object. I don't think that behavior is surprising, and there are very important use cases for it.

What Go is doing is perfectly sensible. The ability to capture variables is extremely powerful, and often desired. It's just the unexpected scoping of loop variables that introduces a problem. The following code is doing exactly what most people would expect, for example:

  a := 0
  incr := func() {a += 1}
  print := func() {fmt.Printf("%d", a)}
  print() // prints 0
  incr()
  print() //prints 1

kaba0 · on Sept 20, 2023

> An anonymous class cannot access local variables in its enclosing scope that are not declared as final or effectively final.

So no, Java didn’t have this problem.

eru · on Sept 20, 2023

Which of the various meanings of the word 'functor' are you using here?

hinkley · on Sept 20, 2023

Function that returns a function.

You pass your counter into the function, it returns a function that remembers the original value, not the value as it keeps iterating later on in the caller.

eru · on Sept 20, 2023

That's just a higher order function?

That's an interesting definition. I thought you would either go with https://en.wikipedia.org/wiki/Functor_(functional_programmin... or with https://en.wikipedia.org/wiki/Function_object

https://en.wikipedia.org/wiki/Functor_(disambiguation) has a few more choices, but doesn't seem to have yours.

Sharlin · on Sept 20, 2023

Functor must be one of the worst overloaded terms in all of computing.

pornel · on Sept 21, 2023

JS also had this problem and introduced for(let) loops.

za3faran · on Sept 21, 2023

Yet in typical golang fashion of not learning from previous languages, they chose to ignore this behavior, only to go back and try to fix it later.

tester756 · on Sept 19, 2023

It is crazy that such behaviour even gets deployed

It is so unintuitive...

wahern · on Sept 19, 2023

It's unintuitive to users of the language, but it's very intuitive from the perspective of those implementing the language. Everybody seems to make this mistake. Lua 5.0 (2003) made this mistake, but they fixed it in Lua 5.1 (2006). (Lua 5.0 was the first version with full lexical scoping.)

tester756 · on Sept 19, 2023

>It's unintuitive to users of the language, but it's very intuitive from the perspective of those implementing the language.

It sounds like a lack of dogfooding, lack of review?

catach · on Sept 19, 2023

To the degree the the implementers are also users they carry their implementer understanding into their use. Dogfooding doesn't help when your understanding doesn't match that of your users.

skywhopper · on Sept 20, 2023

The problem is that the error conditions are relatively rare. Most of the time it doesn’t break anything. So even with dogfooding you can miss it or not see it as a problem early on. But after 10 years of evidence that it was a mistake, that it’s almost never intended, and the fix won’t break much if anything, it’s time to fix it.

masklinn · on Sept 20, 2023

No, it’s just an obvious behaviour when you understand how the language works.

tester756 · on Sept 20, 2023

"if you learned how this work, then you understand this behaviour"

I disagree with this approach, despite the fact that you can logically explain this,

then it still is terrible design and as you see golang (and c# and other langs) designers realized it too and changed it

So, how can you even try to defend this design when even the designers decided to change it (despite the fact that changing lang is really hard)?

masklinn · on Sept 20, 2023

> I disagree with this approach

That is not actually relevant to the point.

> So, how can you even try to defend this design

I'm doing no such thing, I'm pointing out that your reasoning is faulty: dogfooding does not help when the behaviour is logical and obvious to the designer-cum-user, and same with reviewing.

tester756 · on Sept 20, 2023

>dogfooding does not help when the behaviour is logical and obvious to the designer-cum-user, and same with reviewing.

While I can agree with dogfodding, then review doesn't have to be done just by the people that are responsible for the design/impl.

You can have external reviewers for (let's call it) sanity check

gtowey · on Sept 19, 2023

It's not crazy. It's just the difference between a pointer and a value, which is like comp sci 101.

I think the main things that make it such a trap is that the variable type definition is implicit so the fact that it's a pointer becomes a bit hidden, and that easy concurrency means the value is evaluated outside of the loop execution more often.

Cthulhu_ · on Sept 20, 2023

> It's just the difference between a pointer and a value, which is like comp sci 101.

That might be the case, but my comp sci 101 was 15 odd years ago now and since then I have _never_ had to think about pointers vs values, until I started a Go project a few years ago. But even that was more comprehensible than the pointer wizardry we had to do in C/C++ back when.

I don't want to have to think about managing my application's memory, I much prefer being in the code, thinking of variable scope and maintainability which in a lot of languages automatically translates to healthy memory usage.

tester756 · on Sept 19, 2023

>It's not crazy.

No. Full disagree.

Array represents a concept of holding multiple values (let's simplify) of the same type.

Loop (not index based) over array represents concept of going *over* array's elements and executing some code body for each of it.

Now, if the behaviour isn't that loop's body is executed for each array element (let's forget about returns, breaks, etc)

then the design is terrible (or implementation, but that'd mean that it was a bug)

I have totally no idea how can you design this thing in such a unintuitive way unless by mistake/accidentally.

mik1998 · on Sept 19, 2023

I don't know much about Go but the design seems very intuitive to me. You're doing something like (let ((i variable)) (loop (setf i ...) ...body)), which if there is a closure in the loop body will capture the variable i and also subsequent mutations.

The fix is to make a new variable for each iteration, which is less obvious implementation wise but as per the post works better if you're enclosing over the loop variable.

simiones · on Sept 20, 2023

While I agree that the design is very clear for the cases illustrated here, and I am a bit puzzled on why the Go designers chose to change these as well, the design is not at all clear for the other case:

  for i, v := range []int{1, 2, 3} {
    funcs = append(funcs, func() {fmt.Printf("%v:%v, ", i, v)})
  }
  for _, fun := range funcs {
    fun()
  } //prints 2:3, 2:3, 2:3

The reason why this happens is clear. But, it's not what people expect from the syntax, not at all. And it's also a behavior that is never useful. There is 0 reason to capture the loop variables, as evidenced by the fact that none of the languages that have started like this and taken a breaking change to switch to the expected behavior has found even a single bug caused by this breaking change.

mik1998 · on Sept 20, 2023

> And it's also a behavior that is never useful.

False. There are cases where it is useful to have the loop variable available directly. For example, you can add one to the loop variable to skip an iteration, which would not work with an iteration-local loop variable.

simiones · on Sept 20, 2023

In a for-in-range loop, the variables are read-only inside the loop body, so there is no way to skip this.

I do agree that there are reasons to modify the iteration variable in a C-style for loop, so I am surprised that those loops are being modified as well. C#, which went through a similar change, did NOT apply such a change for those for loops.

eru · on Sept 20, 2023

And an optimizing compiler can reduce your latter case to the former, if they can prove it's safe to do so.

jrockway · on Sept 19, 2023

The loop semantics do not have anything to do with arrays. The point of confusion is whether a new slot for data is being created before each iteration, or whether the same slot is being used for each iteration. It turns out that the same slot is being used. The Go code itself is clear `for i := 0; i < 10; i++`. `i := 0` is where you declare i. Nothing else would imply that space is being allocated for each iteration; the first clause of the (;;) statement is only run before the loop. So you're using the same i for every iteration. This is surprising despite how explicit it is; programmers expect that a new slot is being allocated to store each value of i, so they take the address to it, and are surprised when it's at the same address every iteration. (Sugar like `for i, x := range xs` is even more confusing. The := strongly implies creating a new i and x, but it's not doing that!)

Basically, here are two pseudocode implementations. This is what currently happens:

     i = malloc(sizeof(int))
     *i = 0
   loop:
     <code>
     *i = *i + 1
     goto loop if *i < 10

This is what people expect:

     secret = malloc(sizeof(int))
     *secret = 0
   loop:
     i = malloc(sizeof(int))
     *i = *secret
     <code>
     *secret = *secret + 1
     goto loop if *secret < 10

You can see that they are not crazy for picking the first implementation; it's less instructions and less code, AND the for loop is pretty much exactly implementing what you're typing in. It's just so easy to forget what you're actually saying that most languages are choosing to do something like the second example (though no doubt, not allocating 8 bytes of memory for each iteration).

Remember, simple cases work:

    for i := 0; i < 10; i++ {
        fmt.Println(i) // 0 1 2 3 4 ...
    }

It's the tricky cases that are tricky:

    var is []*int
    for i := 0; i < 10; i++ {
        is = append(is, &i)
    }
    for _, i := range is {
       fmt.Println(*i) // 9 9 9 9 9 ...
    }

If you really think about it, the second example is exactly what you're asking for. You declared i into existence once. Of course its address isn't going to change every iteration.

tester756 · on Sept 19, 2023

>The loop semantics do not have anything to do with arrays.

Loop in general or "for each" style loop, that's huge difference.

The 2nd one has a lot to do with collections.

>You can see that they are not crazy for picking the first implementation; it's less instructions and less code

Yes, it is not crazy when you're looking at it from the reverse engineering / implementation side

but if you start thinking about it from user's perspective then it is very bad behaviour

because they used "foreach" like loop which is a concept of walking thru every element of collection.

acoustics · on Sept 20, 2023

I still don't see how looping over a collection is different from looping over a sequence of numbers from 1 to n.

tester756 · on Sept 20, 2023

Depends what do you actually mean by sequence, but mostly purpose.

Normal "for" is like: repeat this code body as long as condition is satisfied

Foreach is more like: walk thru this collection

Look (c#):

foreach (var item in items) ...

for (int i=0; i<10; i++) { }

In the 2nd version it is possible to jumps ahead, back, do not move, etc. Generally play around "i's" values

Meanwhile I haven't seen yet any1 trying to do anything like this in foreach, because it is meant for just walking thru collection

beeburrt · on Sept 19, 2023

I'm getting 10s for your last code example (not 9s)

jrockway · on Sept 20, 2023

Ah yup. I didn't test it and you normally never see i after that last increment!

hinkley · on Sept 19, 2023

If I understood the example, Java had this same problem. I'm wondering if C# does as well.

xmcqdpt2 · on Sept 20, 2023

It would be hard to trigger it in Java. All references are pass-by-value, so you would have to do something like creating an array, passing that array and then replacing an element in it in on every loop iteration. Unless I got something wrong, it would be hard to do this by mistake IMO.

hinkley · on Sept 20, 2023

If you do an asynchronous callback in an inner loop that tries to log the loop counter and a calculated value at the same time, you will find that the loop counter has incremented underneath you and you'll get for instance '20' for all of the logs. That was my introduction to this sort of problem.

The solution as I said elsewhere is to pop out the inner block to a separate function, where the value of the counter is captured when the outer function is called, not when the inner one runs.

simiones · on Sept 20, 2023

I don't think you remember well how you triggered this error, since Java just doesn't allow you to reference a non-final variable from an inner function. It sounds like you're talking about code like this, but this just doesn't compile:

  for (int i = 0; i < n; i++) {
    callbacks.add(new Callback(){
      public void Call() {
        System.out.println(i); //compiler error: local variables referenced from an inner class must be final or effectively final
      }
    });
  }
  for (var f : callbacks) {
    f.Call();
  }

Note that code like this works, and does the expected thing:

  for (int i : new int[]{0, 1, 2}) {
    callbacks.add(new Callback(){
      public void Call() {
        System.out.println(i);
      }
    });
  }
  for (var f : callbacks) {
    f.Call();
  } //prints 0 1 2

baq · on Sept 20, 2023

The problem is when it doesn't matter at first and then your language and its use evolves.

krackers · on Sept 19, 2023

I think https://eli.thegreenplace.net/2019/go-internals-capturing-lo... describes the problem in more detail?

msteffen · on Sept 19, 2023

This post was great—thanks for posting it!

Interestingly, the reason the old i := i trick works is not at all what I thought!

The trick, for reference:

    for i := 0; i < 5; i++ {
     i := i  // the trick 
     go func() {
      print(i)
     }()
    }

What I assumed happened:

- The escape analyzer sees that the new `i` is passed to a goroutine, so it is marked as escaping its lexical scope

- Because it's escaping its lexical scope, the allocator allocates it on the heap

- One heap allocation is done per loop iteration, and even though the new `i` is captured by reference, each goro holds a unique reference to a unique memory location

What actually happens:

- Go's compiler has heuristics to decide whether to capture by reference or by value. One component of the heuristic is that values that aren't updated after initialization are captured by value

- The new i is scoped to the for loop body, and is not updated by the for loop itself. Therefore it's identified as a value that isn't updated after initialization

- As a result, the Go compiler generates code that captures `i` by value instead of by reference. No heap allocations or anything like that are done.

I recognize that the latter behavior is better, but if anyone with intimate knowledge of Go knows why the former doesn't (also) happen (is that even how Go works?) I would love to find out!

xiaq · on Sept 19, 2023

I don’t think it’s so complex. Without i := i, there’s only one i. With i := i, there’s one i per iteration.

Closure captures are always by reference.

Heap vs stack allocation don’t affect the language semantics.

kevincox · on Sept 20, 2023

Yup. The linked article is a little confused. It thinks that an optimization to pass by value is affecting the behavior. In reality it only passes by value when it is indistinguishable from passing by reference (and it thinks it would be cheaper).

continuitylimit · on Sept 20, 2023

There is no “trick”. It’s the language spec! That go func can take arguments. Just add the argument for clarity. The “trick” here is saving the declaration in the go func’s signature. ‘go func(i0 int) { .... }(i)’

davidw · on Sept 19, 2023

That's a much better explanation for someone like me who isn't very familiar with Go.

campbel · on Sept 19, 2023

Won't this end up breaking programs that depend on the current behavior?

infogulch · on Sept 19, 2023

> To ensure backwards compatibility with existing code, the new semantics will only apply in packages contained in modules that declare go 1.22 or later in their go.mod files. ... It is also possible to use //go:build lines to control the decision on a per-file basis.

sixstringtheory · on Sept 19, 2023

Doesn't that mean that all code written so far can't take up newer versions of the Go compiler for any other reason like new features/bugfixes/optimizations/etc without a full audit of codepaths involving for loops?

kevincox · on Sept 20, 2023

No, the version declared in go.mod is different than the version of the toolchain used to compile the project. If you declare an older version even new toolchains will act like the previous versions.

ericpauley · on Sept 19, 2023

No, it does not. Packages can compile using 1.22 and gain other benefits without opting into this change.

sixstringtheory · on Sept 19, 2023

Ah, I didn't see the part about //go:build

acheong08 · on Sept 20, 2023

Without /:go:build tags, you can just define 1.21 as your Go version in go.mod to opt out of new features while getting other benefits of the new compiler

Cthulhu_ · on Sept 20, 2023

No I don't think so; any old working code will be using the x := x workaround, which will keep working when going to this version with the changed loop mechanics. What may happen is a form of... some adage, I forgot the name, where code accidentally relies on the old behaviour and breaks when that old behaviour is no longer there.

(that same adage applies to e.g. browser manufacturers having to implement bugs to not break certain websites)

Quekid5 · on Sept 20, 2023

Are you thinking of Hyrum's Law?

sgerenser · on Sept 21, 2023

Whatever, just give me an option to enable spacebar heating. https://xkcd.com/1172/

Quekid5 · on Sept 24, 2023

I enjoyed that reference :)

s17n · on Sept 20, 2023

There isn’t really anything that you could actually use the old behavior for, any code that depends on it is probably wrong anyway.

omeid2 · on Sept 19, 2023

I don't know why you're being down voted, but it is actually breaking the Go1 compat promise. Which says:

    It is intended that programs written to the Go 1 specification will continue to compile and run correctly, unchanged, over the lifetime of that specification. At some indefinite point, a Go 2 specification may arise, but until that time, Go programs that work today should continue to work even as future "point" releases of Go 1 arise (Go 1.1, Go 1.2, etc.).

freedomben · on Sept 19, 2023

I upvoted the question to offset one of the downs because I agree it's a fair question. However I would guess the downvotes are because TFA addressed this issue directly and comprehensively, so it's a clear "I didn't read the article" indicator :-) Possibly also because the downvoters can't imagine a scenario where this would be desirable behavior (i.e. it's always a bug)

colejohnson66 · on Sept 19, 2023

But if it’s a bug, then the logic to not compile future versions is wrong, IMO. If it’s a feature change, then such logic would make sense.

campbel · on Sept 19, 2023

Yeah its fair, I didn't closely read that section. Although, I'm not entirely convinced the approach is safe, maybe its worth it to fix such a common pitfall.

efuquen · on Sept 20, 2023

In a previous blog post they basically said they will never make a Go 2, and also addressed a lot of things about compatibility:

https://go.dev/blog/compat

In particular they said:

> The end of the document warns, “[It] is impossible to guarantee that no future change will break any program.” Then it lays out a number of reasons why programs might still break.

> For example, it makes sense that if your program depends on a buggy behavior and we fix the bug, your program will break. But we try very hard to break as little as possible and keep Go boring.

Cthulhu_ · on Sept 20, 2023

> In a previous blog post they basically said they will never make a Go 2

No, they didn't say that, they said it wouldn't be backwards-incompatible with Go 1. Relevant quote:

> [...] when should we expect the Go 2 specification that breaks old Go 1 programs?

> The answer is never. Go 2, in the sense of breaking with the past and no longer compiling old programs, is never going to happen. Go 2 in the sense of being the major revision of Go 1 we started toward in 2017 has already happened.

wrs · on Sept 19, 2023

Note the word “programs”, not “files”. If your program doesn’t declare go 1.22 in its go.mod, it will continue to work (or not work!), unchanged.

robertlagrant · on Sept 19, 2023

Isn't that "working with future point releases", though? If I don't declare 1.22, am I not excluded from that point release?

skywhopper · on Sept 20, 2023

This has all been addressed in the proposal. The research was done and this change will impact so few projects that it’s worth making a technical exception to the compatibility promise to fix a real design flaw.

mrkstu · on Sept 19, 2023

No, the compiler will revert to the original behavior, it only adopts the new behavior with the declaration.

ben0x539 · on Sept 19, 2023

I assume that if compiling with 1.22 or later, you still get all the benefits from that version like other new features, bug fixes or perf improvements, just not this particular change.

Cthulhu_ · on Sept 20, 2023

No, it doesn't break the promise; "Go programs that work today should continue to work even as future "point" releases of Go 1 arise (Go 1.1, Go 1.2, etc.)."

You can install Go 1.22 and your program will compile and run as-is. That's the promise. If however you opt-in to the changed for loop behaviour by adjusting your go.mod, the onus is on you to update your program accordingly.

It's only a backwards incompatible change if the developer makes a backwards incompatible change by updating the configured target version.

(I'm aware I'm probably being pedantic here, I understand the language used seems to imply you can just set it to v1.22 and it works but it's a bit more specific)

campbel · on Sept 19, 2023

I don't mind.

Yeah, I thought this kind of change wouldn't happen because of this promise.

tgv · on Sept 19, 2023

I think it was downvoted precisely because of that. It's a bit of a contentious issue.

jacquesm · on Sept 19, 2023

And they do. You can specify the precise logic to use on a per-file basis.

doctor_eval · on Sept 19, 2023

To add to the other comments, in the run-up to go1.21 they talked about how they’d analysed a very large corpus of Go code to see what would be affected, and it was a very very small number.

I remember thinking that the number of people who have created inadvertent bugs due to this design (myself included) would be significantly greater than the number of people affected by the fix.

skywhopper · on Sept 20, 2023

The original proposal for this change went into great detail about the research they did into existing uses of this syntax. In my memory, they found vanishingly few cases in the Google codebase or GitHub code where the change would violate the expected behavior. The decision to break the backwards compatibility here came only after determining how few codebases would be affected and developing a mechanism in Go itself (the version specification in go.mod) to require actively modifying the code to build with the new behavior.

tapirl · on Sept 20, 2023

A lots:

https://twitter.com/go100and1/status/1690412229135601664

https://twitter.com/go100and1/status/1690587305806057472

https://twitter.com/go100and1/status/1690589791686119424

https://twitter.com/go100and1/status/1690591234715492352

https://twitter.com/go100and1/status/1690593184857145344

https://twitter.com/go100and1/status/1691456732151889920

Most of them are not mentioned in the proposal doc at all.

nerdponx · on Sept 19, 2023

Python has the same problem (to the extent that it's actually a problem, which you might or might not agree with), and this is the #1 reason they won't change it.

matthewmueller · on Sept 19, 2023

It's only enabled for modules that run Go 1.22 and higher

badrequest · on Sept 19, 2023

I also can't imagine a case where it is useful or even truly intended to rely on this behavior.

ben0x539 · on Sept 19, 2023

Yeah I don't think it's so much "we explicitly rely on this behavior, how dare you change this" as "somewhere in our mountains of maintenance-mode code that haven't seen the sun shine through an editor window in years, this behavior cancels out another bug that we never noticed". Tooling should be able to detect when code relies on this, but it's still gonna cost some non-zero amount of developer effort to touch ancient code and safely roll out a new version if it needs to be actively addressed.

rsc · on Sept 19, 2023

If you have tests and they break with GOEXPERIMENT=loopvar, then there is a new tool that will tell you exactly which loop is causing the breakage. That's a post for a few weeks from now.

kuchenbecker · on Sept 20, 2023

Cthulhu_ · on Sept 20, 2023

Neither can I, but there may be cases of code accidentally relying on it - there's an adage that I forgot the name of that says just that, and I think compiler manufacturers are the most aware of that adage.

omginternets · on Sept 19, 2023

Yeah it’s def a code smell …

minroot · on Sept 19, 2023

Some time spent with Go gives a strong indication that Go team always has backwards compatibility in mind

jacquesm · on Sept 19, 2023

They do. Go has avoided most of the pitfalls that other language eco-systems have fallen for over the years (backwards compatibility issues, soft forks masquerading as language improvements, re-booting the whole language under the same name, aggressively pushing down on other languages etc). They've done remarkably well in those respects, and should deserve huge credit for it.

beltsazar · on Sept 20, 2023

Yes, the loopvar change will break some programs, and hence the compatibility promise. But the Go team argues that the change will fix much more programs than it will break [1].

This makes me wonder, though, what guarantees that a similar breaking change won't ever happen again in the future? If any change with #(programs fixed) >> #(programs broken) is accepted, we might as well remove the compatibility promise page [2].

---

[1] https://github.com/golang/go/issues/60078

[2] https://go.dev/doc/go1compat

baq · on Sept 20, 2023

You may also wonder how many would it fix if it wasn't hidden behind a flag?

omginternets · on Sept 19, 2023

Seems like yes, though hopefully that should be rare.

kzrdude · on Sept 19, 2023

I have run into this problem in Python too, but not recently. I'm not sure if Python has changed or if I just caught on to the problem.

This should be enough to show that it still can wind up as a problem in Python:

    funcs = [(lambda: x) for x in range(3)]
    funcs[0]()  # outputs: 2

paulddraper · on Sept 19, 2023

That is correct.

Python used to be worse; it used to share scope outside the list comprehension.

nightfly · on Sept 19, 2023

It still does for regular loops right?

thatguysaguy · on Sept 20, 2023

Yeah loops don't get their own scopes (unless you add one using this thing I made as a joke: https://github.com/davisyoshida/360blockscope)

pulvinar · on Sept 19, 2023

GPT-4 says: The behavior you're observing is due to the late binding nature of closures in Python. When you use a lambda inside a list comprehension (or any loop), it captures a reference to the variable x, not its current value. By the time you call funcs[0](), x has already been set to the last value in the range, which is 2.

To get the desired behavior, you can pass x as a default argument to the lambda:

   funcs = [(lambda x=x: x) for x in range(3)]
   funcs[0]() # outputs 0

scottlamb · on Sept 19, 2023

I've written a tiny bit of Go and am aware of the general problem this solves. I don't get their more subtle examples (the letsencrypt one or "range c.informerMap" vs "range alarms".

When you do "for k, v := range someMap", is "v" of the map's value type (and one binding for the whole loop, copied before each iteration)? This would explain the problem, but I would have expected "v" to be a reference into the map, and I couldn't find the answer in a quick skim of the "For statements with range clause" in the spec. I'm probably looking in the wrong place because I touch Go rarely...

[1] https://go.dev/ref/spec#For_statements

edit: oh, the answer is in the "code block"-formatted table. Guess I had banner blindness. "v" is the copied value, not a reference. I'm surprised!

erik_seaberg · on Sept 19, 2023

Go doesn’t support pointers to map keys or values. It does support pointers to array slots, but for-range copies each slot rather than giving you a pointer to it.

scottlamb · on Sept 19, 2023

I suppose that makes sense when I think about it for a bit. My recent expectations come from work in Rust. There the language prevents you from mutating a map while holding a reference into it. Go doesn't have a mechanism to prevent that (except the one you said, simply not supporting those references at all). If you had a reference into a map that was resized because of a subsequent mutation, your reference would have to keep the whole previous map alive and point to different memory than a reference acquired since then. Both seem undesirable.

With array slots, the same issue is present but is a bit more explicit because those resizes happen with `mySlice = append(mySlice, ...)`.

erik_seaberg · on Sept 19, 2023

I think the slice append semantics are very error-prone, and it would have been better if a slice was a shareable reference to a single mutable thing, like a map (or a list from Python or Java or …)

konart · on Sept 20, 2023

>If you had a reference into a map

Maps in golang are of reference type, just to be clear.

https://go.dev/blog/maps

implementation: https://go.dev/src/runtime/map.go

foldr · on Sept 21, 2023

Indeed, removing values from a map while iterating over it in Go is safe and guaranteed to have the expected behavior.

tedunangst · on Sept 19, 2023

If you have a map of string to int, then v is of type int. It's a value. It's not pointer to int.