It doesn't matter; the most important factor is the training data. The network implementation, and sometimes even the topology, doesn't matter much as long as it has some basic properties, one of which is allowing pairwise interactions between tokens.
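
To make "pairwise interactions" concrete, here's a rough numpy sketch (my own toy illustration, not any specific architecture; the function name and dimensions are made up) of a mixing step where every token's output depends on its similarity to every other token:

    import numpy as np

    def pairwise_mixing(x):                # toy illustration; x: (seq_len, d) token embeddings
        scores = x @ x.T / np.sqrt(x.shape[-1])                           # (seq_len, seq_len) pairwise scores
        weights = np.exp(scores) / np.exp(scores).sum(-1, keepdims=True)  # row-wise softmax
        return weights @ x                 # each output token is a mixture of all tokens

    x = np.random.randn(5, 16)
    print(pairwise_mixing(x).shape)        # (5, 16)

Attention is the pairwise version of this; RWKV, S4 and MLP-Mixer mix tokens differently, but the point is that the exact mixing mechanism matters less than having one.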

We have seen RWKV, S4, MLP-Mixer, T5, etc. - they all give results comparable to a vanilla GPT when trained on the same dataset. Similarly, no two people have identical neural wiring, but when they take the same course they gain similar abilities. There are only small differences.

On the other hand, it appears that even small models like Phi-1.5, when trained on "textbook quality" data, perform like models 5x their size, and when you train a big LLM on 10x more data - GPT-4 was rumored to be trained on 13T tokens - its abilities are superior to all other models.

So better data or more data makes a difference; architecture makes little difference, controlling for model size. Interesting tidbit - since transformers were invented in 2017, almost no change to the architecture has been widely adopted, only a small change or two (pre-norm instead of post-norm, and GELU activation in the feedforward), and not for lack of trying. There are hundreds of papers trying to invent a better transformer. Where are they now?
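
For reference, here's a rough PyTorch sketch (my own illustration, not taken from any particular paper; class name and sizes are arbitrary) of a block with those two tweaks applied to the 2017 recipe: normalization moved before each sublayer, and GELU instead of ReLU in the feedforward:

    import torch
    import torch.nn as nn

    class PreNormBlock(nn.Module):         # toy sketch of a modern transformer block
        def __init__(self, d_model=512, n_heads=8, d_ff=2048):
            super().__init__()
            self.norm1 = nn.LayerNorm(d_model)
            self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
            self.norm2 = nn.LayerNorm(d_model)
            self.ff = nn.Sequential(
                nn.Linear(d_model, d_ff),
                nn.GELU(),                 # GELU replaces the original ReLU
                nn.Linear(d_ff, d_model),
            )

        def forward(self, x):              # x: (batch, seq_len, d_model)
            h = self.norm1(x)              # pre-norm: normalize before attention
            a, _ = self.attn(h, h, h)
            x = x + a                      # residual around attention
            return x + self.ff(self.norm2(x))   # pre-norm + GELU feedforward, residual

Everything else (residual stream, attention, stacked blocks) is still the 2017 design.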

The brain just sits inside a better data engine than AI models do. That's the magic behind the brain: not some kind of special learning ability, but continual learning from a persistent environment, one with the highest level of complexity and diversity. Humans can make causal interventions in the environment to test their hypotheses. AI models trained on the internet have no such power of causal intervention unless we provide embodiment and an environment.

Humans also have access to many other humans; AIs train on a text corpus, not live humans. I reject the idea that brains have some special sauce. It's everything else around the brain that is better.

Thanks for putting language to a vague feeling I've had wrt "causal intervention". I tend to think there's no such thing as a self, or self-preservation/fear-of-death, or desire-to-love that might facilitate actual agentic decision making, without being embedded in a fight for survival against an adversarial environment.

As far as embodiment and environment go, does simulating an environment get us anywhere? Agents given avatars in games practice causal intervention, no? Is that too a matter of training a model in a rich enough, accurate enough simulation? I guess that's the problem self-driving cars face, and they at least have a programmed-in fear of mortality, tho not their own, in the form of avoiding a fatal wreck.


> does simulating an environment get us anywhere?

Sure does. One of the few super-human AIs, AlphaZero, learned from an environment made of the Go board and its opponent. Diverse exploration is essential, but ultimately everything was learned from that one bit of reward signal at the end of the game.
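
Not AlphaZero itself, but here's a toy Python sketch of that point: a policy over moves can be trained with nothing but a win/lose signal delivered once, at the end of each game (a basic REINFORCE-style update; every name and number here is made up for the illustration):

    import numpy as np

    rng = np.random.default_rng(0)
    theta = np.zeros(2)                    # toy "policy": preferences over two possible moves

    def play_episode(theta):
        probs = np.exp(theta) / np.exp(theta).sum()
        moves = [rng.choice(2, p=probs) for _ in range(10)]    # play 10 moves
        reward = 1.0 if sum(moves) > 5 else -1.0               # one bit, only at the end
        return moves, probs, reward

    for _ in range(500):
        moves, probs, reward = play_episode(theta)
        for m in moves:
            grad = np.eye(2)[m] - probs    # gradient of log-prob of the chosen move
            theta += 0.01 * reward * grad  # the single terminal reward credits every move

    print(theta)                           # move 1 ends up preferred

AlphaZero adds search and a value network on top, but the supervision from the environment really is that sparse.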
