Hacker News
Reinforcement learning with unsupervised auxiliary tasks (deepmind.com)
102 points by joaorico on Nov 20, 2016 | 2 comments



> On Atari the agent now achieves on average 9x human performance

Very impressive. I guess the human limit comes from how few things a person can track at once? I wonder if they could apply this to optimizing traffic lights in a big city.


Can't wait to see how it will handle Starcraft 2.



