Hasn't Andrew Ng been doing this since 2008?


Wow, that video is an amazing time capsule of top AI researchers working together! But aside from controlling helicopters, the research is very different. That research was based on apprenticeship learning, where a human provides the ground truth and the algorithm learns to mimic it. This paper learns general control systems and, critically, provably stable ones, without human involvement.

