
I'm missing (and the article doesn't exactly state) what the fitness function is here.

It might just as well not care about being shot at.




We currently don't look at being shot at (although it's an interesting suggestion!). The current reward function is basically this:

  distance_reward - off_road_penalty - speeding_penalty - slow_penalty - discomfort_penalty;

Plus, if you collide or drive against traffic, the episode terminates.

https://github.com/openai/universe-windows-envs/blob/f5aad96...
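
To make the shape of that concrete, here's a rough Python sketch of a per-step reward with early termination. It is not the code from the linked repo: the `StepSignals` container, the `step_reward` function, and the idea that each term arrives as a precomputed float are all assumptions for illustration. Only the five term names and the two termination conditions come from the description above.

```python
from dataclasses import dataclass


@dataclass
class StepSignals:
    """Hypothetical per-step inputs; field names mirror the terms quoted above."""
    distance_reward: float = 0.0
    off_road_penalty: float = 0.0
    speeding_penalty: float = 0.0
    slow_penalty: float = 0.0
    discomfort_penalty: float = 0.0
    collided: bool = False
    against_traffic: bool = False


def step_reward(signals: StepSignals) -> tuple[float, bool]:
    """Return (reward, done) for one step.

    Assumption: every term is already a non-negative float; how each one
    is measured is outside the scope of this sketch.
    """
    reward = (
        signals.distance_reward
        - signals.off_road_penalty
        - signals.speeding_penalty
        - signals.slow_penalty
        - signals.discomfort_penalty
    )
    # A collision or driving against traffic ends the episode.
    done = signals.collided or signals.against_traffic
    return reward, done
```

Nothing in there reacts to being shot at, which matches the comment above: unless it shows up in one of those terms or causes a collision, the agent has no reason to care.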


Well, that part was mostly a joke. It could add a few interesting variables, though, such as deflated tyres causing the car to drift more than the AI expects.



