
We currently don't look at being shot at (although it's an interesting suggestion!) - the current reward function is basically this:

  distance_reward - off_road_penalty - speeding_penalty - slow_penalty - discomfort_penalty;

On top of that, the episode terminates if you collide or drive against traffic.

https://github.com/openai/universe-windows-envs/blob/f5aad96...
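
For anyone who wants to play with the shape of this, here is a rough Python sketch of the formula above. It is not the code from the linked repo: the `StepTerms` container, the `step_reward` function, and the `collided` / `against_traffic` flags are hypothetical names made up for illustration, and the sketch assumes each term has already been computed as a non-negative number for the current step. Whether the terminating step still gets a reward isn't stated above, so this simply returns the computed value alongside the done flag.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class StepTerms:
    """Per-step terms named after the formula above (hypothetical container)."""
    distance_reward: float = 0.0
    off_road_penalty: float = 0.0
    speeding_penalty: float = 0.0
    slow_penalty: float = 0.0
    discomfort_penalty: float = 0.0
    collided: bool = False          # assumed flag, not from the repo
    against_traffic: bool = False   # assumed flag, not from the repo


def step_reward(terms: StepTerms) -> Tuple[float, bool]:
    """Return (reward, done) for one step."""
    # Progress is rewarded; every penalty is subtracted from it.
    reward = (
        terms.distance_reward
        - terms.off_road_penalty
        - terms.speeding_penalty
        - terms.slow_penalty
        - terms.discomfort_penalty
    )
    # A collision or driving against traffic ends the episode.
    done = terms.collided or terms.against_traffic
    return reward, done
```

Calling `step_reward(StepTerms(distance_reward=10.0, off_road_penalty=2.0))` gives a reward of 8.0 with the episode still running; setting `collided=True` flips the done flag.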



