MCE 2018: Deep Reinforcement Learning at Scale and Self-Play | Filip Wolski