OpenAI's Q*?: Reinforcement Learning, Model-Based vs. Model-Free Methods, and Q-Learning