Introduction to Proximal Policy Optimization algorithm (PPO)