15min History of Reinforcement Learning and Human Feedback