Generative AI Foundations on AWS | Part 5: Preparing data and training at scale