How is data prepared for machine learning?