Airlines increasingly recognize that traditional customer segmentation by booking class does not reflect the complexity of passenger behavior. As one of the main providers of IT solutions for the airline industry, Amadeus has the resources and infrastructure to manage ticketing and booking data, as well as an understanding of airline needs and market particularities. By combining data sources produced by different airline systems, we applied unsupervised machine learning techniques to improve our understanding of customer behavior.
For this product development, feature engineering was applied to diverse variables, including demographic information, ancillary purchases, customer RFM (recency, frequency, monetary value), and purchase data. The entire ETL process was implemented with the Spark API in Scala (using both Spark 1.6 and 2.1), and the SparkML library was used for the clustering. Simulations were performed on our own cluster. We will present the results of a customer segmentation analysis for airlines and show how they differ from traditional rules based on business experience or intuition. More importantly, we want to show how Spark can serve as the main tool for machine learning analysis on big data to create relevant business insight for airlines.
Keywords: Airline industry; Segmentation; SparkML; Business insight
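As a rough illustration of the customer RFM variables mentioned among the engineered features, the following plain-Python sketch derives recency, frequency, and monetary value from a handful of booking records. It is not the Spark/Scala pipeline used in the project: the function name `rfm_features`, the record layout, and the sample bookings are all hypothetical and chosen only to make the idea concrete.

```python
from collections import defaultdict
from datetime import date


def rfm_features(bookings, as_of):
    """Compute per-customer RFM values.

    bookings: iterable of (customer_id, booking_date, amount) tuples
              (a hypothetical layout, not the real airline schema).
    as_of:    reference date used to measure recency.
    """
    last_seen = {}                    # most recent booking date per customer
    frequency = defaultdict(int)      # number of bookings per customer
    monetary = defaultdict(float)     # total amount spent per customer

    for customer_id, booking_date, amount in bookings:
        if customer_id not in last_seen or booking_date > last_seen[customer_id]:
            last_seen[customer_id] = booking_date
        frequency[customer_id] += 1
        monetary[customer_id] += amount

    return {
        cid: {
            "recency_days": (as_of - last_seen[cid]).days,
            "frequency": frequency[cid],
            "monetary": monetary[cid],
        }
        for cid in last_seen
    }


# Made-up sample data, for illustration only.
bookings = [
    ("c1", date(2017, 1, 10), 120.0),
    ("c1", date(2017, 3, 5), 80.0),
    ("c2", date(2016, 11, 20), 450.0),
]
features = rfm_features(bookings, as_of=date(2017, 4, 1))
```

In a setup like the one described, values of this kind would typically be scaled and combined with the other variables before being passed to a clustering step.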