Skip to content

Airline Marketing Study: prediction of customer satisfaction and customer clustering using sklearn libraries.

Notifications You must be signed in to change notification settings

leo-cavalcante/airline-passenger-satisfaction

Repository files navigation

Ironhack Logo

Airline Passengers Satisfaction Classification by Léo

Leonardo Cavalcante Araújo

Data Analytics Full-Time FEB2021, Paris & March 25th 2021

Content

Airline Presentation Cover

Project Description

Customer analytics: Data Analysis and Visualizations using the Airline Passenger Satisfaction dataset from Kaggle.

Content of the dataset

  • Socio-demographics data from customers.
  • Service attributes satisfaction grades given by the customers.
  • Satisfaction/Dissatisfaction of customers.

Objective

  • Predict dissatisfaction: predictive analysis using sklearn library.
  • Customers Clustering: classification analysis using sklearn library.

Workflow

  1. Data search, import and setting the libraries Pandas, Numpy, Matplotlib, Seaborn and SKlearn.
  2. Data cleaning Clean data and prepare for analysis.
  3. Data Analysis and Visualization.
  4. Dissatisfaction Prediction Analysis: dataset preparation, normalization of attributes, one hot encoding, feature selection and applying model.
  5. Customer Clustering: same steps as before, but with one difference: features transformation with Principal Component Analysis (PCA). Thus, clustering customers in 5 different customer segments.
  6. Presentation: construction and oral presentation to the students of Ironhack Data Cohort.

Organization

PS.: individual project.

Links

Here you may find the relevant links for the repository, the main code and the final presentation slides.

GitHub Repository: airline-passenger-satisfaction

Google Slides: Airline marketing Study Presentation

Jupyter Notebook: Airline Passenger Satisfaction

Releases

No releases published

Packages

No packages published