Skip to content

The projects aim is to find the to best ML algorithm evaluated on its efficiency in predicting whether homes should be classified as expensive or not expensive.

Notifications You must be signed in to change notification settings

Cintia0528/Data_Science-Supervised_Machine_Learning_Classification_Housing

Repository files navigation

Supervised Machine Learning - Classification

Goal

To classify properties into "Expensive" / "Not Expensive" categories with the help of Supervised Machine Learning.

Overview

We are interested in making better investment decisions, and hence evaluating properties based on 70+ features, whether they qualify as expensive or inexpensive properties.

Context

Trying out and fine-tuning a variety of Machine Learning models to get the best prediction

  1. Is our Machine Learning model predicting the value of properties successfully?
  2. What type of errors are most prone for each of the models?

Task:

  • Import database of over 1500 properties
  • Explore, analyze and clean over 70 features
  • Try and fine-tune ML models for the best outcome

Deliverables

The Google Colab Notebook for trying out different ML algorithms is found here. Further Machine Learning experimentation with LazyPredict and VotingClassifier is found here, with a supporting Medium article here.

Skills & Tools

  1. Data Reading & Cleaning
  2. Data Splitting
  3. Building a Preprocessor
  4. Modelling ( Decision Tree, KNN, Random Forest, XGBoost)
  5. Fine Tuning
  6. Error Analysis

Further Analysis

  1. Perfecting the model with Lazy predict
  2. Pooling individual models' strength with Voting Classifier

Note: In the notebook the Lazypredict + VotingClassifier combo gave us approximately 95%, but when applied to brand new dataset via a Streamlit application it had the highest accuracy with over 97%.

About

The projects aim is to find the to best ML algorithm evaluated on its efficiency in predicting whether homes should be classified as expensive or not expensive.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published