Skip to content

Analyzing Auto Dataset in order to study the best predictors of car price (as part of the Data Science Certification program at Cognitive Class)

Notifications You must be signed in to change notification settings

franmore/auto-data-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

logo Cognitive Class

Automobile Data Analysis

(as part of the Data Science Certification Program from IBM Cognitive Class)

Table of Contents

  • Data Acquisition
  • Data Wrangling
    • Identify and handle missing values
      • Identify missing values
      • Deal with missing values
      • Correct data format
    • Data standardization
    • Data Normalization
    • Binning
    • Dummy Variable
  • Exploratory Data Analysis
    • Analyzing Individual Feature Patterns using Visualization
    • Descriptive Statistical Analysis
    • Basics of Grouping
    • Correlation and Causation
    • ANOVA

Extract of the dataset

auto-dataset-extract

Snapshots of the study

horse_power_bins engine-vs-size engine-location-vs-price drive-wheels-vs-price heatmap_body-vs-drive-wheels

Conclusion

Important variables to take into account when predicting the car price :

Continuous numerical variables:

  • Length
  • Width
  • Curb-weight
  • Engine-size
  • Horsepower
  • City-mpg
  • Highway-mpg
  • Wheel-base
  • Bore

Categorical variables:

  • Drive-wheels

About

Analyzing Auto Dataset in order to study the best predictors of car price (as part of the Data Science Certification program at Cognitive Class)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published