Skip to content

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

Notifications You must be signed in to change notification settings

roshankoirala/pySpark_tutorial

Repository files navigation

pySpark_tutorial

List of contents

  • RDDs and DataFrame
  • Exploratory data analysis
  • Handeling multiple dataframes
  • Visualization
  • Machine learning

About

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published