Skip to content

abhishekmsharma/big-data-electricity-consumption-analysis-apache-spark

Repository files navigation

big-data-electricity-consumption-analysis-apache-spark

Developed for analysing and visualizing trends related to electricity and energy consumption

The project worked on a dataset containing more than 2 million records about electricity consumption on a per minute basis. The plethora of data was read and processed using Apache Spark Streaming. Spark Machine Learning Library (MLlib) was used for analyzing the usage patters, clustering the data points, and predicting the trends in electricity consumption.

Technology used: Apache Hadoop, Apache Spark, Spark MLlib, Java

Data: https://archive.ics.uci.edu/ml/datasets/individual+household+electric+power+consumption

Releases

No releases published

Packages

No packages published

Languages