Skip to content

Anomaly Detection on the Enron financial-email dataset, using specialised unsupervised machine learning algorithms: One-class SVM, Isolation Forest, LOF.

License

Notifications You must be signed in to change notification settings

najeebuddinm98/enron_anomaly_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Anomaly Detection on Enron Dataset

In this project, we aim to build and train models based on machine learning algorithms commonly used for unsupervised anomaly detection; namely

  • one-class Support Vector Machine (SVM)

  • Isolation Forest

  • Local Outlier Factor (LOF)

The dataset used is a modified version of the Enron financial + email dataset that contains information about Enron Corporation, an energy, commodities, and services company that infamously went bankrupt in December 2001 as a result of fraudulent business practices. The dataset we have obtained is from the Udacity Data Analyst Nanodegree and their GitHub page. Inspiration for loading and preprocessing the dataset was taken from Will Koehrsen's Medium article.

The Enron dataset is widely used to try and develop models that can identify the persons of interests (POIs), i.e. individuals who were eventually tried for fraud or criminal activity in the Enron investigation, from the features within the data. The email + financial data contains the emails themselves, metadata about the emails such as number received by and sent from each individual, and financial information including salary and stock options.

NOTE:

All references are presented in the form of appropriate hyperlinks within the paragraphs both here and in the notebook rather than in a separate section.

About

Anomaly Detection on the Enron financial-email dataset, using specialised unsupervised machine learning algorithms: One-class SVM, Isolation Forest, LOF.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published