Ongoing projects in data analysis and visualization.

j9pino/PortfolioProjects

Data Analysis Portfolio Projects

This repository demonstrates my growing experience with data analysis across a range of applications and programming languages.

The following projects are currently available:

Movie Industry Correlation

Tools: Python, Jupyter Notebook
Techniques: data cleaning, analysis, and visualization using pandas, seaborn, numpy, and matplotlib
Dataset: https://www.kaggle.com/datasets/danielgrijalvas/movies?resource=download

Twitter Sentiment Analysis

Tools: Python, Jupyter Notebook
Techniques: data cleaning, analysis, and visualization using NLTK, lambda functions, seaborn, and plotly
Dataset: Tweets with COVID-19 hashtags posted from July 24, 2020 to August 30, 2020, provided by a Coursera guided project.

Bibliometrics: Data Gathering and Cleaning

Tools: Web of Science, Sci2, OpenRefine, Power BI
Techniques: .isi file conversion with Sci2, data cleaning with OpenRefine, and basic productivity charts with Power BI
Dataset: 2017-2022 publication data for Tennessee Technological University, collected from the Web of Science database.

Bibliometrics: Data Analysis and Presentation

Tools: Sci2, Gephi, Inkscape
Techniques: network analysis and pruning with Sci2, and visualization with Gephi
Dataset: 2017-2022 publication data for Tennessee Technological University, collected from the Web of Science database.