Skip to content

seyedmahdiamin1998/ETL_catawiki

Repository files navigation

About The Project

Process

It's a project to extract data from a database, transform them, and finally load it on another database. (ETL: Extract --> transform --> load)

In this project, I extract Jewelry data from the PostgreSQL database which we collected in the previous project from catawiki website. After that, I transform this data and store useful features in another PostgreSQL database.

Getting Started

Requirements

$ pip install -r requirements.txt

or if you use pipenv for managing virtual environments you can either install dependencies by code below.

$ pipenv install

Usage

To run the projecct:

python3 ETL.py

catawiki_datawarehouse

License

It's a personal project, to show my abilities in the area of data. If you want to use this repo, please my GitHub URL in your project.

Contact

LinkedIn : https://www.linkedin.com/in/seyedmahdiamin/