Skip to content

ClaudioPoli/JobAds

Repository files navigation

JobAds

MIT License LinkedIn


Job Ads

Management and analysis of data related to Glassdoor platform job postings
Report Bug

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Contributing
  5. License
  6. Acknowledgments

About The Project

Management of structured and unstructured data

Design and implementation of a database relating to ads placed on the Glassdoor platform, the development steps are listed below:

  • Requirements analysis: understanding of the domain, definition of a preliminary sector scheme and of the operations that can be carried out by the system; -Conceptual modeling: application of conceptual modeling techniques and definition of an object-oriented conceptual scheme;
  • Logical modeling: application of logic modeling techniques and definition of a logical E-R scheme;
  • Physical modeling: transformation of the logical scheme into a physical scheme through the use of DDL and DML necessary for the definition of a database; -Operations: implementation of user operations regarding the management and analysis of data relating to the ads on the platform, their geographical position and the reviews associated with them through QL.

Software: draw.io [Conceptual / Logical Design], Datagrip [SQL IDE], Pycharm [Python IDE], GATE [NLP]

DBMS: PostgreSQL (PostGIS - geographic extension)

Pipeline NLP: Corpus PMI extraction, NER, Corpus Augmented TF-IDF / KyotoDomainRelevance extraction

Data source: 'https://www.kaggle.com/andresionek/data-jobs-listings-glassdoor'

(back to top)

Built With

(back to top)

Getting Started

Prerequisites

  • PostgreSQL

Installation

  1. Clone the repo
    git clone https://github.com/ClaudioPoli/JobAds.git
  2. That's it!

(back to top)

Usage

The project involves creating a database in PostgreSQL using SQL scripts, in order you need to run:

  • DDL
  • Constraint
  • DML

Subsequently it is possible to execute the scripts related to the queries useful for the extraction of different information. Executable queries can be found in:

  • BenefitAnalysis
  • NumberOfListings
  • JobAnalysis
  • IndustryAnalysis

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

Acknowledgments

(back to top)

About

Management of structured and unstructured data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published