Assignment for Distributed Data Processing Course. Pipelined Hash Join and Semi Join algorithms between tables in different SQLite databases

VasilikiZarkadoula/Distributed-Data-Processing-Auth-2023

Distributed Data Processing Auth2023 - Docker Application

Prerequisites

To run this application, you need Docker installed on your machine. You can download it from the official website: https://www.docker.com

Getting Started

Follow the steps below to run the application using Docker:

  1. Open a terminal and clone the repository to your local machine:
git clone https://github.com/VasilikiZarkadoula/Distributed-Data-Processing-Auth-2023
  2. Navigate to the repository directory:
cd Distributed-Data-Processing-Auth-2023
  3. Build the Docker image using the provided Dockerfile:
docker build -t myapp .
  4. Run the Docker container:
docker run myapp

This will execute the Python scripts in the following order:

  1. create_table.py: Creates tables in the databases.

  2. check_tables.py: Performs checks on the tables.

  3. pipelined_hash_join.py: Executes the pipelined hash join algorithm.

  4. semi_join.py: Performs the semi-join operation.

Once the container finishes running, you can inspect the results and any other output generated by the Python scripts.
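For readers unfamiliar with the two algorithms named above, the following is a minimal sketch of both, not the code in pipelined_hash_join.py or semi_join.py. It assumes two in-memory SQLite databases that each hold a hypothetical table `t(key, value)`; the helper `make_db` and all table and column names are illustrative assumptions. The pipelined (symmetric) hash join reads both inputs alternately and emits a match as soon as the second row of a pair arrives. The semi-join sends only the distinct join keys of one table to the other database and gets back the rows that match.

```python
import sqlite3
from collections import defaultdict
from itertools import zip_longest


def make_db(rows):
    """Hypothetical helper: in-memory SQLite database with one table t(key, value)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE t (key INTEGER, value TEXT)")
    conn.executemany("INSERT INTO t VALUES (?, ?)", rows)
    conn.commit()
    return conn


def pipelined_hash_join(conn_a, conn_b):
    """Symmetric hash join: one hash table per input, results are yielded early."""
    hash_a, hash_b = defaultdict(list), defaultdict(list)
    cur_a = conn_a.execute("SELECT key, value FROM t")
    cur_b = conn_b.execute("SELECT key, value FROM t")
    # Alternate between the two inputs; zip_longest pads the shorter one with None.
    for row_a, row_b in zip_longest(cur_a, cur_b):
        if row_a is not None:
            key, value = row_a
            hash_a[key].append(value)            # insert into own hash table
            for other in hash_b.get(key, ()):    # probe the other side's table
                yield key, value, other
        if row_b is not None:
            key, value = row_b
            hash_b[key].append(value)
            for other in hash_a.get(key, ()):
                yield key, other, value


def semi_join(conn_a, conn_b):
    """Semi-join: ship A's distinct keys to B, return only B's matching rows."""
    keys = [k for (k,) in conn_a.execute("SELECT DISTINCT key FROM t")]
    if not keys:
        return []
    # One placeholder per key; SQLite caps the number of bound variables,
    # so a real implementation would send large key sets in batches.
    placeholders = ",".join("?" * len(keys))
    query = (
        f"SELECT key, value FROM t WHERE key IN ({placeholders}) "
        "ORDER BY key, value"
    )
    return conn_b.execute(query, keys).fetchall()


if __name__ == "__main__":
    db_a = make_db([(1, "a1"), (2, "a2"), (3, "a3")])
    db_b = make_db([(2, "b2"), (3, "b3"), (3, "b3x"), (4, "b4")])
    for joined in pipelined_hash_join(db_a, db_b):
        print("join:", joined)
    print("semi-join:", semi_join(db_a, db_b))
```

In the sketch, each matching pair is emitted exactly once, when the later of its two rows is read, so the join needs no blocking build phase. The semi-join returns the reduced table from the second database; joining it with the first table afterwards moves fewer rows between the databases than shipping the whole table would.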
