This Hadoop project involves analysing the book datasets to solve a few problem statements.
-
Updated
May 22, 2019
This Hadoop project involves analysing the book datasets to solve a few problem statements.
Install Hadoop, HDFS, Yarn and Spark on 3 Ubuntu 18.04 Machines
Exercise of using MapReduce to parse a CSV file
The projects implemented as a part of the Cloud Computing course using Big Data Technologies
A python implementation of matrix multiplication using Hadoop streaming API
Apache Hadoop docker image | Running Python MapReduce
Simulating a consultancy project for Repsol, the repository contains both the code notebook and the analysis.
Leverage the power of Apache Spark for large-scale data processing and analysis
hadoop mapreduce problems using hadoop version 3.3.0
An Ansible Role to Configure and setup Hadoop Client Node.
Un TP qui vise à familiariser les apprenants avec le système de fichiers distribué Hadoop (HDFS). Les objectifs spécifiques comprennent le démarrage des processus Hadoop, la création d'une structure d'arborescence dans le HDFS, la manipulation de fichiers en utilisant des commandes Hadoop.
Data Engineering Project with Hadoop HDFS and Kafka
Some code during learning Hadoop.
This engine will be the core of our monitoring mechanism. This engine will use the benefits of machine learning to provide a better solution with dynamic parameters.
Add a description, image, and links to the hadoop-hdfs topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-hdfs topic, visit your repo's landing page and select "manage topics."