apache-hadoop

Here are 73 public repositories matching this topic...

gangodu / cloud

AWS Cloudera Hadoop setup with H2O, Spark, MR

aws aws-lambda aws-s3 bigdata java-8 aws-ec2 mapreduce big-data-analytics maven-pom apache-hadoop

Updated Apr 24, 2017
Java

sawadogosalif / Big-Data-Technologies

Big Data Technologies can be defined as software tools for analyzing, processing, and extracting data from extremely large and complex data sets that traditional data management tools cannot handle

apache-spark apache-kafka apache-hive apache-hadoop apache-hbase pysark

Updated Apr 30, 2022
Python

smohammadhejazi / mapreduce-on-twitter-dataset

Applying MapReduce in Java on a Twitter dataset using Apache Hadoop

mapreduce-java apache-hadoop

Updated Feb 6, 2022
Java

Narius2030 / Sakila-Business-Analysis

Implements a Hive data warehouse to store meaningful data and applies machine learning techniques such as clustering or regression to business problems

machine-learning data-analysis apache-hive hiveql etl-pipeline apache-hadoop pyhive

Updated May 25, 2024
Jupyter Notebook

luckyp71 / hadoop-hbase-phoenix-zookeeper-integration

Hadoop, HBase, Phoenix, and ZooKeeper Integration

phoenix big-data hadoop bigdata hbase zookeeper apache-phoenix apache-zookeeper apache-hadoop

Updated May 13, 2018
Shell

Umer86 / Dice-Big-Data-Certification

This repository contains all the material related to this big data certification.

yarn hive impala pyspark hdfs mapreduce databricks sqoop apache-hadoop

Updated Nov 7, 2022

shuuji3 / spark-ceph-connector

🌟Spark Ceph Connector: Implementation of Hadoop Filesystem API for Ceph

spark apache-spark hadoop ceph apache-hadoop

Updated Aug 25, 2020
Scala

unobatbayar / big-data-processing

Learning Apache Hadoop for big data, and exploring MapReduce, Apache Spark RDDs, distributed processing, and stream processing

big-data map-reduce apache-hadoop

Updated May 27, 2020
Python

berksudan / Analysis-on-Big-Data-with-Hadoop

Implementation of statistical methods using the Hadoop MapReduce library.

hadoop bigdata mapreduce hadoop-mapreduce mapreduce-java apache-hadoop

Updated Dec 9, 2019
Java

Bahaabrougui / Big-Data-Smart-Cars-Pipeline-ServerSide

Big Data pipeline for real-time sensor fusion and predictive analysis.

java couchdb docker apache-spark yarn cloudera hdfs apache-kafka apache-hadoop

Updated Jul 1, 2022
Java

rachmanz / WSL2DW

WSL2 installation for the ABD practicum

derby-database apache-hive apache-hadoop

Updated Mar 7, 2024

shawnzhu / docker-hive-1

Docker image for Hive Metastore

apache-hive apache-hadoop

Updated Oct 19, 2020
Dockerfile

aquib-sh / setup-hadoop

A Bash script to set up Apache Hadoop and Apache Hive with a Derby database on Debian GNU/Linux

linux bash hive hadoop debian ubuntu shell-script hadoop-cluster bash-script derby setup-script hadoop-hdfs apache-hadoop

Updated Dec 7, 2022
Shell

0LIFR1 / runtime-analytics

Batch processing runtime analytics

python sql big-data spark pandas apache-hadoop

Updated Dec 27, 2022

Chabane / spark-custom-datasource

apache-spark pyspark inputformat apache-arrow apache-hadoop

Updated Mar 25, 2019
Java

Coursal / Hadoop-Letter-File-Index-Counter

A Hadoop-based Java project that counts the max number of word occurrences for each letter in the text files of a folder.

java map hadoop mapper reducer reduce mapreduce wordcount word-count hadoop-mapreduce word-counter apache-hadoop wordcounter

Updated Nov 2, 2020
Java

VikentiosVitalis / advanced_topics_in_database_systems

Data science project for the 'Advanced Topics in Database Systems' M.Sc. course, ECE @ntua

python data-science big-data apache-spark pyspark apache-hadoop

Updated Jan 17, 2024
Python

bayudwiyansatria / library-java-apache-hadoop

Apache Hadoop. Apache Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. Originally designed for co…

library libraries java-library apache-hadoop apache-hadoop-framework bayudwiyansatria apache-hadoop-library

Updated Oct 7, 2021
Java

felidsche / cloud-computing-2020

Repository for the master's course Cloud Computing of the TU Berlin in the winter term 2020/21.

java bash openstack apache-flink google-cloud-platform kolla-ansible apache-hadoop

Updated Apr 14, 2021
Shell

lepetitprinz / apache-hadoop

Hands-on learning Hadoop

yarn hive hbase apache-hadoop

Updated Apr 17, 2023
Java
