aws-emr
Here are 126 public repositories matching this topic...
Bits of code I use during live demos
-
Updated
Jan 23, 2024 - Jupyter Notebook
An AWS based solution using AWS CloudWatch and AWS Lambda based on Python to automatically terminate AWS EMR clusters that have been idle for a specified period of time.
-
Updated
Jun 5, 2024 - Python
Terraform module to create AWS EMR resources 🇺🇦
-
Updated
May 4, 2024 - HCL
Data Analysis Exercise over Walmart Stock
-
Updated
Jun 26, 2019 - Jupyter Notebook
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
-
Updated
Jun 13, 2022 - Python
Cloud-based AI / ML workflow and data application development framework
-
Updated
Jun 1, 2024 - Python
Terraform Examples
-
Updated
Feb 9, 2020 - HCL
A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):
-
Updated
May 14, 2022 - Python
Create Data Lake on AWS S3 to store dimensional tables after processing data using Spark on AWS EMR cluster
-
Updated
Oct 10, 2019 - Python
Run Snowplow's enrichments on Amazon Elastic MapReduce with minimum fuss
-
Updated
May 22, 2023 - Ruby
An ETL pipeline that extracts data from S3, processes them using Spark, and loads the data back into S3 as a set of dimensional tables
-
Updated
May 5, 2020 - Python
A collection of airflow sample workflows for data processing on aws
-
Updated
Dec 1, 2017 - Python
A working example of Twitter -> Kafka -> Spark Streaming integration by a beginner
-
Updated
May 29, 2017 - Jupyter Notebook
Improve this page
Add a description, image, and links to the aws-emr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the aws-emr topic, visit your repo's landing page and select "manage topics."