#

etl-pipeline

Here are 1,378 public repositories matching this topic...

aohus / etl-pipeline-with-modeling

Docker를 사용하여 Hadoop 생태계의 구성 요소와 기타 필수 서비스를 컨테이너화하여 강력한 데이터 엔지니어링 환경을 설정하는 방법을 보여줍니다. 설정에는 Hadoop (HDFS, YARN), Apache Hive, PostgreSQL 및 Apache Airflow가 포함되며, 이들 모두가 원활하게 작동하도록 구성되어 있습니다.

hadoop data-modeling datawarehouse airflow-docker etl-pipeline datavault

Updated May 29, 2024
Shell

DAGWorks-Inc / hamilton

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

Updated May 28, 2024
Jupyter Notebook

ETL-awesome-api

IvanildoBarauna / ETL-awesome-api

Solução completa dedicada a realizar ETL de dados de cotações de moedas usando Python. Fonte dos dados: https://docs.awesomeapi.com.br/api-de-moedas

python data-engineering data-analytics data-analysis etl-pipeline

Updated May 28, 2024
Python

MobileTeleSystems / horizon

Simple HWM Store backend

etl rest-api etl-pipeline etl-components hwm

Updated May 28, 2024
Python

julian506 / openweathermap-etl

A simple ETL for temperature data from the Openweathermap API, storing it into an Azure SQL Database

python etl azure scheduler data-engineering azure-sql-database etl-pipeline data-engineering-pipeline

Updated May 28, 2024

stellar / stellar-etl-airflow

Airflow DAGs for the Stellar ETL project

python airflow blockchain data-analysis stellar stellar-network etl-framework etl-pipeline stellar-lumens

Updated May 28, 2024
Python

TheODDYSEY / YTS-Pipeline-Postgres

ETL pipeline 🪈 for scraping, transforming, and loading YTS movie data 🎞️ into PostgreSQL 🛢️ Container using Docker🐳

docker yaml docker-compose pandas python3 postresql etl-pipeline bs4-requests scraping-web pipeline-dock-tech

Updated May 28, 2024
Python

wri / gfw-data-api

GFW Data API

api-server metadata-api etl-pipeline

Updated May 28, 2024
Python

jvalue / jayvee

Jayvee is a domain-specific language and runtime for automated processing of data pipelines

data-science typescript data-engineering domain-specific-language data-pipeline etl-pipeline

Updated May 28, 2024
TypeScript

OP-TED / ted-rdf-conversion-pipeline

TED Semantic Web Services

rdf epo semantic-web procurement transformation eprocurement rml etl-pipeline rml-mapping cellar-sync ted-sws

Updated May 28, 2024
HTML

netwerk-digitaal-erfgoed / ld-workbench

A CLI tool for transforming large RDF datasets using pure SPARQL.

etl sparql lod linked-open-data etl-pipeline

Updated May 28, 2024
TypeScript

unstract

Zipstack / unstract

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

unstructured-data etl-pipeline llm-platform

Updated May 28, 2024
Python

chayansraj / Microsoft-Azure-Medallion-Data-pipeline

In this project we are going to create an end-to-end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.

mysql data-science cloud sql database big-data spark analytics azure data-visualization data-engineering cloud-computing azure-storage powerbi azure-data-factory etl-pipeline dataingestion azure-databricks azure-synapse-analytics

Updated May 28, 2024
Jupyter Notebook

DemoDevv / pipeline_ETL_report_enquete_clients_DT

Pipeline ETL (Extract, Transform, Load) permettant de faire une modification de la data provenant du report généré par Drag'n Survey pour que celle-ci puisse être utilisée dans l'outil de visualisation Power BI.

pipeline data-visualization survey-analysis etl-pipeline

Updated May 28, 2024
Python

onetl

MobileTeleSystems / onetl

One ETL tool to rule them all

spark etl plugin-system etl-pipeline etl-components pydantic hwm

Updated May 28, 2024
Python

incubator-streampark

apache / incubator-streampark

Make stream processing easier! Easy-to-use streaming application development framework and operation platform.

streaming apache easy-to-use etl-pipeline development-framework streampark operation-platform

Updated May 28, 2024
Java

usedatabrew / blink

OpenSource data platform to build event-driven systems. It's like Deebezium for golang :)

streaming kafka etl stream-processing data-engineering data-processing debezium stream-processor etl-pipeline

Updated May 27, 2024
Go

MobileTeleSystems / horizon-hwm-store

Horizon HWM Store for onETL

etl etl-pipeline etl-components hwm

Updated May 27, 2024
Python

jpcadena / pydantic-sqlalchemy-tutorial

Tutorial for Pydantic and SQLAlchemy

Updated May 27, 2024
Python

AyushRaiKhare / Ayush_Khare_Data_Engineering_Portfolio

Ayush @ Data Engineering Portfolio

jenkins data-science data data-visualization data-engineering dataflow dbt kubernetes-deployment data-engineer etl-pipeline data-engineering-pipeline mlops data-engineering-nanodegree

Updated May 27, 2024

Improve this page

Add a description, image, and links to the etl-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the etl-pipeline topic, visit your repo's landing page and select "manage topics."