Skip to content

This workshop aims to make use of telco data set to work with CDP utilizing the Data services (CDF, CDE, CDW & CML) for Telecom Churn Use Case.

Notifications You must be signed in to change notification settings

DashDipti/e2e-cdp-telcochurn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cloudera Technical Workshop


Version : 1.0.0 25th March 2024


landing1

Introduction

Cloudera Data Platform (CDP) has been built from the ground up to support hybrid, multi-cloud data management in support of a Data Fabric architecture. This workshop introduces CDP, with a focus on the data management capabilities that enable the Data Fabric and Data Lakehouse. The workshop will include 5 key components which are: Cloudera Data Flow, Cloudera Data Engineering, Cloudera Data Warehouse, Cloudera Machine Learning & Data Visualization. A brief description of the same is provided below in the below overview section.

If you are new to Cloudera Data Platform then please spend some time in knowing more about the platform by clicking here & here.

Data Services

In this workshop, we will be getting to experience 4 out of the 5 key Data services that CDP offeres which covers the entire data lifecycle. They are:

Cloudera DataFlow (CDF) offers a flow-based low-code development paradigm that aligns best with how developers design, develop, and test data distribution pipelines. With over 450+ connectors and processors across the ecosystem of hybrid cloud services—including data lakes, lakehouses, cloud warehouses, and on-premises sources—CDF-PC provides indiscriminate data distribution. Read More here.

Cloudera Data Engineering (CDE) is the only cloud-native service purpose-built for enterprise data engineering teams. Building on Apache Spark, Data Engineering is an all-inclusive data engineering toolset that enables orchestration automation with Apache Airflow, advanced pipeline monitoring, visual troubleshooting, and comprehensive management tools to streamline ETL processes across enterprise analytics teams. Read More here.

Cloudera Data Warehouse (CDW) is a cloud service for creating self-service data warehouses and the underlying compute clusters for teams of business analysts. Data Warehouse is an auto-scaling, highly concurrent and cost effective analytics service that ingests high scale data anywhere, from structured, unstructured and edge sources. It supports hybrid and multi-cloud infrastructure models by seamlessly moving workloads between on-premise and any cloud for reports, dashboards, ad-hoc and advanced analytics, including AI, with consistent security and governance. Read More here.

Cloudera Machine Learning CDP Machine Learning enables enterprise data science teams to collaborate across the full data lifecycle with immediate access to enterprise data pipelines, scalable compute resources, and access to preferred tools. CDP Machine Learning optimizes ML workflows across your business with native and robust tools for deploying, serving, and monitoring models. With extended SDX for models, govern and automate model cataloging and then seamlessly move results to collaborate across CDP experiences including CDP Data Warehouse and CDP Operational Database. Read More here.

CDP Data Visualization enables data engineers, business analysts, and data scientists explore data quickly and easily, collaborate, and share insights across the data lifecycle—​from data ingest to data insights and beyond.

Pre-requisites

  1. Laptop with a supported OS (Windows 7 not supported) or MacBook. Please disable any VPNs.

  2. A modern browser - Google Chrome (IE, Firefox, Safari not supported).

  3. Wi-Fi Internet connection with minimal security firewall on laptop and network.
    and please do not copy/paste strings with trailing characters while executing the workshop.

Access Details

Your instructor will guide you through the following.
(1) Credentials: Participants must enter their First Name, Last Name & Company details and make a note of corresponding Workshop Login Username, Workshop Login Password and CDP Workload User to be used in this workshop.
(2) Workshop login: Using the details in the previous step make sure you are able to login here.

Workshop Flow

flow

Workshop Guides

About

This workshop aims to make use of telco data set to work with CDP utilizing the Data services (CDF, CDE, CDW & CML) for Telecom Churn Use Case.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages