This repository contains the design, implementation, and analysis of a near-real-time data warehouse prototype for an electronics business chain. A multi-threaded Extract, Transform, Load (ETL) pipeline, implemented in Java and MySQL, uses the HYBRIDJOIN algorithm to process customer sales data.
mysql
data-science
database
data-warehouse
business-intelligence
data-analysis
relational-databases
near-real-time
real-time-processing
data-warehousing
extract-transform-load
database-design
sales-analysis
etl-pipeline
join-method
data-modelling
data-mo
multidimensional-database
Updated Mar 1, 2024 - Java