Skip to content
#

mini-batch-kmeans

Here are 9 public repositories matching this topic...

This project consists in the implementation of the K-Means and Mini-Batch K-Means clustering algorithms. This is not to be considered as the final and most efficient algorithm implementation as the objective here is to make a clear comparison between the sequential and parallel execution of the clustering steps.

  • Updated Jul 9, 2023
  • C++

Developed for "Management and Analysis of Physics Dataset Mod. B," this project uses Dask and CloudVeneto VMs to handle a massive 250GB dataset. Clustering on 800k RCV1 articles involves dataset reduction by macrocategory and also implementing cosine similarity for improved clustering, as suggested by Natural Language Processing principles.

  • Updated Jan 25, 2024
  • HTML

This project used a Kmeans after PCA model to segment retail customers to optimize marketing efforts. When the model repeatedly returned a single cluster, the model was used to prove the customers' homogenous characteristics. Influenced the bank's marketing strategies and initiatives. Developed in Jupyter Notebook with Python for FNB.

  • Updated Aug 20, 2021
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the mini-batch-kmeans topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mini-batch-kmeans topic, visit your repo's landing page and select "manage topics."

Learn more