Well commented code for different types of training configurations
Faster large mini-batch distributed training without squeezing devices
This project contains scripts/modules for distributed training
Short course: Introduction to Machine Learning
Compression-accelerated distributed DNN training system at large scale.
Access programming assignments and labs from the TensorFlow Advanced Techniques and TensorFlow Developer Specializations by deeplearning.ai on Coursera. 🚀🧠
A GitHub repository showcasing the implementation of AI scaling techniques and integration with MLflow for streamlined experiment tracking and management in machine learning workflows.
Everything is born from a simple experiment.
Tools for ML/MXNet on Kubernetes. Rework of original tf-operator to support MXNet framework.
Transfer learning applied to image classification (VGG16, multi-GPU distributed training)
Training Using Multiple GPUs
Example of distributed PyTorch
Distributed training of a CNN using MNIST dataset, Tensorflow and Horovod
Distributed training using PyTorch DDP & Suggestive resource allocation
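The core step shared by DDP-style projects like the one above is averaging gradients across workers with an all-reduce after each backward pass. A minimal, framework-free sketch of that averaging step (standard library only; the worker gradients here are illustrative placeholders, not from any of the listed repos):

```python
from statistics import mean

def allreduce_mean(worker_grads):
    """Simulate a gradient all-reduce: every worker ends up holding the
    element-wise mean of all workers' gradients (the key DDP sync step).

    worker_grads: list of per-worker gradient vectors, one list per worker.
    """
    n = len(worker_grads)
    # Average each parameter's gradient across all workers.
    averaged = [mean(col) for col in zip(*worker_grads)]
    # After a real all-reduce, every worker holds the same averaged gradient.
    return [list(averaged) for _ in range(n)]

# Two hypothetical workers, each with a gradient computed on its own data shard.
grads = [[1.0, 2.0], [3.0, 4.0]]
synced = allreduce_mean(grads)
# → every worker now holds [2.0, 3.0]
```

In real PyTorch DDP this averaging is performed by `torch.distributed.all_reduce` over NCCL or Gloo, overlapped with the backward pass; the sketch only shows the arithmetic the collective performs.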
Official DGL Implementation of "Distributed Graph Data Augmentation Technique for Graph Neural Network". KSC 2023
General purpose Kubernetes operator for DL frameworks written in Python
Project showcasing how to get started with Distributed XGBoost using PySpark in CML.
Distributed machine learning for biomarker prediction from big data streams collected from multi-modal wearable sensors
Metaflow On Kubernetes