quantization
Here are 565 public repositories matching this topic...
Models made for Edge Devices and NN Optimizations
Updated Oct 25, 2019 - Python
Matlab code for "Tree-structured quantization on Grassmann and Stiefel manifold", S. Schwarz et al., DCC 2021
Updated Oct 29, 2020 - MATLAB
DynamicQuantization_Bert from the PyTorch tutorials
Updated Aug 23, 2022 - Jupyter Notebook
ZeQLoRA: Efficient Finetuning of Quantized LLMs with ZeRO and LoRA
Updated Jul 24, 2023 - Jupyter Notebook
A toy example of the OCTAV algorithm for finding the optimal clipping scalar in the quantization error problem
Updated Nov 15, 2023 - Python
Quantization library based on 1R
Updated Apr 13, 2020 - Rust
Updated Feb 18, 2020 - Python
PyTorch Model Quantization, Layer Fusion and Optimization
Updated Jun 28, 2022 - Jupyter Notebook
Project assignment for the course Introduction to Telecommunications at ECE NTUA
Updated Mar 17, 2023 - MATLAB
Uniform quantizer that uses mexCallMATLAB to call different MATLAB commands and plot the results
Updated Jun 18, 2023 - C
Quantizing LLMs using GPTQ
Updated Dec 31, 2023 - Jupyter Notebook
Implementation of MedQ: Lossless ultra-low-bit neural network quantization for medical image segmentation
Updated May 17, 2024
Regularized Classification-Aware Quantization
Updated May 3, 2021 - Python
A compilation of various ML and DL models and ways to optimize their inference.
Updated Nov 10, 2023 - Jupyter Notebook
Updated Sep 16, 2021 - Jupyter Notebook
Optimized CPU Implementation of Llama2-LLM
Updated Mar 21, 2024 - Python
Efficient Inference techniques implemented in PyTorch for computer vision.
Updated Dec 2, 2020 - Python