High-efficiency floating-point neural network inference operators for mobile, server, and Web
-
Updated
Jun 1, 2024 - C
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Stretching GPU performance for GEMMs and tensor contractions.
A tiny, zero-deps WASM-accelerated tensor library for JavaScript.
A vector, quaternion, and matrix single-file public domain library for C99
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Proyectos e implementacion de bibliotecas de la materia Programacion (2009).
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Proyecto 4 - Estructuras de Datos
A collection of algorithms and data structures
It's quite funny how poorly implemented this is. Libraries that do these operations usually work in highly parallelized environments (sometimes entirely on the GPU). But the beauty of doing it manually is the learning curve. (Vec Vec double heap indirections are a crime against humanity)
Linear Algebra library in C++
A Java program that implements the product between matrices using threads. An assignment from OS teaching @unipv.
BLAS-like Library Instantiation Software Framework
DBCSR: Distributed Block Compressed Sparse Row matrix library
🧬 Search-Based approach to repair unrealisable Linear-Time Temporal Logic (LTL) specifications.
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
Tuned OpenCL BLAS
Problemario 9 - Programacion 1
Add a description, image, and links to the matrix-multiplication topic page so that developers can more easily learn about it.
To associate your repository with the matrix-multiplication topic, visit your repo's landing page and select "manage topics."