multimodal
Here are 655 public repositories matching this topic...
日本語LLMまとめ - Overview of Japanese LLMs
-
Updated
May 11, 2024
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
-
Updated
May 11, 2024 - Python
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
-
Updated
May 11, 2024 - Python
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
Updated
May 11, 2024 - Python
Running Gen AI models and applications on NVIDIA Jetson devices with one-line command
-
Updated
May 11, 2024 - Shell
Real-Time Multimodal Pipelines for GenAI
-
Updated
May 11, 2024 - Python
Orchestrate Swarms of Agents From Any Framework Like OpenAI, Langchain, and Etc for Real World Workflow Automation. Join our Community: https://discord.gg/DbjBMJTSWD
-
Updated
May 11, 2024 - Python
autoupdate paper list
-
Updated
May 11, 2024 - Python
Implementation for the different ML tasks on Kaggle platform with GPUs.
-
Updated
May 11, 2024 - Jupyter Notebook
A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
-
Updated
May 10, 2024 - Python
Build and explore multimodal web interactives with pieces of paper!
-
Updated
May 10, 2024 - JavaScript
Open-source simulation engine for robotic general intelligence (RGI)
-
Updated
May 10, 2024 - C++
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
-
Updated
May 10, 2024 - Python
CSCI2470 Deep Learning Spring 2024: Enhancing Out-of-Distribution Object Detection with CLIP: A Vision-Language Approach
-
Updated
May 10, 2024 - Python
Stable Diffusion and LLMs offline on your own hardware
-
Updated
May 10, 2024 - Python
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
-
Updated
May 11, 2024 - Rust
Data Infrastructure for Multimodal AI: Data, models, and orchestration in a unified declarative interface.
-
Updated
May 11, 2024 - Python
Improve this page
Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."