Official code for Paper "Mantis: Multi-Image Instruction Tuning"
-
Updated
May 16, 2024 - Python
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.
Automate browser-based workflows with LLMs and Computer Vision
A fully-annotated, open-design dataset of autonomous and piloted high-speed flight
📸 A powerful, high-performance React Native Camera library.
Recrafting Video Ads with Generative AI
Anthropic Claude API wrapper for Go
The Freiburg Vision Test (FrACT) assesses visual acuities and contrast thresholds. It runs in any modern browser, or as webApp.
Productivity Pro is a note-taking and task organization app specifically created for students.
Python API to use Intel RealSense camera and sync the camera with Dorna 2 robotic arm.
PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.
High performance barcode scanner for React Native using VisionCamera
A simple playground Web UI for using the Gemini Pro Vision and Gemini Pro AI models with Next.js
A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)
Add a description, image, and links to the vision topic page so that developers can more easily learn about it.
To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."