#

vision

Here are 1,492 public repositories matching this topic...

TIGER-AI-Lab / Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning"

language video vision mantis vlm multimodal lmm fuyu mllm llava-llama3 multi-image-understanding

Updated May 16, 2024
Python

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development

Updated May 16, 2024
TypeScript

GoogleCloudPlatform / java-docs-samples

Java and Kotlin Code samples used on cloud.google.com

kotlin java appengine video cdn auth samples vision translate automl

Updated May 16, 2024
Java

eliranwong / freegenius

FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.

google ai gemini vision openai mistral autogen groq stable-diffusion chatgpt llava llamacpp ollama llama3

Updated May 16, 2024
Python

Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision

python api workflow automation browser computer vision gpt browser-automation rpa playwright llm

Updated May 15, 2024
Python

tii-racing / drone-racing-dataset

A fully-annotated, open-design dataset of autonomous and piloted high-speed flight

control computer-vision robotics path-planning dataset vision motion-capture quadrotor visual-inertial-odometry motion-capture-data ros2 drone-racing autonomous-robots scene-understanding inertial-data

Updated May 15, 2024
Python

mrousavy / react-native-vision-camera

📸 A powerful, high-performance React Native Camera library.

Updated May 15, 2024
Swift

google-marketing-solutions / vigenair

Recrafting Video Ads with Generative AI

machine-learning video ai google-cloud vision video-editing video-ads video-generation vertex-ai large-language-models llm generative-ai video-to-video

Updated May 15, 2024
TypeScript

liushuangls / go-anthropic

Anthropic Claude API wrapper for Go

go golang ai vision streaming-api claude tool-use llm anthropic claude-ai claude-api function-calling

Updated May 15, 2024
Go

FrACT10

michaelbach / FrACT10

The Freiburg Vision Test (FrACT) assesses visual acuities and contrast thresholds. It runs in any modern browser, or as webApp.

contrast vision psychophysics cappuccino objective-j visual-acuity

Updated May 14, 2024
Objective-J

stoobit / Productivity-Pro

Productivity Pro is a note-taking and task organization app specifically created for students.

swift students ios apple school schedule vision note-taking students-project tasks-list swiftui note-taking-app ipados pencilkit

Updated May 14, 2024
Swift

findo

hyonukusan / findo

수행평가

macos vision appkit coreml swiftui

Updated May 14, 2024
Swift

jessielw / HDR-Multi-Tool

A graphical user interface for parsing HDR10+ and Dolby Vision

electron windows parser json gui modern queue tool extract vision hdr10 rpu dolby hdr10plus dolbyvision

Updated May 14, 2024
JavaScript

dorna-robotics / camera

Python API to use Intel RealSense camera and sync the camera with Dorna 2 robotic arm.

robot vision realsense-camera dorna

Updated May 14, 2024
Python

TextGrabber2-app / TextGrabber2

macOS menu bar app that efficiently detects text from copied images.

macos swift menubar mac ocr vision

Updated May 14, 2024
Swift

kxkw / chatgpt-telegram-bot

The most lightweight and easy to use ChatGPT and DALLE-3 Telegram bot with token balances, user management and admin privileges

bot ai telegram vision openai gpt telebot whisper gpt-4 chatgpt chatgpt-api dalle-3 gpt-4o

Updated May 14, 2024
Python

PhotonVision / photonvision

PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.

java opencv computer-vision frc vision wpilib vision-processing

Updated May 16, 2024
Java

vision-camera-barcode-scanner

mgcrea / vision-camera-barcode-scanner

High performance barcode scanner for React Native using VisionCamera

react-native camera barcode vision

Updated May 13, 2024
TypeScript

e-roy / gemini-pro-vision-playground

A simple playground Web UI for using the Gemini Pro Vision and Gemini Pro AI models with Next.js

nextjs gemini vision gemini-api gemini-pro-vision gemini-pro gemini-ai

Updated May 13, 2024
TypeScript

BiomedSciAI / fuse-med-ml

A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)

Updated May 16, 2024
Python

Improve this page

Add a description, image, and links to the vision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."