AI apps using OpenAI Vision model.
-
Updated
May 16, 2024 - JavaScript
AI apps using OpenAI Vision model.
SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.
Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command execution, file upload and download, speech synthesis and recognition, access to Web, memory, prompt presets, plugins, assistants & more. Linux, Windows, Mac.
The most advanced Web UI for AI chat
JanAr: GUI application leveraging GPT-4-Vision and GPT models to automatically generate engaging social media captions for artwork images. Customized for a glass workshop and picture framing business, it blends artistic insights with effective online engagement strategies.
Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, and Mistral integration.
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or YandexART for image creation. It can use vision capabilities or tools/functions.
An AI-powered Mattermost ChatGPT chatbot that utilizes the OpenAI API to provide helpful, contextual responses to user messages, extract text from links, and describe or generate images. With Docker support!
AI Telegram Bot, ChatGPT, Dalle2, Whisper, GPT-4 Vision, Stability AI
Browser extension that generates alternate text for images using GPT-4 Vision.
High quality resources & applications for LLMs, multi-modal models and VectorDBs
Convert a screenshot to a working Flutter app.
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
How a Picture of Car Damage Can File Your Insurance Claim
Raiha Discord Accessibility Bot
Network Topology Image Analsysis
Add a description, image, and links to the gpt-4-vision topic page so that developers can more easily learn about it.
To associate your repository with the gpt-4-vision topic, visit your repo's landing page and select "manage topics."