#

gpt4v

Here are 33 public repositories matching this topic...

mnotgod96 / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

agent gpt4 llm generative-ai chatgpt gpt4v

Updated May 13, 2024
Python

X-PLUG / MobileAgent

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

android agent harmony ios app gui automation mobile copilot multimodal mobile-agents mllm multimodal-large-language-models gpt4v multimodal-agent

Updated Apr 3, 2024
Python

reworkd / tarsier

Vision utilities for web interaction agents 👀

python ocr selenium webscraping pypi-package playwright llms gpt4v

Updated May 6, 2024
Jupyter Notebook

AmberSahdev / Open-Interface

Control Any Computer Using LLMs

python windows macos linux machine-learning automation assistant openai gpt pyinstaller self-driving pyautogui assistant-computer-control self-driving-software gpt4 llm gpt4v gpt4vision

Updated May 12, 2024
Python

bdekraker / WebcamGPT-Vision

Lightweight GPT-4 Vision processing over the Webcam

computer-vision openai gpt-4 chatgpt gpt4-api gpt4v

Updated Nov 9, 2023
JavaScript

langgptai / Awesome-Multimodal-Prompts

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.

awesome awesome-list prompts multimodal dall-e gpt4 prompt-engineering chatgpt prompt-injection newbing jailbreak-prompt gpt4v dall-e3 multimodal-prompts dall-e3-prompts

Updated Oct 25, 2023

vscode-ui-sketcher

pAIrprogio / vscode-ui-sketcher

Draw your projects to life

ui-design vscode-extension tldraw gpt4v

Updated May 6, 2024
TypeScript

amazing-openai-api

soulteary / amazing-openai-api

Convert different model APIs into the OpenAI API format out of the box.

openai openai-api azure-openai azure-openai-api gpt4v gpt4vision yi-34b google-gemini gemini-pro yi-34b-chat

Updated Feb 21, 2024
Go

zzxslp / MM-Navigator

web-navigation gpt4v llm-agents

Updated Nov 16, 2023

tiwater / flowgen

AutoGen Visualized - Visual Tools for Multi-Agent Development.

agent artificial-intelligence openai autogen rag llm chatgpt llava gpt4v

Updated May 10, 2024
TypeScript

kyegomez / MambaByte

Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta

machine-learning ai tokenizer ml artificial-intelligence mamba multi-modality megabyte gpt4v

Updated Mar 11, 2024
Python

admineral / GPT4-Vision-React-Starter

Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description

ai openai openai-api gpt4 chatgpt-api openaiapi gpt4-api gpt4v gpt-4-vision-preview gpt4-vision

Updated Nov 29, 2023
TypeScript

sketch2app

cameronking4 / sketch2app

The ultimate sketch to code app made using GPT4 vision. Choose your desired framework (React, Next, React Native, Flutter) for your app. It will instantly generate code and preview (sandbox) from a simple hand drawn sketch on paper captured from webcam

code-generator nextjs openai wireframe app-maker sketch2code gpt4 design2code code-assistant ai-tool gpt4v gpt4-vision sketch2app pad2pixel generate-app-ai

Updated May 3, 2024

cocacola-lab / MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

minecraft ai-agents ai-agent large-language-models llm vision-language-model multimodal-large-language-models gpt4v

Updated May 11, 2024
Python

roboflow / gpt-checkup

Monitor the performance of OpenAI's GPT-4V model over time.

computer-vision model-analysis gpt4v gpt-4v

Updated May 13, 2024
HTML

martintmv-git / gpt4v-streamlit-voiceover

AI Voiceover with GPT4V

python jupyter-notebook openai streamlit gpt4v

Updated May 10, 2024
Jupyter Notebook

kyegomez / HRTX

Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2

machine-learning ai ml artificial-intelligence ensemble multi-modal rtx multi-modality rt-2 gpt4v

Updated Mar 12, 2024
Python

neka-nat / mylangrobot

Language instructions to mycobot using GPT-4V

whisper mycobot chatgpt segment-anything gpt4v gpt-4-vision-preview gpt-4-vision

Updated Dec 11, 2023
Python

logicalroot / gpt-4v-demos

🤖 GPT-4V Demos • Test the model's vision capabilities in your browser using Streamlit • Easy setup

python openai streamlit gpt-4 gpt4 gpt4v gpt-4v

Updated Dec 3, 2023
Python

GraphPKU / CoI

Chain of Images for Intuitively Reasoning

chatbot llama multimodal chatgpt llava visual-language-models gpt4v dalle3 chain-of-throught chain-of-image

Updated Nov 29, 2023
Python

Improve this page

Add a description, image, and links to the gpt4v topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gpt4v topic, visit your repo's landing page and select "manage topics."