gpt4v
Here are 33 public repositories matching this topic...
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
-
Updated
Apr 3, 2024 - Python
Vision utilities for web interaction agents 👀
-
Updated
May 6, 2024 - Jupyter Notebook
Control Any Computer Using LLMs
-
Updated
May 12, 2024 - Python
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
-
Updated
Oct 25, 2023
Convert different model APIs into the OpenAI API format out of the box.
-
Updated
Feb 21, 2024 - Go
-
Updated
Nov 16, 2023
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
-
Updated
Mar 11, 2024 - Python
Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description
-
Updated
Nov 29, 2023 - TypeScript
The ultimate sketch to code app made using GPT4 vision. Choose your desired framework (React, Next, React Native, Flutter) for your app. It will instantly generate code and preview (sandbox) from a simple hand drawn sketch on paper captured from webcam
-
Updated
May 3, 2024
Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs
-
Updated
May 11, 2024 - Python
Monitor the performance of OpenAI's GPT-4V model over time.
-
Updated
May 13, 2024 - HTML
AI Voiceover with GPT4V
-
Updated
May 10, 2024 - Jupyter Notebook
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
-
Updated
Mar 12, 2024 - Python
Language instructions to mycobot using GPT-4V
-
Updated
Dec 11, 2023 - Python
Chain of Images for Intuitively Reasoning
-
Updated
Nov 29, 2023 - Python
Improve this page
Add a description, image, and links to the gpt4v topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpt4v topic, visit your repo's landing page and select "manage topics."