NExT-GPT / NExT-GPT (2.9k stars): Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model. Topics: multimodal, gpt-4, foundation-models, visual-language-learning, large-language-models, llm, chatgpt, instruction-tuning, multi-modal-chatgpt. Language: Python. Updated Jan 22, 2024.
DAMO-NLP-SG / Video-LLaMA (2.5k stars): [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding. Topics: llama, large-language-models, video-language-pretraining, vision-language-pretraining, cross-modal-pretraining, blip2, minigpt4, multi-modal-chatgpt. Language: Python. Updated May 11, 2024.