🤖 A list of paper lists for NLP-related papers on GitHub
Updated Jan 19, 2024
Composition of Multimodal Language Models From Scratch
🖼️Latest Papers on Visually(Imagination)-Augmented NLP
Awesome list for attacks on large language models.
Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"
Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. Datasets also on Hugging Face: https://huggingface.co/parsee-ai
A Video Chat Agent with Temporal Prior
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery
Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
Unified Multi-modal IAA Baseline and Benchmark
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
A collection of visual instruction tuning datasets.
Official code for Paper "Mantis: Multi-Image Instruction Tuning"