Skip to content

Finetuning BLIP-2/ Flan-T5-xl for caption generation PEFT-LoRA

Notifications You must be signed in to change notification settings

AviSoori1x/multimodal_LLMs

Repository files navigation

multimodal_LLMs

Finetuning BLIP-2/ Flan-T5-xl for caption generation PEFT-LoRA. Used synthetic images generated using a diffusion model and a heuristic based prompting strategy

About

Finetuning BLIP-2/ Flan-T5-xl for caption generation PEFT-LoRA

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages