What is the input sample of the forward function in videollama #146

Open
llx-08 opened this issue Mar 8, 2024 · 1 comment

llx-08 commented Mar 8, 2024

Hi, I'm wondering what the input sample of the forward function in videollama.py should look like.

It seems to be a dict() with image and text_input as its keys, but I can't find any usage example. I also checked the inference process in demo_audiovideo.py, and it differs from the forward process. Could you provide an example of how to use the forward function in videollama? Thank you very much!
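
To make the question concrete, here is a minimal sketch of the kind of dict described above. Only the two keys, image and text_input, come from reading videollama.py. The helpers make_sample and check_sample, the nested-list placeholder standing in for the visual tensor, and the prompt text are hypothetical and not confirmed by the repository.

```python
# Hypothetical sketch: only the keys "image" and "text_input" come from the
# issue text. Helper names, placeholder data, and the prompt are assumptions.

def make_sample(batch_size=2):
    """Build a placeholder batch with the two keys forward() appears to read."""
    # Stand-in for the visual input; a real call would presumably pass a tensor.
    image = [[0.0, 0.0, 0.0] for _ in range(batch_size)]
    # One text string per item in the batch.
    text_input = ["Describe the video."] * batch_size
    return {"image": image, "text_input": text_input}


def check_sample(samples):
    """Verify both keys exist and agree on batch size; return the batch size."""
    missing = [key for key in ("image", "text_input") if key not in samples]
    if missing:
        raise KeyError(f"missing keys: {missing}")
    if len(samples["image"]) != len(samples["text_input"]):
        raise ValueError("image and text_input must have the same batch size")
    return len(samples["text_input"])


samples = make_sample(batch_size=2)
print(check_sample(samples))
```

Whether forward expects additional keys, or a particular tensor shape for image, is exactly what remains unclear.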


EQ3000 commented Apr 28, 2024

I am also looking for a solution to this!
