XInstructBLIP demo text generation #689

Open
ParkJun-Yeong opened this issue Apr 18, 2024 · 0 comments


ParkJun-Yeong commented Apr 18, 2024

Hi. I ran demo.ipynb with the modality set to 'audio' only.

In vicuna7b_v2.yaml:

  modalities: ['audio']

I then ran the inference function from demo.ipynb directly, because I don't need the web UI:

inference(None, None, "examples/audio/Group_of_Dogs_Barking.wav", None, "Question: What is the dog doing? Answer: ", "Qformer Prompt: What is the dog doing?", 1, 250, 5, 1, 1.5, 0.9, "Beam search")
inference(None, None, "examples/audio/Group_of_Dogs_Barking.wav", None, "Question: What is the dog doing? Answer: ", "Question: What is the dog doing? Answer: ", 1, 250, 5, 1, 1.5, 0.9, "Beam search")
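
To make the two calls easier to compare, the positional arguments can be given names. The following is only a sketch: the field names (prompt, qformer_prompt, min_len, max_len, beam_size, len_penalty, repetition_penalty, top_p, decoding_method) are guesses inferred from the values passed above, not names confirmed from demo.ipynb, and `DemoCall` is a hypothetical helper, not part of the demo.

```python
from dataclasses import dataclass


@dataclass
class DemoCall:
    """Named view of the 13 positional arguments used above.

    All field names are assumptions inferred from the values in the calls.
    """
    audio_path: str
    prompt: str
    qformer_prompt: str
    min_len: int = 1
    max_len: int = 250
    beam_size: int = 5
    len_penalty: float = 1
    repetition_penalty: float = 1.5
    top_p: float = 0.9
    decoding_method: str = "Beam search"

    def to_args(self):
        # Slots 1, 2 and 4 are passed as None in the calls above
        # (presumably the unused modalities); audio is the third slot.
        return (None, None, self.audio_path, None,
                self.prompt, self.qformer_prompt,
                self.min_len, self.max_len, self.beam_size,
                self.len_penalty, self.repetition_penalty,
                self.top_p, self.decoding_method)


audio = "examples/audio/Group_of_Dogs_Barking.wav"
prompt = "Question: What is the dog doing? Answer: "

call_a = DemoCall(audio, prompt, "Qformer Prompt: What is the dog doing?")
call_b = DemoCall(audio, prompt, prompt)

# Positions (0-based) at which the two argument tuples differ.
diff_positions = [
    i for i, (a, b) in enumerate(zip(call_a.to_args(), call_b.to_args()))
    if a != b
]
print(diff_positions)
```

In the notebook, where `inference` is already defined, the calls would then be `inference(*call_a.to_args())` and `inference(*call_b.to_args())`. The comparison confirms that the two calls differ only in the sixth argument.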

All generation settings were the same as in the original demo code.
However, with both 'beam search' and 'nucleus sampling', the generated output was as shown below.

[image: generated output]

How can I reproduce the results from the paper?
