Image Input #134

Legerdo · 2024-03-29T07:04:26Z

Describe the feature

Hello.
llamafile seems to support image input in formats such as jpg/png/gif/bmp.

Example)
llamafile -ngl 9999 --temp 0 \
    --image ~/Pictures/lemurs.jpg \
    -m llava-v1.5-7b-Q4_K.gguf \
    --mmproj llava-v1.5-7b-mmproj-Q4_0.gguf \
    -e -p '### User: What do you see?\n### Assistant: ' \
    --no-display-prompt 2>/dev/null

Is it possible to implement this feature in the future?
Or is there some problem that makes it impossible?

amakropoulos · 2024-03-29T08:01:18Z

Hi, thanks for the request!
That should be feasible.
How would you like to use it / see it inside Unity?

Legerdo · 2024-03-29T08:33:17Z

Eventually, I would like to add the ability to describe to the user what the NPC character's camera (eyes) sees.

I haven't tested this against the local vision model yet, so I don't know to what extent it's possible, but it would be interesting if it were!
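
As a rough way to try the idea outside Unity first, the llamafile command quoted earlier in this thread could be assembled from a script for a frame saved to disk. This is only a sketch under assumptions: `build_llava_command` and the frame path are hypothetical names made up for illustration, the flags are copied from the quoted example rather than checked against a particular llamafile version, and nothing here reflects how the feature would actually be integrated.

```python
import shlex
from pathlib import Path


def build_llava_command(image_path, question,
                        model="llava-v1.5-7b-Q4_K.gguf",
                        mmproj="llava-v1.5-7b-mmproj-Q4_0.gguf"):
    """Return the argv list for the llamafile example quoted in this thread.

    Hypothetical helper: the flags are taken as-is from that example.
    """
    # Keep a literal backslash-n in the prompt; the -e flag in the quoted
    # example is what asks llamafile to expand escapes.
    prompt = f"### User: {question}\\n### Assistant: "
    return [
        "llamafile", "-ngl", "9999", "--temp", "0",
        "--image", str(Path(image_path).expanduser()),
        "-m", model,
        "--mmproj", mmproj,
        "-e", "-p", prompt,
        "--no-display-prompt",
    ]


if __name__ == "__main__":
    # Print the command for inspection instead of running it.
    argv = build_llava_command("~/Pictures/lemurs.jpg", "What do you see?")
    print(shlex.join(argv))
```

Passing the returned list to `subprocess.run` with `stderr=subprocess.DEVNULL` would roughly mirror the `2>/dev/null` in the quoted command, assuming `llamafile` and both model files are available locally.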

Legerdo added the enhancement (New feature or request) label · Mar 29, 2024
