Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Document usage of request_id #900

Open
AlexanderFillbrunn opened this issue Feb 12, 2024 · 0 comments
Open

feat: Document usage of request_id #900

AlexanderFillbrunn opened this issue Feb 12, 2024 · 0 comments

Comments

@AlexanderFillbrunn
Copy link

Feature request

Hi,
The /v1/generate endpoint returns a request_id as part of the JSON response. I assume that when finished is set to false, I can somehow use this request ID to query for the rest of the output later. However, the OpenAPI documentation I can access under http://127.0.0.1:3000/ does not seem to document anywhere which endpoint to use for that. Or am I mistaken and this is not possible?

I am using the latest ghcr.io/bentoml/openllm Docker image like this:
docker run --rm -it -p 3000:3000 --platform linux/x86_64 ghcr.io/bentoml/openllm start facebook/opt-1.3b --backend pt

Kind regards,
Alexander

Motivation

This feature would allow me to find out how the request_id can be used to follow up on incomplete queries, if this is at all possible.

Other

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant