LLM Insights (rate limit handling and so on) #881

mdarweash · 2024-04-03T17:47:18Z

Does it support rate limit handling (such as backoff and retry), or is that planned?

Also, is there any way to collect insights about LLM usage (number of tokens used, etc.) so they can be pushed later to analytics (such as Prometheus or another data store)?

kwilliams-halosight · 2024-04-05T00:28:13Z

I have a similar question of my own, but I can answer in part. If your AI Service returns a Response, that object has a Usage property containing the token usage, broken down by input, output, and so on.
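For the "collect now, push to analytics later" part of the question, a minimal sketch of an in-memory accumulator might look like the following. `TokenUsageCollector` and its methods are hypothetical names invented for illustration, not part of langchain4j; it assumes the caller reads the input and output token counts off the response's usage object after each call and passes them in as plain integers.

```java
import java.util.concurrent.atomic.LongAdder;

/**
 * Hypothetical in-memory collector (NOT a langchain4j class).
 * Accumulates token counts so they can be exported later,
 * for example by a scheduled job that pushes them to a metrics store.
 */
final class TokenUsageCollector {

    // LongAdder keeps concurrent increments cheap when many threads record usage.
    private final LongAdder inputTokens = new LongAdder();
    private final LongAdder outputTokens = new LongAdder();
    private final LongAdder requests = new LongAdder();

    /** Record the usage reported for one model call. */
    void record(int inputTokenCount, int outputTokenCount) {
        inputTokens.add(inputTokenCount);
        outputTokens.add(outputTokenCount);
        requests.increment();
    }

    long totalInputTokens() {
        return inputTokens.sum();
    }

    long totalOutputTokens() {
        return outputTokens.sum();
    }

    /** Sum of input and output tokens recorded so far. */
    long totalTokens() {
        return inputTokens.sum() + outputTokens.sum();
    }

    long requestCount() {
        return requests.sum();
    }
}
```

How the totals reach Prometheus or another data store is left out on purpose; the getters are meant to be read by whatever exporter the application already uses.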

langchain4j (Maintainer) · 2024-04-11T11:00:16Z

Not yet implemented

langchain4j Apr 11, 2024
Maintainer

A simple "retry after a rate limit error" strategy could probably be implemented quite easily, but for a more robust solution we need to decide whether it should be part of the library at all.
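Until something like this exists in the library, the simple strategy can be approximated in application code. The following is a minimal sketch, not langchain4j API: `RetryWithBackoff` is a hypothetical helper, and how a rate limit error is recognized (the `isRetryable` predicate) is an assumption the caller has to supply, since it depends on the provider and client in use.

```java
import java.time.Duration;
import java.util.concurrent.Callable;
import java.util.concurrent.ThreadLocalRandom;
import java.util.function.Predicate;

/** Hypothetical helper (NOT part of langchain4j): retry with exponential backoff and jitter. */
final class RetryWithBackoff {

    private RetryWithBackoff() {
    }

    /**
     * Runs the action, retrying when isRetryable accepts the thrown exception.
     * The delay doubles after every failed attempt; a random jitter of up to
     * half the current delay is added so that many clients do not retry in lockstep.
     */
    static <T> T call(Callable<T> action,
                      Predicate<Exception> isRetryable,
                      int maxAttempts,
                      Duration initialDelay) throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long delayMillis = Math.max(0, initialDelay.toMillis());
        for (int attempt = 1; ; attempt++) {
            try {
                return action.call();
            } catch (Exception e) {
                // Give up when attempts are exhausted or the error is not a rate limit.
                if (attempt >= maxAttempts || !isRetryable.test(e)) {
                    throw e;
                }
                long jitter = ThreadLocalRandom.current().nextLong(delayMillis / 2 + 1);
                Thread.sleep(delayMillis + jitter);
                delayMillis *= 2;
            }
        }
    }
}
```

A caller would wrap the model invocation in the `Callable` and pass a predicate that matches whatever exception its client throws on a rate limit response. A more robust version would also honor a server-provided retry delay and cap the maximum wait, which is the kind of policy the maintainer's comment leaves open.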
