Skip to content

High Latency for a simple CFG generation #617

Discussion options

You must be logged in to vote

Since Outlines is almost as fast as non-guided generation, this shouldn't be the case.

Only the regex-guided generation in outlines uses the efficient/optimal approach we described in our paper, to which that statement is referring. The community provided CFG-guided generation takes a different approach and does not offer similar performance guarantees.

Replies: 6 comments 4 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
3 replies
@Shivam-Srivastava
Comment options

@lapp0
Comment options

@lapp0
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@rlouf
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by rlouf
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
4 participants
Converted from issue

This discussion was converted from issue #615 on February 06, 2024 09:39.