
Implement "Continue". Final output left truncated and unfinished because LLM max_token was reached. #504

MinhNgyuen opened this issue Apr 24, 2024 · 1 comment

@MinhNgyuen

Right now, if you give an Agent a task to write a long report with a lot of context, it will run out of output tokens and stop midway through its response. There is no option to have the LLM continue generating the report when it runs out of output tokens.

Similar to the ChatGPT web UI, add an option to continue output generation and finalize the output.

Here's some example code of how it's done with the OpenAI client:

from openai import OpenAI


def openai_write_long_response(prompt: str, context: str) -> str:
    client = OpenAI()
    messages = [
        {
            "role": "user",
            "content": f"Below is some relevant information that you can use to craft your report\n\n{context}",
        },
        {
            "role": "user",
            "content": prompt,
        },
    ]

    completion = client.chat.completions.create(
        model="gpt-4-turbo",
        messages=messages,
    )
    output = completion.choices[0].message.content

    # finish_reason == "length" means generation stopped because the model
    # hit its output-token limit, so append the partial answer to the
    # conversation and ask the model to pick up where it left off.
    while completion.choices[0].finish_reason == "length":
        messages += [
            {
                "role": "assistant",
                "content": completion.choices[0].message.content,
            },
            {
                "role": "user",
                "content": "Your message was truncated. Please continue from where you left off.",
            },
        ]
        completion = client.chat.completions.create(
            model="gpt-4-turbo",
            messages=messages,
        )
        output += completion.choices[0].message.content

    return output
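
A minimal usage sketch (the prompt and context below are purely illustrative; assumes OPENAI_API_KEY is set in the environment):

if __name__ == "__main__":
    # Hypothetical inputs; in practice the Agent would supply these.
    context = "Q1 revenue grew 12% year over year. ..."
    prompt = "Write a detailed, multi-section report based on the information above."
    print(openai_write_long_response(prompt, context))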
@MinhNgyuen (Author)

If you could point to where in the code we could implement this, I can take a stab at adding this functionality.
