Improve: Research Agent Regex Parsing #469
Open
Description
Improvements
Resolved Issues
The list may not be exhaustive.
Related Pull Requests
Explanations
The current version of Devika expects LLM responses in a specific JSON or Markdown format in order to parse them correctly. Unfortunately, LLMs (especially smaller models such as LLaMA 3) do not always follow these instructions perfectly and often bury their JSON inside explanatory text, making the response impossible to parse. This pull request uses regular expressions to extract the expected format from LLM responses more robustly.
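The extraction idea can be sketched as follows. This is a minimal, hypothetical helper (not the exact code in this PR): it first looks for a fenced ```json block, then falls back to the outermost brace-delimited span, and only returns the candidate if it actually parses as JSON.

```python
import json
import re

def extract_json(response: str):
    """Pull the first JSON object out of an LLM reply, even when it is
    wrapped in prose or a fenced code block. Illustrative sketch only."""
    candidates = []
    # Prefer an explicit ```json fenced block if the model produced one.
    fenced = re.search(r"```(?:json)?\s*(\{.*?\})\s*```", response, re.DOTALL)
    if fenced:
        candidates.append(fenced.group(1))
    # Otherwise fall back to the widest {...} span in the reply.
    brace = re.search(r"\{.*\}", response, re.DOTALL)
    if brace:
        candidates.append(brace.group(0))
    # Accept the first candidate that is actually valid JSON.
    for candidate in candidates:
        try:
            return json.loads(candidate)
        except json.JSONDecodeError:
            continue
    return None
```

With this approach, a reply like `Sure! Here is the plan: {"queries": [...]} Hope that helps.` still yields a parseable object instead of triggering a retry.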
Furthermore, to simplify this method and generalize it to all agents without duplicating code, this pull request introduces a new parent class for agents, AgentTemplate, which provides generic render and parse_answer methods. These automatically extract the expected format from each agent's prompt.jinja2 and perform the expected validations. This change significantly simplifies the codebase, cutting each agent's code by roughly two thirds, avoiding repetition, and reducing errors of the type "Invalid response from the model, trying again...".
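The inheritance pattern described above might look roughly like this. This is a stdlib-only sketch under assumed names (the class layout, `expected_keys`, and the sample keys are illustrative, not the PR's exact API): the parent class owns extraction and validation, and each concrete agent only declares what its prompt promises.

```python
import json
import re

class AgentTemplate:
    """Hypothetical parent class: shared response parsing and validation
    so concrete agents need no per-agent parsing code."""

    # Keys the agent's prompt template asks the model to return.
    expected_keys: tuple = ()

    def parse_answer(self, response: str):
        # Tolerate prose around the JSON by grabbing the widest {...} span.
        match = re.search(r"\{.*\}", response, re.DOTALL)
        if not match:
            return None
        try:
            data = json.loads(match.group(0))
        except json.JSONDecodeError:
            return None
        # Reject replies missing any key the prompt template promised.
        if not all(key in data for key in self.expected_keys):
            return None
        return data

class ResearchAgent(AgentTemplate):
    # A concrete agent is reduced to declaring its expected schema
    # (these key names are made up for the example).
    expected_keys = ("queries", "ask_user")
```

A subclass then gets validation for free: `ResearchAgent().parse_answer(...)` returns the parsed dict on success and `None` on a malformed or incomplete reply, which the caller can treat as a retry signal.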
For now, as an example and a test, I have only migrated the RESEARCH agent, which now works reliably with every LLM I have tested: GPT-4, Claude Opus, Mixtral 8x7B, LLaMA 3, and Gemma. Once this pull request is approved, I am committed to adapting all the other agents in the same way.