🚀 Feature: re-write Langchain instrumentation to use Langchain Callbacks #541

nirga · 2024-02-27T14:31:39Z

Which component is this feature for?

Langchain Instrumentation

🔖 Feature description

Right now, we monkey-patch classes and methods in LlamaIndex which requires endless work and constant maintenance. Langchain has a system for callbacks that can potentially be used to create/end spans without being too coupled with with the framework's inner structure.

🎤 Why is this feature needed ?

Support Langchain entirely and be future-proof to internal API changes

✌️ How do you aim to achieve this?

Look into Langchain callbacks and how other frameworks are using it.

🔄️ Additional Information

No response

👀 Have you spent some time to check if this feature request has been raised before?

I checked and didn't find similar issue

Are you willing to submit PR?

None

maciejwie · 2024-03-13T18:34:43Z

Chainlit uses langchain's callbacks for their instrumentation, and it seems to work well enough. They inherit from langchain's BaseTracer class as well and do their observability through callbacks, as well as some front-end functions (ex: updating the user-facing message on each token). Langchain does allow for multiple independent callbacks to be specified, so doing it this way doesn't exclude any user-created callbacks.

midhun1998 · 2024-03-17T17:02:10Z

Hi @nirga ,
I have a rough idea on the implementation now and I am willing to pick up this issue.
After reading your comments in traceloop/openllmetry-js#133 (comment) here is my understanding:

Create a call handler similar to StdOutCallbackHandler in the same directory as opentelemetry-instrumentation-langchain
Modify the task_wrapper, atask_wrapper, workflow_wrapper, and aworkflow_wrapper to inject the callback handler to the instance and use the callback to set/unset span. A small doubt here is that do we still need a task_wrapper.py and workflow_wrapper.py? Can I add the injecting logic to __init__.py itself. Which one's the right approach?
The callback handler should be injected for Langchain classes instances which support callbacks.

Please correct me if my understanding is incorrect.
Thanks,
Midhun

nirga · 2024-03-17T17:03:19Z

Right @midhun1998! And indeed we probably don't need them all

midhun1998 · 2024-03-17T17:04:48Z

Thanks for the confirmation, Nir! I will keep the issue updated with the progress. 🙂

midhun1998 · 2024-03-26T19:55:16Z

Hi @nirga ,

I tried the approach suggested and met with some roadblocks. Need your input. Below are the details:

Observation and Notes:

I have added the initial set of changes to my fork here: main...midhun1998:openllmetry:feat/langchain-callback (Please ignore the key of _span_dict as of now. My plan to use UUID or something as key instead of name. Its just a placeholder now.)
We are injecting the callback handler to the constructor of the class as expected but instead of start_as_current_span we will have to call start_span and end_span manually now.

Challenges:

I observed that since we are no longer calling the start_as_current_span() we might need some way to attach the span to the parent span if any and this is where I'm facing an issue and would appreciate your input. There seems to be no parent span during the creation of the spans in the callback handler E.g. When SequentialChain was calling LLMChain I was expecting the SequentialChain span to persist and be taken as a parent Span but that was not the case. The traceloop UI shows the span but it detached individual spans without any parent. I believe this has to do with the constructor initialization that we are doing.
The callback handler only applies to very few classes such as Chain, Agent, Tool, and lacks the support for other classes which were used earlier such as Template, BasePromptTemplate BaseOutputParser, RunnableSequence, etc. How are we looking to support these? Do we maintain the monkey patching for methods for the others?

nirga · 2024-04-06T12:21:31Z

Linking here our slack conversation so I'll remember that we've discussed and answered these already 😅

nirga added the help wanted Extra attention is needed label Feb 27, 2024

midhun1998 linked a pull request Apr 28, 2024 that will close this issue

feat(instrumentation): Updated Langchain instrumentation to use Langchain Callbacks #902

Draft

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚀 Feature: re-write Langchain instrumentation to use Langchain Callbacks #541

🚀 Feature: re-write Langchain instrumentation to use Langchain Callbacks #541

nirga commented Feb 27, 2024

maciejwie commented Mar 13, 2024 •

edited

midhun1998 commented Mar 17, 2024

nirga commented Mar 17, 2024

midhun1998 commented Mar 17, 2024

midhun1998 commented Mar 26, 2024

nirga commented Apr 6, 2024

🚀 Feature: re-write Langchain instrumentation to use Langchain Callbacks #541

🚀 Feature: re-write Langchain instrumentation to use Langchain Callbacks #541

Comments

nirga commented Feb 27, 2024

Which component is this feature for?

🔖 Feature description

🎤 Why is this feature needed ?

✌️ How do you aim to achieve this?

🔄️ Additional Information

👀 Have you spent some time to check if this feature request has been raised before?

Are you willing to submit PR?

maciejwie commented Mar 13, 2024 • edited

midhun1998 commented Mar 17, 2024

nirga commented Mar 17, 2024

midhun1998 commented Mar 17, 2024

midhun1998 commented Mar 26, 2024

Observation and Notes:

Challenges:

nirga commented Apr 6, 2024

maciejwie commented Mar 13, 2024 •

edited