
feat: basic Support for OpenTelemetry Metrics and Token Usage Metrics in OpenAI V1 #369

Merged: 42 commits into traceloop:main on Feb 27, 2024

Conversation

@Humbertzhang (Contributor) commented Jan 27, 2024

Hello all, this is my first PR, for issue #251.

This PR introduces the fundamental components for OpenTelemetry Metrics support, and it adds counters for the token usage data of openai.resources.chat.completions in OpenAI V1.

My goal with this PR is to present my approach to addressing the issue at hand and to seek your feedback on the implementation. Your insights and suggestions will be extremely valuable for refining this solution. Moving forward, based on your feedback, I plan to enhance this implementation further and extend support to other instrumentors.

Looking for your reviews and constructive criticism to help improve this integration.
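
To illustrate the general approach, here is a minimal sketch using the standard OpenTelemetry Python metrics API; it is not the exact code in this PR, and the meter and metric names are assumptions based on the discussion further down this thread:

from opentelemetry.metrics import get_meter

# Sketch only: obtain a meter for the instrumentation and create a token counter.
meter = get_meter("opentelemetry.instrumentation.openai")

token_counter = meter.create_counter(
    name="llm.openai.chat_completions.tokens",
    unit="token",
    description="Number of tokens used in prompts and completions",
)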

@CLAassistant commented Jan 27, 2024

CLA assistant check
All committers have signed the CLA.

@nirga (Member) commented Jan 27, 2024

Nice work @Humbertzhang! Let's connect it to an observability platform to see if it works?

@nirga (Member) commented Jan 28, 2024

Sharing some work that's being done now with the OpenTelemetry community that we should align with - traceloop/semantic-conventions#2

@Humbertzhang (Contributor, Author)

Hi @nirga!
I set up a demo environment using otel-collector and Prometheus, and then set up an openai + traceloop demo to generate metrics.
You can find those files at: https://github.com/Humbertzhang/demo_otel_prometheus

And I think it reports metrics as expected!

(screenshot: Prometheus showing the reported metrics)
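
For reference, a minimal sketch of wiring the SDK's metrics to a local OTLP collector (which Prometheus then scrapes) might look like this; the endpoint and export interval are assumed values, not taken from the linked demo repo:

from opentelemetry.exporter.otlp.proto.grpc.metric_exporter import OTLPMetricExporter
from opentelemetry.metrics import set_meter_provider
from opentelemetry.sdk.metrics import MeterProvider
from opentelemetry.sdk.metrics.export import PeriodicExportingMetricReader

# Sketch only: push metrics over OTLP/gRPC to a collector running locally.
exporter = OTLPMetricExporter(endpoint="http://localhost:4317", insecure=True)
reader = PeriodicExportingMetricReader(exporter, export_interval_millis=5000)
set_meter_provider(MeterProvider(metric_readers=[reader]))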

@Humbertzhang (Contributor, Author)

Sharing some work that's being done now with the OpenTelemetry community that we should align with - traceloop/semantic-conventions#2

OK @nirga, I will look into this PR and align with it!

@Humbertzhang (Contributor, Author)

Update to Align with traceloop/semantic-conventions#2

Hello @nirga, I have updated my code to align with the changes proposed for the openai.chat_completions.tokens metric in traceloop/semantic-conventions#2.

You can see the results under normal conditions in the attached image. I have added attributes such as llm.response.model and llm.usage.token_type.

(screenshot: metrics reported under normal conditions)

You can observe the results under exceptional conditions in the images below.
During this run, the network was initially stable but became unreachable for OpenAI requests after a few minutes, resulting in an APIConnectionError.

(screenshots: metrics reported after the APIConnectionError)

However, I encountered challenges in retrieving server.address. I attempted to acquire it in the same manner as the span does (like https://github.com/traceloop/openllmetry/blob/main/packages/opentelemetry-instrumentation-openai/opentelemetry/instrumentation/openai/shared/__init__.py#L40), but it did not return the base_url in practical scenarios. Do you have any alternative suggestions or insights on this matter?
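
For illustration, recording against these conventions might look roughly like this; response_dict and token_counter are assumed to come from the surrounding instrumentation code, and this is a sketch rather than the exact change in this PR:

# Sketch only: record prompt and completion tokens with the attributes above.
usage = response_dict.get("usage") or {}
shared_attributes = {"llm.response.model": response_dict.get("model", "")}

token_counter.add(
    usage.get("prompt_tokens", 0),
    attributes={**shared_attributes, "llm.usage.token_type": "prompt"},
)
token_counter.add(
    usage.get("completion_tokens", 0),
    attributes={**shared_attributes, "llm.usage.token_type": "completion"},
)
# On failure (e.g. APIConnectionError), an error.type attribute would be
# recorded as well, as in the exception screenshots above.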

@nirga (Member) commented Jan 31, 2024

Looks good @Humbertzhang! Before I fully review it, can you:

  • rebase
  • fix lint issues
  • add tests (this actually helped me better review PRs in the past 😅)
  • add all suggested metrics

Regarding your question: can you give an example of when it didn't work for you?

@Humbertzhang (Contributor, Author)

Hi @nirga!
I have added all the suggested OpenAI metrics and their corresponding tests in this PR:

  • llm.openai.chat_completions.tokens
  • llm.openai.chat_completions.choices
  • llm.openai.chat_completions.duration
  • llm.openai.embeddings.tokens
  • llm.openai.embeddings.vector_size
  • llm.openai.embeddings.duration
  • llm.openai.image_generations.duration

I have manually tested all of these metrics in Prometheus.

Additionally, I've completed the rebase and addressed all linting issues.

Regarding server.address, I added a function called _get_openai_base_url, but calling it always returns "".
This function is intended to mirror the method used by the spans to retrieve the OpenAI URL.
I'm uncertain whether there's a more effective approach to obtain the OpenAI URL; any suggestions would be appreciated.

For the openai.embeddings.vector_size metric, I'm contemplating whether a Counter is the most suitable instrument for recording it. Maybe a Gauge would be better? What do you think?

I look forward to your review and any feedback you may have!
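
To make the instrument question concrete, here is an illustrative comparison of the two options being weighed; this is not the code in this PR, and meter is the instrumentation's meter as in the earlier sketch:

# Illustrative only: two ways the embeddings vector size could be recorded.
# A Counter accumulates the sizes across calls, so backends see a running sum;
# a Histogram records each observed size, which is closer to a per-request,
# gauge-like view of the value.
vector_size_counter = meter.create_counter(
    name="llm.openai.embeddings.vector_size",
    unit="element",
    description="Accumulated size of returned embedding vectors",
)
vector_size_histogram = meter.create_histogram(
    name="llm.openai.embeddings.vector_size",
    unit="element",
    description="Distribution of returned embedding vector sizes",
)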

@Humbertzhang (Contributor, Author) commented Feb 18, 2024

Hi @nirga, I have updated the metrics tests to use VCR, and I hope they meet the expected format.

@gyliu513 (Contributor)

Thanks @Humbertzhang !

Hey @nirga , can we get this merged? I was planning to create a PR to enable watsonx as well, and it will depend on this PR, thanks!

@nirga (Member) commented Feb 19, 2024

@gyliu513 yeah, probably today or tomorrow. I need to see why the tests are failing, and I want to move this to common semantic conventions.

@nirga (Member) commented Feb 21, 2024

@Humbertzhang looks like there's a regression in the OpenAI streaming test? 🤔

@nirga (Member) commented Feb 26, 2024

Nice work @Humbertzhang! It's a significant milestone for OpenLLMetry :)

@paolorechia (Contributor)

@Humbertzhang thanks for the work here, could you resolve the conflicts so we can merge? :)

Review thread on the added def _get_openai_base_url(): (diff hunk @@ -129,6 +131,13 @@, in the region of _set_response_attributes):

Contributor:

❓ Is this function still always returning empty string?

I can confirm this behavior if no initialization is done.

In [2]: def _get_openai_base_url():
   ...:     base_url = openai.base_url if hasattr(openai, "base_url") else openai.api_base
   ...:     if not base_url:
   ...:         return ""
   ...:     return base_url
   ...: 

In [3]: import openai

In [4]: _get_openai_base_url()
Out[4]: ''

In [5]: 

On the other hand, if you inspect the client instance, you should get the base url:

In [6]: client = openai.OpenAI()

In [7]: client.base_url
Out[7]: URL('https://api.openai.com/v1/')

Does this help you?

Contributor:

I've reviewed your code, sadly I don't see an easy way to extract this URL from an instance in the current instrumentation code.

My only idea was to instrument the client constructor so you can inject a listener which retrieves the URL for you. If you store it in a Singleton, you could retrieve it while calling this function:

def _handle_response(response, span, token_counter=None, choice_counter=None, duration_histogram=None, duration=None):
    if is_openai_v1():
        response_dict = model_as_dict(response)
    else:
        response_dict = response

    # metrics record
    _set_chat_metrics(token_counter, choice_counter, duration_histogram, response_dict, duration)

    # span attributes
    _set_response_attributes(span, response_dict)

    if should_send_prompts():
        _set_completions(span, response_dict.get("choices"))

    return response

Or maybe even do it directly inside _set_chat_metrics.

But again, quite some effort for just one URL :)
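
As a concrete (purely illustrative) version of that idea, the constructor could be wrapped so the most recent base URL is stashed in a module-level variable; the names here are hypothetical and not part of the instrumentation code:

import openai

_LAST_BASE_URL = ""  # naive "singleton" holding the most recently seen base URL
_original_init = openai.OpenAI.__init__

def _wrapped_init(self, *args, **kwargs):
    global _LAST_BASE_URL
    _original_init(self, *args, **kwargs)
    _LAST_BASE_URL = str(self.base_url)  # e.g. "https://api.openai.com/v1/"

openai.OpenAI.__init__ = _wrapped_init

def _get_openai_base_url():
    return _LAST_BASE_URL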

Contributor:

@nirga what do you think? Should we track this URL as a separate issue?

Member:

@paolorechia not sure I follow - I think this function does return the right base URL, no?

Contributor:

Yes, @Humbertzhang mentioned in a comment that he couldn't get this URL to work correctly, which is why I looked into it a bit.

Edit: no, I think it's not working, if I understood correctly.

Member:

@Humbertzhang I think this is the logic we should use to get it: https://github.com/traceloop/openllmetry/blob/main/packages/opentelemetry-instrumentation-openai/opentelemetry/instrumentation/openai/shared/__init__.py
(depending on the SDK version, it will be in different attributes).

Member:

For v1, it should be instance._client.base_url, and for v0 it should be the code you wrote below.
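
A rough, self-contained sketch of that version-dependent lookup (the helper names and the hasattr-based version check are illustrative, not the exact merged code):

import openai

def _is_openai_v1():
    # The OpenAI client class only exists in the v1 SDK.
    return hasattr(openai, "OpenAI")

def _get_openai_base_url(instance=None):
    if _is_openai_v1():
        # v1: the bound client on the resource instance carries the base URL.
        client = getattr(instance, "_client", None)
        return str(client.base_url) if client is not None else ""
    # v0: module-level attribute.
    return getattr(openai, "api_base", "") or ""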

@Humbertzhang (Contributor, Author)

@Humbertzhang thanks for the work here, could you resolve the conflicts so we can merge? :)

Hi @paolorechia, I resolved the conflicts in my local environment, but when I run the traceloop-sdk and openai tests, I got

FAILED tests/test_privacy_no_prompts.py::test_simple_workflow - openai.APIConnectionError: Connection error.
FAILED tests/test_prompt_management.py::test_prompt_management - openai.APIConnectionError: Connection error.
FAILED tests/test_sdk_initialization.py::test_resource_attributes - openai.APIConnectionError: Connection error.
FAILED tests/test_workflows.py::test_simple_workflow - openai.APIConnectionError: Connection error.

and

>           raise CannotOverwriteExistingCassetteException(cassette=cassette, failed_request=vcr_request)
E           vcr.errors.CannotOverwriteExistingCassetteException: Can't overwrite existing cassette ('/Users/maxzhang/Desktop/githubpjs/openllmetry/packages/traceloop-sdk/tests/cassettes/test_workflows/test_simple_workflow.yaml') in your current record mode ('none').
E           No match for the request (<Request (POST) https://api.openai.com/v1/chat/completions>) was found.
E           Found 1 similar requests with 0 different matcher(s) :
E           
E           1 - (<Request (POST) https://api.openai.com/v1/chat/completions>).
E           Matchers succeeded : ['method', 'scheme', 'host', 'port', 'path', 'query']
E           Matchers failed :

errors and exceptions.

Have you or @nirga encountered similar exceptions when using VCR?

@nirga (Member) commented Feb 27, 2024

@Humbertzhang I'd try to re-generate the cassettes. Comment out the OpenAI mock key from conftest.py, specify one of your own and then run poetry run pytest --record-mode=once. If you don't have an OpenAI API key I can do that for you :)

@Humbertzhang (Contributor, Author)

Still getting connection errors even though I can curl OpenAI... Maybe you can help me with that, @nirga?
I have pushed my conflict-resolution commits.

@Humbertzhang (Contributor, Author)

@nirga it wasn't pushed successfully just now 😅; it is pushed now.

@nirga (Member) commented Feb 27, 2024

@Humbertzhang All tests pass now 🤩
Thanks so much for this, it's a significant project!
Can we also fix and test the API base? (where @paolorechia and I commented above)

@nirga changed the title from "Basic Support for OpenTelemetry Metrics and Token Usage Metrics in OpenAI V1" to "feat: basic Support for OpenTelemetry Metrics and Token Usage Metrics in OpenAI V1" on Feb 27, 2024
@Humbertzhang (Contributor, Author)

Hi @nirga and @paolorechia, I have fixed the _get_openai_base_url function (@paolorechia's code and #522 really helped me, thanks!); it can now get the URL.
I also added assert checks for it in openai's tests/metrics tests.
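
For context, that kind of assertion might look roughly like this, sketched with the SDK's in-memory reader; the fixture wiring in the real tests differs:

from opentelemetry.sdk.metrics import MeterProvider
from opentelemetry.sdk.metrics.export import InMemoryMetricReader

reader = InMemoryMetricReader()
provider = MeterProvider(metric_readers=[reader])
# ... instrument OpenAI with this provider and make a chat completion call ...

metrics_data = reader.get_metrics_data()
for resource_metrics in metrics_data.resource_metrics:
    for scope_metrics in resource_metrics.scope_metrics:
        for metric in scope_metrics.metrics:
            if metric.name == "llm.openai.chat_completions.tokens":
                for point in metric.data.data_points:
                    # server.address should now be populated from the base URL.
                    assert point.attributes.get("server.address")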

@nirga merged commit 3eba03e into traceloop:main on Feb 27, 2024. 7 checks passed.