[Feature Request] Allow users to connect pr_agent to an existing SageMaker inference endpoint #609
Comments
@krrishdholakia do you think this request is feasible?
Hey @mattiaciollaro @mrT23, we already support SageMaker — https://docs.litellm.ai/docs/providers/aws_sagemaker. What am I missing?
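For reference, a minimal sketch of how LiteLLM's SageMaker provider is typically invoked per the docs linked above: the model string is the endpoint name prefixed with `sagemaker/`. The endpoint name here is a placeholder, and the import is done lazily so the helper works without litellm installed (this is an illustration, not pr-agent's actual wiring):

```python
def sagemaker_model_string(endpoint_name: str) -> str:
    # LiteLLM routes to SageMaker when the model string has this prefix.
    return f"sagemaker/{endpoint_name}"

def ask(endpoint_name: str, prompt: str):
    # Lazy import: requires litellm installed and AWS credentials configured.
    from litellm import completion
    return completion(
        model=sagemaker_model_string(endpoint_name),
        messages=[{"role": "user", "content": prompt}],
    )
```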
@mattiaciollaro is this PR still relevant?
Sorry for the delay guys.
https://docs.litellm.ai/docs/providers/aws_sagemaker seems to support SageMaker JumpStart models specifically. I am thinking of a different situation, where a model is already deployed via SageMaker and a reference to the inference endpoint name is available (as in here). In that case, how can we instruct pr-agent to leverage the LLM behind that pre-existing endpoint? I am not sure I see a way of doing this via https://docs.litellm.ai/docs/providers/aws_sagemaker.

In the context of a POC with my team, the way we accomplished this was to hack pr-agent's default AI handler (which is the LiteLLM AI handler) and use the sagemaker SDK (specifically, the HF predictor) to make requests to the pre-existing SageMaker endpoint. I imagine a cleaner solution would be to implement a dedicated AI handler for this use case?
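The hack described above might look roughly like this sketch: flatten the chat messages into the `{"inputs": ..., "parameters": {...}}` payload many HF text-generation containers expect, then send it through the sagemaker SDK's `HuggingFacePredictor`. The prompt format and parameter names are assumptions about the deployed container, not pr-agent's actual handler code:

```python
def build_payload(messages, max_new_tokens=512, temperature=0.2):
    """Flatten chat messages into a single prompt string and wrap it in the
    payload shape commonly accepted by HF text-generation endpoints."""
    prompt = "\n".join(f"{m['role']}: {m['content']}" for m in messages)
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
    }

def call_endpoint(endpoint_name, messages):
    # Lazy import: requires the sagemaker SDK and AWS credentials; the
    # payload helper above stays usable without them.
    from sagemaker.huggingface.model import HuggingFacePredictor
    predictor = HuggingFacePredictor(endpoint_name=endpoint_name)
    return predictor.predict(build_payload(messages))
```

A dedicated AI handler would essentially wrap `call_endpoint` behind pr-agent's AI handler interface instead of patching the LiteLLM handler in place.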
I don't have a PR out for this, but yes: I think the feature request is still relevant :) My apologies again for the delay!
Context: assume a user has a pre-configured LLM inference endpoint in SageMaker (for example, a self-hosted Llama model as described here). It would be nice to allow the user to configure pr-agent to leverage that endpoint, e.g. by means of a dedicated AI handler.
Discord chat: https://discord.com/channels/1057273017547378788/1057273018084237344/1197261978591309884
cc: @krrishdholakia