Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(sagemaker-model-deployment): Support AWS Inferentia instance types #290

Open
1 of 2 tasks
kukushking opened this issue Feb 28, 2024 · 10 comments
Open
1 of 2 tasks
Labels
backlog enhancement New feature or request

Comments

@kukushking
Copy link

Describe the feature

I'd like to be able to use AWS Inferentia for my endpoint inference containers

Use Case

Proposed Solution

No response

Other Information

No response

Acknowledgements

  • I may be able to implement this feature request
  • This feature might incur a breaking change
@kukushking kukushking added the needs-triage This issue or PR still needs to be triaged. label Feb 28, 2024
@krokoko
Copy link
Collaborator

krokoko commented Feb 28, 2024

@kukushking
Copy link
Author

Thanks @krokoko. Is this supported for JumpStart Foundation Models? I don't see Inf instance types here

@kukushking
Copy link
Author

Ah sorry just found it. Nvm, closing the issue.

@kukushking
Copy link
Author

Error: The instance type ml.inf1.2xlarge is not supported. Default instance type: ml.g5.2xlarge. Supported instance types: ml.g5.2xlarge, ml.g5.4xlarge, ml.g5.8xlarge, ml.g5.16xlarge.

I get this error when deploying Mistral 7B with Inf. Am I doing something wrong?

@krokoko
Copy link
Collaborator

krokoko commented Feb 28, 2024

Could you please share the code snippet you are using ?

@kukushking
Copy link
Author

Sure, here is the link.

@krokoko
Copy link
Collaborator

krokoko commented Mar 11, 2024

Thanks @kukushking , will add it to the backlog

@krokoko krokoko added bug Something isn't working enhancement New feature or request and removed needs-triage This issue or PR still needs to be triaged. bug Something isn't working labels Mar 11, 2024
Copy link
Contributor

This issue is now marked as stale because it hasn't seen activity for a while. Add a comment or it will be closed soon. If you wish to exclude this issue from being marked as stale, add the "backlog" label.

@github-actions github-actions bot added the stale label May 11, 2024
Copy link
Contributor

Closing this issue as it hasn't seen activity for a while. Please add a comment @mentioning a maintainer to reopen. If you wish to exclude this issue from being marked as stale, add the "backlog" label.

@krokoko krokoko reopened this May 20, 2024
Copy link
Contributor

Closing this issue as it hasn't seen activity for a while. Please add a comment @mentioning a maintainer to reopen. If you wish to exclude this issue from being marked as stale, add the "backlog" label.

@krokoko krokoko added backlog and removed stale labels May 28, 2024
@krokoko krokoko reopened this May 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backlog enhancement New feature or request
Projects
Development

No branches or pull requests

2 participants