@rgadagot
Contributor

Added support for multiple instance types in InferenceEndpointConfig by introducing a new optional instanceTypes field, which lists instance preferences in priority order. The original instanceType field is now optional as well, because the two fields are mutually exclusive and a config should not set both.
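
For reviewers, here is a minimal sketch of how the mutual exclusivity described above could be enforced. It is not the code in this PR: `EndpointConfigSketch` and `candidates()` are hypothetical names, the instance type strings are example values, and the rule that at least one of the two fields must be set is an assumption the description does not state.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class EndpointConfigSketch:
    """Hypothetical stand-in for InferenceEndpointConfig (illustration only)."""

    instanceType: Optional[str] = None
    instanceTypes: Optional[List[str]] = None

    def __post_init__(self) -> None:
        # Mutually exclusive: reject configs that set both fields.
        if self.instanceType is not None and self.instanceTypes is not None:
            raise ValueError("Set either instanceType or instanceTypes, not both")
        # Assumption (not stated in the PR): at least one field is required.
        if self.instanceType is None and not self.instanceTypes:
            raise ValueError("Provide instanceType or a non-empty instanceTypes list")

    def candidates(self) -> List[str]:
        """Return instance types in priority order, most preferred first."""
        if self.instanceTypes is not None:
            return list(self.instanceTypes)
        return [self.instanceType]


# Single-type config, as before this change.
single = EndpointConfigSketch(instanceType="ml.g5.xlarge")

# Multi-type config: entries are listed in priority order.
multi = EndpointConfigSketch(instanceTypes=["ml.g5.2xlarge", "ml.g5.xlarge"])
```

Exposing both shapes through one ordered list keeps downstream code from branching on which field was set.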

@rgadagot requested a review from a team as a code owner January 26, 2026 18:23
@rgadagot deployed to manual-approval with GitHub Actions January 26, 2026 20:09
@Aditi2424 merged commit f4fc838 into aws:main Jan 27, 2026
6 checks passed
5 participants