Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
SPDX-License-Identifier: CC-BY-SA-4.0
Describes weight and capacities for a production variant associated with an endpoint. If you sent a request to the UpdateEndpointWeightsAndCapacities API and the endpoint status is Updating, you get different desired and current values.
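As a minimal sketch of how the desired and current values can be compared, the following Python helper lists the settings that are still being applied. It assumes the summary is available as a plain dictionary keyed by the field names on this page (for example, one element of a `ProductionVariants` list); the helper name `pending_changes` and the sample values are hypothetical.

```python
from typing import Any, Dict, List


def pending_changes(variant: Dict[str, Any]) -> List[str]:
    """Return the settings whose desired value differs from the current one."""
    changed = []
    for setting in ("InstanceCount", "Weight"):
        current = variant.get(f"Current{setting}")
        desired = variant.get(f"Desired{setting}")
        # All four fields are optional, so compare only when both are present.
        if current is not None and desired is not None and current != desired:
            changed.append(setting)
    return changed


# Hypothetical summary for illustration only.
example = {
    "VariantName": "variant-a",
    "CurrentInstanceCount": 1,
    "DesiredInstanceCount": 3,
    "CurrentWeight": 1.0,
    "DesiredWeight": 1.0,
}
print(pending_changes(example))  # ['InstanceCount']
```

An empty list suggests the variant has already reached the requested weight and capacity.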
CurrentInstanceCount The number of instances associated with the variant.
Type: Integer
Valid Range: Minimum value of 1.
Required: No
CurrentWeight The weight associated with the variant.
Type: Float
Valid Range: Minimum value of 0.
Required: No
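To illustrate how a weight is typically interpreted, the sketch below converts the current weights of several variants into traffic fractions. It assumes that traffic is distributed in proportion to each variant's weight relative to the sum of all weights on the endpoint; that routing rule is not stated on this page, and the function name `traffic_shares` and the sample data are hypothetical.

```python
from typing import Any, Dict, List


def traffic_shares(variants: List[Dict[str, Any]]) -> Dict[str, float]:
    """Map each VariantName to its fraction of the total CurrentWeight."""
    # CurrentWeight is optional; treat a missing weight as 0 (an assumption).
    total = sum(v.get("CurrentWeight", 0.0) for v in variants)
    if total == 0:
        # No weight assigned anywhere, so no share can be computed.
        return {v["VariantName"]: 0.0 for v in variants}
    return {v["VariantName"]: v.get("CurrentWeight", 0.0) / total for v in variants}


# Hypothetical variants for illustration only.
shares = traffic_shares([
    {"VariantName": "variant-a", "CurrentWeight": 1.0},
    {"VariantName": "variant-b", "CurrentWeight": 3.0},
])
print(shares)  # {'variant-a': 0.25, 'variant-b': 0.75}
```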
DeployedImages An array of DeployedImage objects that specify the Amazon EC2 Container Registry paths of the inference images deployed on instances of this ProductionVariant.
Type: Array of DeployedImage objects
Required: No
DesiredInstanceCount The number of instances requested in the UpdateEndpointWeightsAndCapacities request.
Type: Integer
Valid Range: Minimum value of 1.
Required: No
DesiredWeight The requested weight, as specified in the UpdateEndpointWeightsAndCapacities request.
Type: Float
Valid Range: Minimum value of 0.
Required: No
VariantName The name of the variant.
Type: String
Length Constraints: Maximum length of 63.
Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9])*
Required: Yes
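The VariantName constraints above can be checked on the client side before a name is used. The following is a minimal sketch, assuming the whole string must match the documented pattern (the pattern as written has no end anchor, so `re.fullmatch` is used here); the function name `is_valid_variant_name` is hypothetical.

```python
import re

# Pattern and maximum length copied from the VariantName constraints.
VARIANT_NAME_PATTERN = re.compile(r"^[a-zA-Z0-9](-*[a-zA-Z0-9])*")
MAX_VARIANT_NAME_LENGTH = 63


def is_valid_variant_name(name: str) -> bool:
    """Check a candidate name against the length limit and the pattern."""
    if not 1 <= len(name) <= MAX_VARIANT_NAME_LENGTH:
        return False
    # Assumption: the entire name must match, not just a prefix of it.
    return VARIANT_NAME_PATTERN.fullmatch(name) is not None


print(is_valid_variant_name("variant-a"))  # True
print(is_valid_variant_name("-variant"))   # False
print(is_valid_variant_name("a" * 64))     # False
```

Under this reading, names may contain hyphens only between alphanumeric characters, so a leading or trailing hyphen is rejected.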
For more information about using this API in one of the language-specific AWS SDKs, see the following:
+ AWS SDK for C++
+ AWS SDK for Go
+ AWS SDK for Go - Pilot
+ AWS SDK for Java
+ AWS SDK for Ruby V2