Class: Aws::SageMaker::Types::ScalingPolicyMetric
- Inherits:
-
Struct
- Object
- Struct
- Aws::SageMaker::Types::ScalingPolicyMetric
- Defined in:
- gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb
Overview
The metric for a scaling policy.
Constant Summary collapse
- SENSITIVE =
[]
Instance Attribute Summary collapse
-
#invocations_per_instance ⇒ Integer
The number of invocations sent to a model, normalized by
InstanceCount
in each ProductionVariant. -
#model_latency ⇒ Integer
The interval of time taken by a model to respond as viewed from SageMaker.
Instance Attribute Details
#invocations_per_instance ⇒ Integer
The number of invocations sent to a model, normalized by
InstanceCount
in each ProductionVariant. 1/numberOfInstances
is
sent as the value on each request, where numberOfInstances
is the
number of active instances for the ProductionVariant behind the
endpoint at the time of the request.
38191 38192 38193 38194 38195 38196 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 38191 class ScalingPolicyMetric < Struct.new( :invocations_per_instance, :model_latency) SENSITIVE = [] include Aws::Structure end |
#model_latency ⇒ Integer
The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
38191 38192 38193 38194 38195 38196 |
# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 38191 class ScalingPolicyMetric < Struct.new( :invocations_per_instance, :model_latency) SENSITIVE = [] include Aws::Structure end |