Class: Aws::SageMaker::Types::ScalingPolicyMetric

Inherits:
Struct
  • Object
Defined in:
gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb

Overview

The metric for a scaling policy.

Constant Summary

SENSITIVE = []

Instance Attribute Summary

  • #invocations_per_instance ⇒ Integer
  • #model_latency ⇒ Integer

Instance Attribute Details

#invocations_per_instance ⇒ Integer

The number of invocations sent to a model, normalized by InstanceCount in each ProductionVariant. Each request contributes a value of 1/numberOfInstances, where numberOfInstances is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.

Returns:

  • (Integer)


# File 'gems/aws-sdk-sagemaker/lib/aws-sdk-sagemaker/types.rb', line 38191

class ScalingPolicyMetric < Struct.new(
  :invocations_per_instance,
  :model_latency)
  SENSITIVE = []
  include Aws::Structure
end
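
The normalization described for invocations_per_instance can be illustrated with a little arithmetic. The following is a minimal sketch, not SDK code: the helper name and its input (a list holding the active instance count observed at each request) are hypothetical and exist only to show how per-request values of 1/numberOfInstances add up.

```ruby
# Illustrative only: models how a per-request value of 1/numberOfInstances
# accumulates into an invocations-per-instance figure.
# `active_instances_per_request` is a hypothetical input, not an SDK call.
def invocations_per_instance(active_instances_per_request)
  # Rational keeps the sum exact instead of accumulating float error.
  active_instances_per_request.sum { |n| Rational(1, n) }
end

# 120 requests arrive while 4 instances are active,
# then 60 more arrive while 2 instances are active.
requests = Array.new(120, 4) + Array.new(60, 2)
total = invocations_per_instance(requests)

puts total.to_f # prints 60.0 (120/4 + 60/2)
```

Because each request is weighted by the instance count at the time it arrives, the total reflects load per instance even when the fleet size changes partway through the measurement window.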

#model_latency ⇒ Integer

The time a model takes to respond, as viewed from SageMaker. This interval includes the local communication time needed to send the request to the model's container and fetch the response, plus the time taken to complete inference in the container.

Returns:

  • (Integer)
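
As a rough usage sketch, the struct can be built with named members and read back through its accessors. The example assumes the aws-sdk-sagemaker gem is installed; if it is not, it falls back to a plain Struct stand-in with the same two members so the snippet still runs. The constant name MetricStruct and the value 100 are illustrative, not part of the SDK.

```ruby
begin
  require 'aws-sdk-sagemaker'
  MetricStruct = Aws::SageMaker::Types::ScalingPolicyMetric
rescue LoadError
  # Stand-in (assumption): mirrors only the two documented members.
  MetricStruct = Struct.new(:invocations_per_instance, :model_latency,
                            keyword_init: true)
end

# Set one member by name; the other is left unset.
metric = MetricStruct.new(invocations_per_instance: 100)

puts metric.invocations_per_instance
puts metric.model_latency.inspect
puts MetricStruct.members.inspect
```

Members that are not supplied simply read back as nil, so callers can check which metric a given value carries.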

