Class: Aws::BedrockAgentRuntime::Types::InferenceConfiguration

Inherits:
Struct
  • Object
show all
Defined in:
gems/aws-sdk-bedrockagentruntime/lib/aws-sdk-bedrockagentruntime/types.rb

Overview

Specifications about the inference parameters that were provided alongside the prompt. These are specified in the PromptOverrideConfiguration object that was set when the agent was created or updated. For more information, see Inference parameters for foundation models.

Constant Summary collapse

SENSITIVE =
[]

Instance Attribute Summary collapse

Instance Attribute Details

#maximum_lengthInteger

The maximum number of tokens allowed in the generated response.

Returns:

  • (Integer)


758
759
760
761
762
763
764
765
766
# File 'gems/aws-sdk-bedrockagentruntime/lib/aws-sdk-bedrockagentruntime/types.rb', line 758

class InferenceConfiguration < Struct.new(
  :maximum_length,
  :stop_sequences,
  :temperature,
  :top_k,
  :top_p)
  SENSITIVE = []
  include Aws::Structure
end

#stop_sequencesArray<String>

A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.

Returns:

  • (Array<String>)


758
759
760
761
762
763
764
765
766
# File 'gems/aws-sdk-bedrockagentruntime/lib/aws-sdk-bedrockagentruntime/types.rb', line 758

class InferenceConfiguration < Struct.new(
  :maximum_length,
  :stop_sequences,
  :temperature,
  :top_k,
  :top_p)
  SENSITIVE = []
  include Aws::Structure
end

#temperatureFloat

The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.

Returns:

  • (Float)


758
759
760
761
762
763
764
765
766
# File 'gems/aws-sdk-bedrockagentruntime/lib/aws-sdk-bedrockagentruntime/types.rb', line 758

class InferenceConfiguration < Struct.new(
  :maximum_length,
  :stop_sequences,
  :temperature,
  :top_k,
  :top_p)
  SENSITIVE = []
  include Aws::Structure
end

#top_kInteger

While generating a response, the model determines the probability of the following token at each point of generation. The value that you set for topK is the number of most-likely candidates from which the model chooses the next token in the sequence. For example, if you set topK to 50, the model selects the next token from among the top 50 most likely choices.

Returns:

  • (Integer)


758
759
760
761
762
763
764
765
766
# File 'gems/aws-sdk-bedrockagentruntime/lib/aws-sdk-bedrockagentruntime/types.rb', line 758

class InferenceConfiguration < Struct.new(
  :maximum_length,
  :stop_sequences,
  :temperature,
  :top_k,
  :top_p)
  SENSITIVE = []
  include Aws::Structure
end

#top_pFloat

While generating a response, the model determines the probability of the following token at each point of generation. The value that you set for Top P determines the number of most-likely candidates from which the model chooses the next token in the sequence. For example, if you set topP to 80, the model only selects the next token from the top 80% of the probability distribution of next tokens.

Returns:

  • (Float)


758
759
760
761
762
763
764
765
766
# File 'gems/aws-sdk-bedrockagentruntime/lib/aws-sdk-bedrockagentruntime/types.rb', line 758

class InferenceConfiguration < Struct.new(
  :maximum_length,
  :stop_sequences,
  :temperature,
  :top_k,
  :top_p)
  SENSITIVE = []
  include Aws::Structure
end