You are viewing documentation for version 2 of the AWS SDK for Ruby. Version 3 documentation can be found here.

Class: Aws::CloudSearch::Types::AnalysisScheme

Inherits:
Struct
  • Object
show all
Defined in:
(unknown)

Overview

Note:

When passing AnalysisScheme as input to an Aws::Client method, you can use a vanilla Hash:

{
  analysis_scheme_name: "StandardName", # required
  analysis_scheme_language: "ar", # required, accepts ar, bg, ca, cs, da, de, el, en, es, eu, fa, fi, fr, ga, gl, he, hi, hu, hy, id, it, ja, ko, lv, mul, nl, no, pt, ro, ru, sv, th, tr, zh-Hans, zh-Hant
  analysis_options: {
    synonyms: "String",
    stopwords: "String",
    stemming_dictionary: "String",
    japanese_tokenization_dictionary: "String",
    algorithmic_stemming: "none", # accepts none, minimal, light, full
  },
}

Configuration information for an analysis scheme. Each analysis scheme has a unique name and specifies the language of the text to be processed. The following options can be configured for an analysis scheme: Synonyms, Stopwords, StemmingDictionary, JapaneseTokenizationDictionary and AlgorithmicStemming.

Returned by:

Instance Attribute Summary collapse

Instance Attribute Details

#analysis_optionsTypes::AnalysisOptions

Synonyms, stopwords, and stemming options for an analysis scheme. Includes tokenization dictionary for Japanese.

Returns:

#analysis_scheme_languageString

An IETF RFC 4646 language code or mul for multiple languages.

Possible values:

  • ar
  • bg
  • ca
  • cs
  • da
  • de
  • el
  • en
  • es
  • eu
  • fa
  • fi
  • fr
  • ga
  • gl
  • he
  • hi
  • hu
  • hy
  • id
  • it
  • ja
  • ko
  • lv
  • mul
  • nl
  • no
  • pt
  • ro
  • ru
  • sv
  • th
  • tr
  • zh-Hans
  • zh-Hant

Returns:

  • (String)

    An IETF RFC 4646 language code or mul for multiple languages.

    .

#analysis_scheme_nameString

Names must begin with a letter and can contain the following characters: a-z (lowercase), 0-9, and _ (underscore).

Returns:

  • (String)

    Names must begin with a letter and can contain the following characters: a-z (lowercase), 0-9, and _ (underscore).

    .