Copyright | (c) 2013-2021 Brendan Hay |
---|---|
License | Mozilla Public License, v. 2.0. |
Maintainer | Brendan Hay <brendan.g.hay+amazonka@gmail.com> |
Stability | auto-generated |
Portability | non-portable (GHC extensions) |
Safe Haskell | None |
Synopsis
- data AnalysisOptions = AnalysisOptions' {}
- newAnalysisOptions :: AnalysisOptions
- analysisOptions_algorithmicStemming :: Lens' AnalysisOptions (Maybe AlgorithmicStemming)
- analysisOptions_stopwords :: Lens' AnalysisOptions (Maybe Text)
- analysisOptions_japaneseTokenizationDictionary :: Lens' AnalysisOptions (Maybe Text)
- analysisOptions_synonyms :: Lens' AnalysisOptions (Maybe Text)
- analysisOptions_stemmingDictionary :: Lens' AnalysisOptions (Maybe Text)
Documentation
data AnalysisOptions Source #
Synonyms, stopwords, and stemming options for an analysis scheme. Includes tokenization dictionary for Japanese.
See: newAnalysisOptions
smart constructor.
AnalysisOptions' | |
|
Instances
newAnalysisOptions :: AnalysisOptions Source #
Create a value of AnalysisOptions
with all optional fields omitted.
Use generic-lens or optics to modify other optional fields.
The following record fields are available, with the corresponding lenses provided for backwards compatibility:
$sel:algorithmicStemming:AnalysisOptions'
, analysisOptions_algorithmicStemming
- The level of algorithmic stemming to perform: none
, minimal
,
light
, or full
. The available levels vary depending on the language.
For more information, see
Language Specific Text Processing Settings
in the Amazon CloudSearch Developer Guide
$sel:stopwords:AnalysisOptions'
, analysisOptions_stopwords
- A JSON array of terms to ignore during indexing and searching. For
example, ["a", "an", "the", "of"]
. The stopwords dictionary
must explicitly list each word you want to ignore. Wildcards and regular
expressions are not supported.
$sel:japaneseTokenizationDictionary:AnalysisOptions'
, analysisOptions_japaneseTokenizationDictionary
- A JSON array that contains a collection of terms, tokens, readings and
part of speech for Japanese Tokenizaiton. The Japanese tokenization
dictionary enables you to override the default tokenization for selected
terms. This is only valid for Japanese language fields.
$sel:synonyms:AnalysisOptions'
, analysisOptions_synonyms
- A JSON object that defines synonym groups and aliases. A synonym group
is an array of arrays, where each sub-array is a group of terms where
each term in the group is considered a synonym of every other term in
the group. The aliases value is an object that contains a collection of
string:value pairs where the string specifies a term and the array of
values specifies each of the aliases for that term. An alias is
considered a synonym of the specified term, but the term is not
considered a synonym of the alias. For more information about specifying
synonyms, see
Synonyms
in the Amazon CloudSearch Developer Guide.
$sel:stemmingDictionary:AnalysisOptions'
, analysisOptions_stemmingDictionary
- A JSON object that contains a collection of string:value pairs that each
map a term to its stem. For example,
{"term1": "stem1", "term2": "stem2", "term3": "stem3"}
.
The stemming dictionary is applied in addition to any algorithmic
stemming. This enables you to override the results of the algorithmic
stemming to correct specific cases of overstemming or understemming. The
maximum size of a stemming dictionary is 500 KB.
analysisOptions_algorithmicStemming :: Lens' AnalysisOptions (Maybe AlgorithmicStemming) Source #
The level of algorithmic stemming to perform: none
, minimal
,
light
, or full
. The available levels vary depending on the language.
For more information, see
Language Specific Text Processing Settings
in the Amazon CloudSearch Developer Guide
analysisOptions_stopwords :: Lens' AnalysisOptions (Maybe Text) Source #
A JSON array of terms to ignore during indexing and searching. For
example, ["a", "an", "the", "of"]
. The stopwords dictionary
must explicitly list each word you want to ignore. Wildcards and regular
expressions are not supported.
analysisOptions_japaneseTokenizationDictionary :: Lens' AnalysisOptions (Maybe Text) Source #
A JSON array that contains a collection of terms, tokens, readings and part of speech for Japanese Tokenizaiton. The Japanese tokenization dictionary enables you to override the default tokenization for selected terms. This is only valid for Japanese language fields.
analysisOptions_synonyms :: Lens' AnalysisOptions (Maybe Text) Source #
A JSON object that defines synonym groups and aliases. A synonym group is an array of arrays, where each sub-array is a group of terms where each term in the group is considered a synonym of every other term in the group. The aliases value is an object that contains a collection of string:value pairs where the string specifies a term and the array of values specifies each of the aliases for that term. An alias is considered a synonym of the specified term, but the term is not considered a synonym of the alias. For more information about specifying synonyms, see Synonyms in the Amazon CloudSearch Developer Guide.
analysisOptions_stemmingDictionary :: Lens' AnalysisOptions (Maybe Text) Source #
A JSON object that contains a collection of string:value pairs that each
map a term to its stem. For example,
{"term1": "stem1", "term2": "stem2", "term3": "stem3"}
.
The stemming dictionary is applied in addition to any algorithmic
stemming. This enables you to override the results of the algorithmic
stemming to correct specific cases of overstemming or understemming. The
maximum size of a stemming dictionary is 500 KB.