ngram
Generates character n-grams for fuzzy and substring matching.
Options
| Option | Type | Default | Description |
|---|---|---|---|
MINGRAM | integer | 2 | Minimum n-gram length |
MAXGRAM | integer | 3 | Maximum n-gram length |
PRESERVEORIGINAL | boolean | false | Emit original token alongside n-grams |
INPUTTYPE | string | 'utf8' | Input encoding: 'binary', 'utf8' |
STARTMARKER | string | — | Prefix marker at n-gram boundary |
ENDMARKER | string | — | Suffix marker at n-gram boundary |
Examples
CREATE TEXT SEARCH DICTIONARY ngram_dict (
TEMPLATE = 'ngram',
MINGRAM = 2,
MAXGRAM = 3
);
Unigrams and bigrams
CREATE TEXT SEARCH DICTIONARY unigram_dict (
TEMPLATE = 'ngram',
MINGRAM = 1,
MAXGRAM = 2
);