norm
Normalizes text (case, accents) without tokenization. Returns a single token for the entire input.
Options
| Option | Type | Default | Description |
|---|---|---|---|
| LOCALE | string | — | ICU locale |
| CASE | string | 'none' | Case conversion: 'none', 'lower', 'upper' |
| ACCENT | boolean | true | Preserve accent marks |
Examples
CREATE TEXT SEARCH DICTIONARY norm_dict (
    TEMPLATE = 'norm',
    LOCALE = 'en_US.UTF-8',
    CASE = 'lower',
    ACCENT = false
);
Uppercase normalization
CREATE TEXT SEARCH DICTIONARY norm_upper (
    TEMPLATE = 'norm',
    LOCALE = 'en_US.UTF-8',
    CASE = 'upper',
    ACCENT = false
);
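
To inspect what a dictionary emits, PostgreSQL's built-in `ts_lexize` function can be called on it directly. The following is a minimal sketch, assuming the `norm` template is installed in a PostgreSQL database, that `norm_dict` and `norm_upper` were created as shown above, and that the options behave as the table describes. The sample string is illustrative only, and the exact output depends on the template's implementation.

```sql
-- Assumes norm_dict and norm_upper exist (see the definitions above).
-- ts_lexize(dictionary, token) returns the resulting lexemes as text[].

-- Expected behavior per the options table: lowercase, accents removed.
SELECT ts_lexize('norm_dict', 'Crème Brûlée');

-- Expected behavior per the options table: uppercase, accents removed.
SELECT ts_lexize('norm_upper', 'Crème Brûlée');
```

Because the template returns a single token for the entire input, each call should yield a one-element array rather than one lexeme per word.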