Skip to main content

norm

Normalizes text (case, accents) without tokenization. Returns a single token for the entire input.

Options

OptionTypeDefaultDescription
LOCALEstringICU locale
CASEstring'none'Case conversion: 'none', 'lower', 'upper'
ACCENTbooleantruePreserve accent marks

Examples

CREATE TEXT SEARCH DICTIONARY norm_dict (
TEMPLATE = 'norm',
LOCALE = 'en_US.UTF-8',
CASE = 'lower',
ACCENT = false
);

Uppercase normalization

CREATE TEXT SEARCH DICTIONARY norm_upper (
TEMPLATE = 'norm',
LOCALE = 'en_US.UTF-8',
CASE = 'upper',
ACCENT = false
);

See also