Default Character Filters
- Capella Operational
- reference
Character filters remove unwanted characters from your search input.
You can use a character filter when you create a custom analyzer. Choose a default character filter or create your own.
The following default character filters are available:
Character Filter | Description |
---|---|
asciifolding |
The analyzer converts any characters that aren’t in the Basic Latin Unicode block to their ASCII equivalent. This means the filter converts any alphabetic, numeric, or symbol characters that aren’t in the first 127 ASCII characters. For example, the character filter converts |
html |
The analyzer removes all HTML tags from search input. For example, the character filter removes the <p> tags from indexed content, but keeps the text inside the <p> tag. |
zero_width_spaces |
The analyzer replaces zero-width non-joiner spaces with regular space characters. Zero-width non-joiner spaces are unicode characters that interrupt ligatures(joins between characters) in text formatting. |