Search Index Features
- Capella Operational
- concept
Search indexes in Couchbase Capella have multiple features that you can configure to improve performance and fine tune your search results.
Some features are only available in Advanced Mode.
You can add the following components and configure the following options for a Search index:
Option | Quick Mode | Advanced Mode | Description |
---|---|---|---|
✓ |
✓ |
Use a type mapping to include or exclude specific documents in a collection from an index. Type mappings can also set a field’s data type and other settings. You can create two types of type mappings with the Search Service:
Type mappings start at the collection level. Create additional mappings for child fields or JSON objects under a collection’s type mapping to restrict the documents added to your index. This can improve Search index performance over indexing entire collections. If your cluster is running Couchbase Server version 7.6.2 and later, you can also choose to include document metadata inside your Search index by creating an XATTRs mapping. For more information about how to configure settings for mappings and type mappings in Quick Mode, see Quick Index Mapping and Field Options. For more information about how to configure a type mapping in Advanced Mode, see Create a Type Mapping in Advanced Mode. |
|
Advanced Settings |
✓ |
Set advanced settings with Advanced Mode to change your index’s default analyzer, replication, and more. For more information about how to change general settings, see Set Search Index General Settings. |
|
✓ |
In Advanced Mode, configure a document filter to add or remove documents in your Search index that meet certain conditions:
For more information about how to configure a document filter, see Set a Search Index Type Identifier. |
||
✓ |
In Advanced Mode, use analyzers to improve and customize the search results in your index. Analyzers transform input text into tokens, which give you greater control over your index’s text matching. You can use one of Couchbase’s built-in analyzers or create your own. For more information about how to create a custom analyzer, see Create a Custom Analyzer. Analyzers have different components that control how text is transformed for search. When you create a custom analyzer, you can choose these components. For more information, see Custom Filters. |
||
✓ |
In Advanced Mode, use custom filters to add more customization to a custom analyzer. For more information about these filters, see the Custom Filters section. |
||
✓ |
In Advanced Mode, if the documents in your index contain date and time data in a format other than RFC-3339 (ISO-8601), then you need to create a date/time parser. A custom date/time parser tells the Search index how to interpret date data from your documents. For more information about how to add a custom date/time parser, see Create a Custom Date/Time Parser. |
Custom Filters
Custom filters are components of a Search index analyzer.
Create and add these components to a custom analyzer to improve search results and performance for an index in Advanced Mode.
You can create the following custom filters:
Character Filters
Character filters remove unwanted characters from the input for a search. For example, the default html character filter removes HTML tags from your search content.
You can use a default character filter in an analyzer or create your own.
For more information about the available default character filters, see Default Character Filters.
For more information about how to create your own custom character filter, see Create a Custom Character Filter.
Tokenizers
Tokenizers separate input strings into individual tokens. These tokens are combined into token streams. The Search Service takes token streams from search queries to determine matches for token streams in search results.
You can use a default tokenizer in an analyzer or create your own.
For more information about the available default tokenizers, see Default Tokenizers.
For more information about how to create your own tokenizer, see Create a Custom Tokenizer.
Token Filters
Token filters take the token stream from a tokenizer and modify the tokens.
A token filter can create stems from tokens to increase the matches for a search term.
For example, if a token filter creates the stem play
, a search can return matches for player
, playing
, and playable
.
The Search Service has default tokenizers available. For a list of all available tokenizers, see Default Token Filters.
You can also create your own token filters. Custom token filters can use Wordlists to modify their tokens. For more information about how to create your own token filter, see Create a Custom Token Filter.
Wordlists
Wordlists define a list of words that you can use with a token filter to create tokens.
You can use a wordlist to find words and create tokens, or remove words from a tokenizer’s token stream.
When you create a custom token filter, the Search Service has a set of default wordlists. For more information about the available default wordlists, see Default Wordlists.
For more information about how to create your own wordlist, see Create a Custom Wordlist.