Full Text Search (FTS) Using the Go SDK with Couchbase Server
- how-to
You can use the Full Text Search service (FTS) to create queryable full-text indexes in Couchbase Server.
Full Text Search or FTS allows you to create, manage, and query full text indexes on JSON documents stored in Couchbase buckets. It uses natural language processing for querying documents, provides relevance scoring on the results of your queries, and has fast indexes for querying a wide range of possible text searches. Some of the supported query types include simple queries like Match and Term queries; range queries like Date Range and Numeric Range; and compound queries for conjunctions, disjunctions, and/or boolean queries. The Go SDK exposes an API for performing FTS queries which abstracts some of the complexity of using the underlying REST API.
When using a Couchbase version < 6.5 you must create a valid Bucket connection using cluster.Bucket(name) before you can use Search.
|
Getting Started
After familiarizing yourself with how to create and query a search index in the UI you can query it from the SDK.
Search queries are executed at Cluster level (not bucket or collection).
As of Couchbase Server 6.5+ they do also not require a bucket to be opened first.
In older versions of Couchbase Server, even though executed at Cluster level, a bucket must be opened before performing queries.
Search queries, facets, and sorting have a slightly different import from other components of gocb
, you can import them using import "github.com/couchbase/gocb/v2/search"
.
Here is a simple MatchQuery that looks for the text “swanky” using a defined index:
cluster, err := gocb.Connect("localhost", opts)
if err != nil {
panic(err)
}
// For Server versions 6.5 or later you do not need to open a bucket here
bucket := cluster.Bucket("travel-sample")
// We wait until the bucket is definitely connected and setup.
// For Server versions 6.5 or later if we hadn't opened a bucket then we could use cluster.WaitUntilReady here.
err = bucket.WaitUntilReady(5*time.Second, nil)
if err != nil {
panic(err)
}
matchResult, err := cluster.SearchQuery(
"travel-sample-index",
search.NewMatchQuery("swanky"),
&gocb.SearchOptions{
Limit: 10,
Fields: []string{"description"},
},
)
if err != nil {
panic(err)
}
We have also included the Fields
option which will get the content of the specified (indexed) field as a part of the response.
All simple query types are created in the same manner, although some have additional properties, which can be seen in common query type descriptions. Couchbase FTS’s range of query types enable powerful searching using multiple options, to ensure results are just within the range wanted. Here is a date range query that looks for dates between 1st January 2019 and 1st February, the second parameter is whether the date should be considered inclusive:
dateRangeResult, err := cluster.SearchQuery(
"travel-sample-index",
search.NewDateRangeQuery().Start("2019-01-01", true).End("2019-02-01", false),
&gocb.SearchOptions{
Limit: 10,
},
)
if err != nil {
panic(err)
}
Queries can also be combined together. A conjunction query contains multiple child queries; its result documents must satisfy all of the child queries:
conjunctionResult, err := cluster.SearchQuery(
"travel-sample-index",
search.NewConjunctionQuery(
search.NewMatchQuery("swanky"),
search.NewDateRangeQuery().Start("2019-01-01", true).End("2019-02-01", false),
),
&gocb.SearchOptions{
Limit: 10,
},
)
if err != nil {
panic(err)
}
Working with Results
The result of a search query has three components: rows, facets, and metdata. Rows are the documents that match the query. Facets allow the aggregation of information collected on a particular result set. Metdata holds additional information not directly related to your query, such as success total hits and how long the query took to execute in the cluster.
Here we are iterating over the rows that were returned in the results.
Note that Fields
is a special case, where it’s a function.
Fields
will include any fields that were requested as part of the SearchQuery (Fields
option within the options block).
for matchResult.Next() {
row := matchResult.Row()
docID := row.ID
score := row.Score
var fields interface{}
err := row.Fields(&fields)
if err != nil {
panic(err)
}
fmt.Printf("Document ID: %s, search score: %f, fields included in result: %v\n", docID, score, fields)
}
// always check for errors after iterating
err = matchResult.Err()
if err != nil {
panic(err)
}
Take care to ensure you call Err
after accessing rows and check for any errors returned.
Facets can only be accessed once all the rows have been iterated.
facetsResult, err := cluster.SearchQuery(
"travel-sample-index",
search.NewMatchAllQuery(),
&gocb.SearchOptions{
Limit: 10,
Facets: map[string]search.Facet{
"type": search.NewTermFacet("type", 5),
},
},
)
if err != nil {
panic(err)
}
for facetsResult.Next() {
}
facets, err := facetsResult.Facets()
if err != nil {
panic(err)
}
for _, facet := range facets {
field := facet.Field
total := facet.Total
fmt.Printf("Facet field: %s, total: %d\n", field, total)
}
Consistency
Like the Couchbase Query Service,
FTS allows RequestPlus
queries — Read-Your-Own_Writes (RYOW) consistency, ensuring results contain information from updated indexes:
collection := bucket.Scope("inventory").Collection("hotel")
hotel := struct {
Description string `json:"description"`
}{Description: "super swanky"}
myWriteResult, err := collection.Upsert("a-new-hotel", hotel, nil)
if err != nil {
panic(err)
}
time.Sleep(5 * time.Second)
consistentWithResult, err := cluster.SearchQuery(
"travel-sample-index",
search.NewMatchQuery("swanky"),
&gocb.SearchOptions{
Limit: 10,
ConsistentWith: gocb.NewMutationState(*myWriteResult.MutationToken()),
},
)
if err != nil {
panic(err)
}