N1QL Covering Indexes / Performance for Sorting and Where Conditions

benbenwilde · September 15, 2016, 7:29pm

I have some concerns related to sorting and where conditions when using n1ql queries.

Let’s say I have 1,000,000 documents similar to:
{ type: “good”, num: 1, other:“stuff” }

With the following index:
CREATE INDEX index_name ON bucket_name(type, num) WHERE type="good"

The following queries are covered by the index right? (no document fetching):
SELECT type,num FROM bucket_name WHERE type="good" AND num=1
SELECT type,num FROM bucket_name WHERE type="good" ORDER BY num

But the following queries are not covered:
SELECT * FROM bucket_name WHERE type="good" AND num=1
SELECT * FROM bucket_name WHERE type="good" ORDER BY num

For the not covered queries, it seems logical that the WHERE and the ORDER BY would be covered since the field they are dealing with is in the index. And then once the resulting set of index entries was determined, then the documents would be fetched in order to get the fields for the SELECT part of the query.

What I am afraid of is that this is not happening, but instead, every single document is fetched, and evaluated for the WHERE and for the ORDER BY.

In the case of the WHERE - this mean a very large number of documents will be fetched if a small number of documents meet the condition. I’m assuming it stops once it reaches the LIMIT. That would be terrible for performance.
In the case of the ORDER BY - this means that ALL documents must be fetched to sort them. That is even more terrible for performance.

I never want to make a query in my application where a where condition or a sort is not covered by the index because of the terrible performance implications. Ideally - the result set will always be determined by what’s in the index, and then additional properties would be fetched from the documents if necessary.

Is all of the above correct? Does anyone have thoughts or suggestions about this?

geraldss · September 15, 2016, 9:41pm

No, only the matching documents are fetched. That is the meaning of an index scan. What you are afraid of is a full scan / primary scan. Just use EXPLAIN, and if you see index scan, you are good.

benbenwilde · September 15, 2016, 9:46pm

So you are saying that (given 1,000,000):
CREATE INDEX index_name ON bucket_name(type, num) WHERE type="good"
SELECT * FROM bucket_name WHERE type="good" ORDER BY num LIMIT 10
Will for sure only fetch 10 documents?

geraldss · September 15, 2016, 9:55pm

For that specific index and query:

In 4.5, it will only fetch (and sort) those documents where type=‘good’, whatever that cardinality is.

In 4.5.1, it will fetch at most 10 documents, and will not perform any sort.

benbenwilde · September 15, 2016, 9:58pm

So the Order By gets ignored in 4.5.1? Not ordering isn’t acceptable in my case, but at least you’re avoiding overloading the server.
Is there no optimization for the fact that the property that is being ordered is directly in the index?

geraldss · September 15, 2016, 10:30pm

The ORDER BY does not get ignored. It does the right thing.

benbenwilde · September 15, 2016, 10:31pm

Can you clarify the above statement then?
Thanks.

geraldss · September 15, 2016, 10:38pm

Hi @benbenwilde,

You already clarified the statement yourself. The index is already sorted, so no further sort is performed.

benbenwilde · September 16, 2016, 2:32am

Ok sorry I think I finally understand your answer now.

In 4.5.1 the Order By or the Where condition is run on the index directly if possible. Then after that the documents are fetched in order to project the result if needed.
Makes sense.

So 4.5.0 has the issue that it will fetch for sorting and conditions, even if that part is covered by the index. So it seems my best alternative for now with 4.5.0 is to only select the id, and then turn around and fetch the documents manually. Does that sound right?

How long do you expect until 4.5.1 is fully released?
Thanks

geraldss · September 16, 2016, 5:15am

Yes, you described 4.5.1 correctly.

4.5 is as I described it. As I said, it will only fetch the documents where type = “good”.

Let’s ask @prasad and @keshav_m. 4.5.1 EE should be available in about a month.

Topic		Replies	Views
Select query with sorting performance advice SQL++ query , n1ql	3	1334	October 25, 2017
N1QL: performance and advanced features SQL++	3	1790	September 6, 2016
Covering index for select * Couchbase Server n1ql , index	3	992	April 7, 2021
N1QL query default order with offset and limit SQL++ n1ql , index	2	668	June 26, 2023
N1QL select with type filtering SQL++ n1ql	2	2650	April 24, 2016

N1QL Covering Indexes / Performance for Sorting and Where Conditions

Related topics