N1ql array index performance

littlewitchanita · November 30, 2016, 10:03am

Hi,

I’m using CB 4.5.1-2844 Enterprise Edition and trying to compare 2 approaches to query my data below:

Case:
We have groups and users, which a group have multiple users and a user will belong to multiple groups.
Will need to list by 2 ways, (1) by group, (2) by user

Approach 1:
Store the information inside the group object, like:
{ “groupId”: 1,
“users”: [
{“userId”:11},
{“userId”:12}
]}

using array index, it took in average 90ms to query by “userId”

Approach 2:
Store information in a “row-by-row” way, like:
{ "groupId: 1, “userId”: 11}
{ "groupId: 1, “userId”: 12}

using 2 indexes, 1 for groupId and 1 for userId, it took me in average 45ms

For document databases, I believe approach 1 is better in data structure, but test result shows that approach 2 has better performance. Are there anyway to improve the performance? Please let me know if I did anything wrong.

Thanks.

atom_yang · November 30, 2016, 10:26am

If you only want to query performance for list by 2 ways, (1) by group, (2) by user,you can store group list inside the user object,and store user list inside the group object,such as
group object:

{ "id": 1,
"users": [
{"userId":11},
{"userId":12}
]}

user object:

{ "id": 11,
"groups": [
{"groupId":1}
]}

and

{ "id": 12,
"groups": [
{"groupId":1}
]}

all query only need one hit by keyScan,and you only need create Primary index.

but I think you should also need balance the cost of update the document (also with updating index)/query document.

littlewitchanita · November 30, 2016, 10:32am

thx, this is what we are doing currently without applying n1ql

The problem here is whenever there’s any disaster (e.g. overloaded, reboot), if not all the updates are executed successfully, there will be an issue that the result is different when list by group and user. That’s why we are switching to use n1ql.

atom_yang · November 30, 2016, 10:42am

I think N1QL can not solve this problem.
and FYI

littlewitchanita · November 30, 2016, 10:44am

as long as I keep the data in 1 single object, then no transaction will be needed, it can just be a success or fail update

prasad · November 30, 2016, 6:29pm

if updating two documents, then your application will need to make sure either both docs are updated, or none.

yes

can you post the query. Are you using array indexing?

-Prasad

littlewitchanita · December 1, 2016, 6:32am

Yes, I’m using array indexing:

CREATE INDEX group_user ON dev ( DISTINCT ARRAY x.userId FOR x IN d.users END ) where t = “group”;

SELECT meta(dev).id from dev WHERE t = “group” AND ANY x IN d.users SATISFIES x.userId = 11 END;

the exact data structure is:
{
“t”: “group”,
“d”: {
“users”: [
{“userId”: 11},
{“userId”: 12}
]
}
}

geraldss · December 1, 2016, 6:28pm

Hi @littlewitchanita,

What is wrong with your original approach 2? It looks good to me, especially if you are getting desired performance. It seems to address the transaction issue.

littlewitchanita · December 2, 2016, 4:25am

For performance, approach 2 seems to be better, it’s just violating the best practice of nosql, so I’m wondering if there’ll be a better solution

geraldss · December 2, 2016, 4:32am

Your approach #2 is completely consistent with the best practices of NoSQL. No issue there.

geraldss · December 2, 2016, 4:33am

NoSQL gives you flexibility. It does not mean you must always do the opposite of relational.

Topic		Replies	Views
N1ql index performance is different on same query SQL++ query , n1ql , index	3	814	March 30, 2020
Performance of single vs multiple n1ql queries for small data sets SQL++	6	2442	August 23, 2017
I have some doubt about N1ql SQL++	8	3043	July 28, 2015
How to speedup n1ql query - what to scale (index, query, data)? Couchbase Server n1ql	1	1755	May 8, 2016
Can N1QL query to stored index or create indexes? SQL++	1	2045	June 21, 2014

N1ql array index performance

Related topics