Covering Index with performance using select query

rkumar · August 25, 2016, 12:07pm

Hi All,

I have document structure like below:

{

“SURVEY_TYPE”:“FEEDBACK”,
“MENTION_TIME”:“2008-06-07 05:30:00”,
“ID”:21206,
“ACTIVE_FLAG”:true,
“SOURCE_ID”:100115,
“CONTENTS”:[

  { 
     "ACTIVE_FLAG":true, 
     "PREDEFINED":false, 
     "MD5":"911d10c91c6c80afb0a1a4d4ab786d24", 
     "QUESTION_ID":1000000000, 
     "STATUS":"P", 
     "PUBLISHING_API_STATUS":"N", 
     "OPTION_ID":1, 
     "MODIFIED_TIME":"2015-08-18 10:31:08", 
     "SENTIMENTS":[ 
        { 
           "GROUP_ID":"npaaru@bfountain.com_1394536948561", 
           "CONTENT_ID":2175967,
           "CATEGORY_ID":4015, 
           "ACTIVE_FLAG":true, 
           "SCORE":1, 
          
        } 
       
     ], 
    
     "NLP_FLAG":true, 
     "CREATED_TIME":"2014-03-11 09:18:29" 
  }

],
“REVIEW_COUNT”:1,
“LOCATION_ID”:0,
“OWNER_POST_FLAG”:false,
“PERSON_ID”:11489,
“RECOMMEND_FLAG”:true,
“HARVESTER_ID”:74177,

}

so i’ve the requirement to get the average score in minimum time using indexing,so i go through the docs having covering index example and i wrote below index:

CREATE INDEX idx_score ON default
((DISTINCT ARRAY
(DISTINCT ARRAY s.SCORE AND s.CATEGORY_ID FOR s IN c.SENTIMENTS END)
FOR c IN CONTENTS END),PERSON_ID,SURVEY_TYPE,LOCATION_ID,SOURCE_ID,CONTENTS)
WHERE MENTION_TIME>=‘2001-01-01’;

then i use below SELECT query to fetch the value using index “idx_score”,

EXPLAIN SELECT AVG( s.SCORE)
FROM default t
UNNEST t.CONTENTS c
UNNEST c.SENTIMENTS s
WHERE t.PERSON_ID=11489 AND s.CATEGORY_ID IN [4015,4015,4026,4034]and t.MENTION_TIME >=‘2001-01-01’ and t.MENTION_TIME<‘2016-03-07’ ;

BUT EXPLAIN SHOW BELOW:
{
“plan”: {
“#operator”: “Sequence”,
“~children”: [
{
“#operator”: “PrimaryScan”,
“index”: “#primary”,
“keyspace”: “default”,
“namespace”: “default”,
“using”: “gsi”
},

WHICH USING PRIMARY INDEX KEY.

so i have below question here:
Q1: Is the problem i’m using ALIAS name ,like default t and using it.
how to make se select query to using this index.
Q2: how to use this if i have a alias name in select query
Q3: if i use any one of attributes mention in create index in my select query it ,should take this index in run time.

Any help will be appropriated.

keshav_m · August 26, 2016, 1:04am

Hi @rkumar, Use this example as the model and work your way on exploiting array indexing for UNNEST.

insert into b values("k1", {"x":1, "y": [10, 20, 30]});
create index ib1 on b(DISTINCT ARRAY v FOR v IN y END);
explain select v
from b unnest y v
where v = 10;

delete from b;
insert into b values("k1", {"x":1, "y": [ {"p":"a", "q": [11, 22,33]}, {"p":"b", "q": [44, 55,66]}]});
create index ib2 on b
   (DISTINCT ARRAY
         (DISTINCT ARRAY v2 FOR v2 IN v1.q END)
         FOR v1 IN y END );
explain select * from b where ANY v1 in y satisfies (any v2 in v1.q satisfies v2 = 44 end) end;
explain select * from b unnest y v1 unnest v1.q as v2 where v2= 44;

rkumar · August 26, 2016, 7:17am

Hi @keshav_m,@geraldss

Thank you for this explanation,
The structure you created for example is very much similar to ours,that cool, but still its not working,i did this previously and it not working for me in all scenario, below what i did according what you suggested.

CREATE INDEX idx_score ON default
(DISTINCT ARRAY
(DISTINCT ARRAY s.SCORE AND s.CATEGORY_ID FOR s IN c.SENTIMENTS END)
FOR c IN CONTENTS END) ;

OR
CREATE INDEX idx_score ON default
(DISTINCT ARRAY
(DISTINCT ARRAY s FOR s IN c.SENTIMENTS END)
FOR c IN CONTENTS END) ;

below is the EXPLAIN:

explain select s.SCORE from default
unnest CONTENTS c
unnest c.SENTIMENTS as s
where s.CATEGORY_ID IN [4015];

{
“plan”: {
“#operator”: “Sequence”,
“~children”: [
{
“#operator”: “PrimaryScan”,
“index”: “#primary”,
“keyspace”: “default”,
“namespace”: “default”,
“using”: “gsi”
},

EVEN THUG IF I DID SELECT * WITH s.CATEGORY_ID=4015,GIVE THE SAME.
addon on this ,what if i want to include other attributes in create index,like

PERSON_ID,SOURCE_ID from parent array.

Can i create index like below:

CREATE INDEX idx_score_adv ON default (DISTINCT ARRAY (DISTINCT ARRAY s.SCORE AND s.CATEGORY_ID FOR s IN c.SENTIMENTS END)
FOR c IN CONTENTS END,PERSON_ID,SURVEY_TYPE,LOCATION_ID,SOURCE_ID) WHERE MENTION_TIME>=‘2001-01-01’;

Thank you.

keshav_m · August 29, 2016, 12:28am

Hi @rkumar, please post a FULL example document here.

Topic		Replies	Views
Array Covering index with UNNEST and condition SQL++ n1ql , index	30	4450	April 16, 2018
Indexes for n1QL queries using UNNEST and how to make it covered SQL++ n1ql , index	1	1331	June 6, 2018
Covering index not working (or I did something wrong) SQL++	3	718	September 5, 2019
Covering index for nested array SQL++ index	10	2379	June 15, 2019
Create a covering index for two array sub-items SQL++ index	1	689	January 25, 2022

Covering Index with performance using select query

Related topics