Copy documents within the same bucket matching a condition

While copying 15000 documents that match a condition, and at the same time trying to update two attribute values using the N1QL query below, I get a "the query was canceled" exception. I am using the .NET SDK.
I want to modify two attributes (id, scenarioId) in each newly copied document.

INSERT INTO Schedules (KEY x.id, VALUE x)
SELECT OBJECT_PUT(Y, "scenarioId", newScenarioId) AS x
FROM (SELECT RAW OBJECT_PUT(d, "id", newScheduleId)
      FROM Schedules AS d USE KEYS $oldScheduleId
      LET newScheduleId = $newScheduleId) AS Y
LET newScenarioId = $newScenarioId;

var copySchedulesQueryResults = await cluster.QueryAsync<dynamic>(
    copySchedulesQuery, options => options
        .Parameter("oldScheduleId", (string)scheduleId)
        .Parameter("newScheduleId", newScheduleId.ToString())
        .Parameter("newScenarioId", newScenarioId.ToString()));

What else can be done to quickly make copies of such documents?

If you already know the keys, why not use a KV get and a KV insert, changing the document in the SDK?
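For example, here is a minimal sketch of that KV approach with the .NET SDK 3.x; the connection details, the Schedules bucket/collection layout, and the helper function name are assumptions based on this thread, not code from it:

using Couchbase;
using Couchbase.KeyValue;
using Newtonsoft.Json.Linq;

// Hypothetical sketch: copy one schedule document via a KV get + insert.
var cluster = await Cluster.ConnectAsync("couchbase://localhost", "user", "password");
var bucket = await cluster.BucketAsync("Schedules");
// Adjust this if your schedules live in a named collection rather than the default one.
var collection = bucket.DefaultCollection();

async Task CopyScheduleAsync(string oldScheduleId, string newScheduleId, string newScenarioId)
{
    // Read the existing document.
    var getResult = await collection.GetAsync(oldScheduleId);
    var doc = getResult.ContentAs<JObject>();

    // Change the two attributes in memory.
    doc["id"] = newScheduleId;
    doc["scenarioId"] = newScenarioId;

    // Write the copy under the new key.
    await collection.InsertAsync(newScheduleId, doc);
}

await CopyScheduleAsync("oldScheduleId1", Guid.NewGuid().ToString(), Guid.NewGuid().ToString());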

Here is a simplified version of your query. It may not fix the error itself:

INSERT INTO Schedules(KEY x.id, VALUE x)
SELECT OBJECT_CONCAT(d, {"id":$newScheduleId, "scenarioId": $newScenarioId}) AS x
FROM Schedules AS d USE KEYS $oldScheduleId ;

You can also do it in one query by sending more than one key/value pair as an object:


INSERT INTO Schedules(KEY x.id, VALUE x)
SELECT OBJECT_CONCAT(d, $obj.[META(d).id]) AS x
FROM Schedules AS d USE KEYS OBJECT_NAMES($obj);

$obj = {"oldScheduleId1":{"id":"newScheduleId1", "scenarioId": "newScenarioId1"},
        "oldScheduleId2":{"id":"newScheduleId2", "scenarioId": "newScenarioId2"},
         ......
       }

Could you please explain how the query below works? Does it mean I have to build $obj through a N1QL query?
SELECT OBJECT_CONCAT(d, $obj.[META(d).id]) AS x
FROM Schedules AS d USE KEYS OBJECT_NAMES($obj);

Not through N1QL; you need to construct it yourself in the SDK.

You mentioned 15000 documents. Are you doing a loop and calling 15000 queries?
Instead, construct a single object with 15000 fields, each mapping an old key to the new values you want to substitute, and call the query once (see the sketch below).
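As an illustration, a minimal sketch of building that $obj in C# and passing it as a named parameter; the oldScheduleIds list of matching keys, the newScenarioId Guid, and the cluster variable are assumed from the rest of the thread:

// Hypothetical sketch: one query for all copies.
// $obj maps each old schedule key to the new "id"/"scenarioId" values.
var obj = new Dictionary<string, object>();
foreach (var oldScheduleId in oldScheduleIds)
{
    obj[oldScheduleId] = new
    {
        id = Guid.NewGuid().ToString(),
        scenarioId = newScenarioId.ToString()
    };
}

var copyAllQuery =
    "INSERT INTO Schedules (KEY x.id, VALUE x) " +
    "SELECT OBJECT_CONCAT(d, $obj.[META(d).id]) AS x " +
    "FROM Schedules AS d USE KEYS OBJECT_NAMES($obj)";

var copyAllResult = await cluster.QueryAsync<dynamic>(
    copyAllQuery, options => options.Parameter("obj", obj));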

var copySchedulesQueryResults = await cluster.QueryAsync<dynamic>(
    copySchedulesQuery, options => options
        .Parameter("oldScheduleId", (string)scheduleId)
        .Parameter("newScheduleId", newScheduleId.ToString())
        .Parameter("newScenarioId", newScenarioId.ToString()));

Or post the query that matches the 15000 documents, and how you are generating the new values newScheduleId and newScenarioId.

@vsr1, @btburnett3 Here is the entire query/logic pasted below, and yes, I do loop through the 15000 individual IDs to copy each old schedule to a new one.

var getScheduleQuery = "SELECT RAW d.id AS scheduleIds FROM SomeBucket.Schedules AS d WHERE d.type = $type AND d.scenarioId = $oldScenarioId";
var getSchedulesResult = await cluster.QueryAsync<dynamic>(
    getScheduleQuery, options => options
        .Parameter("type", "Schedule")
        .Parameter("oldScenarioId", oldScenarioId));
var scheduleIds = await getSchedulesResult.Rows.ToListAsync();
foreach (var scheduleId in scheduleIds)
{
    var newScheduleId = Guid.NewGuid();
    // First replace id with the new schedule id, then change scenarioId to the new scenario id.
    var copySchedulesQuery = "INSERT INTO SomeBucket.Schedules(KEY x.id, VALUE x) " +
        "SELECT OBJECT_PUT(Y, \"scenarioId\", newScenarioId) AS x " +
        "FROM (SELECT RAW OBJECT_PUT(d, \"id\", newScheduleId) " +
        "FROM SomeBucket.Schedules AS d USE KEYS $oldScheduleId LET newScheduleId = $newScheduleId) AS Y " +
        "LET newScenarioId = $newScenarioId";
    var copySchedulesQueryResults = await cluster.QueryAsync<dynamic>(
        copySchedulesQuery, options => options
            .Parameter("oldScheduleId", (string)scheduleId)
            .Parameter("newScheduleId", newScheduleId.ToString())
            .Parameter("newScenarioId", newScenarioId.ToString()));
}

var newScheduleId = Guid.NewGuid();

What is newScenarioId?

My bad, newScenarioId is a new Guid passed as a parameter to the function.

With the following query, all matching documents are copied and inserted as new documents in a single statement (no loop needed).
If required, increase the query timeout; a .NET SDK sketch follows the query.
If this is a one-time operation, you can run it from the Query Workbench.

INSERT INTO Schedules(KEY x.id, VALUE x)
    SELECT OBJECT_CONCAT(s, {"id":UUID(), "scenarioId": $newScenarioId}) AS x
    FROM Schedules AS s
    WHERE s.type = "Schedule" AND s.scenarioId = $oldScenarioId;
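For reference, here is a minimal sketch of running this bulk statement from the .NET SDK with a raised query timeout; the SomeBucket.Schedules keyspace, the two-minute timeout value, and reuse of the cluster/oldScenarioId/newScenarioId variables from the code above are assumptions, not something prescribed in the thread:

// Hypothetical sketch: one bulk INSERT ... SELECT instead of 15000 per-document queries.
var bulkCopyQuery =
    "INSERT INTO SomeBucket.Schedules (KEY x.id, VALUE x) " +
    "SELECT OBJECT_CONCAT(s, {\"id\": UUID(), \"scenarioId\": $newScenarioId}) AS x " +
    "FROM SomeBucket.Schedules AS s " +
    "WHERE s.type = \"Schedule\" AND s.scenarioId = $oldScenarioId";

var bulkCopyResult = await cluster.QueryAsync<dynamic>(
    bulkCopyQuery, options => options
        .Parameter("oldScenarioId", oldScenarioId)
        .Parameter("newScenarioId", newScenarioId.ToString())
        .Timeout(TimeSpan.FromMinutes(2))); // raise further if the statement still gets cancelled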

@vsr1 one doubt about the above query: since each matching "Schedule" already has these two attributes (id and scenarioId), when I use OBJECT_CONCAT(s, {"id": UUID(), "scenarioId": $newScenarioId}), will it not duplicate these attributes?

This works like a charm. @vsr1, you are a lifesaver. :)

OBJECT_CONCAT() replaces the value when the field is already present; when it is not present, it adds the field. So there is no duplication.

Thanks @vsr1. For some reason the query takes 4.6 s to execute in the console, but via the .NET SDK it takes 12 s, without any serialization, just to retrieve the results as dynamic. It's not the first time I've seen this; it happens in other examples as well.

Not sure why it took so long via the .NET SDK. Try the following to find the issue (a profiling sketch follows this list):

  1. Enable the request profile for the statement in question and analyze it.
  2. Force USE INDEX in the SELECT (the same index used in the Query Workbench console) and check again.
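For suggestion 1, a minimal sketch of requesting the profile from the .NET SDK; the choice of QueryProfile.Timings and the reuse of cluster/bulkCopyQuery and its parameters from the earlier sketch are assumptions, and the same profile can also be viewed in the Query Workbench:

using Couchbase.Query;

// Hypothetical sketch: ask the server to include per-operator timings,
// reusing the cluster, bulkCopyQuery and parameters from the sketch above.
var profiledResult = await cluster.QueryAsync<dynamic>(
    bulkCopyQuery, options => options
        .Parameter("oldScenarioId", oldScenarioId)
        .Parameter("newScenarioId", newScenarioId.ToString())
        .Profile(QueryProfile.Timings));

// Drain the rows so the trailing response metadata (including the profile) is read.
await foreach (var row in profiledResult.Rows) { }

// If the server returned a profile, it is exposed on the result metadata.
Console.WriteLine(profiledResult.MetaData?.Profile);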