Hi,
Can we implement pagination in bulk get with java?
@ppliatsik can you provide a little more detail on what you need? Since a bulk get is done on a discrete set of document IDs, there is no “next page” to load from. Only you know the next set of keys.
Sorry @daschl, my question was not so clear!
I mean: can we implement pagination on the list of document IDs we provide to bulk get, rather than providing all the IDs we want up front? For example, I want to take the first 10 documents of the list, but the list will contain more IDs. But I think you answered my question!
Also, the bulk get does not check for duplicate document IDs in the list we provide, is this right?
@ppliatsik ah I see - so you mean you have a huge list and you want to load it “batch by batch” and not all at once - is that correct?
Is the list changing at runtime?
Yes, the list may have many IDs. No, I suppose the list will contain the up-to-date IDs.
@ppliatsik you could use RxJava operators to achieve that batching effect.
For example, the following code
import java.util.Arrays;
import java.util.List;
import rx.Observable;
import rx.functions.Action1;

List<String> ids = Arrays.asList("id1", "id2", "id3", "id4");

Observable
    .from(ids)
    .buffer(2)
    .subscribe(new Action1<List<String>>() {
        @Override
        public void call(List<String> strings) {
            System.out.println(strings);
        }
    });
Prints out
[id1, id2]
[id3, id4]
Optionally, you can zip it with an interval to “delay” each batch:
List<String> ids = Arrays.asList("id1", "id2", "id3", "id4");
Observable<Long> waitTime = Observable.interval(0, 1, TimeUnit.SECONDS);

Observable
    .from(ids)
    .buffer(2)
    .zipWith(waitTime, (strings, aLong) -> strings)
    .subscribe(System.out::println);
And then you can feed each batch into a new Observable and run a bulk get. The following example fetches the docs and prints them as they arrive:
Cluster cluster = CouchbaseCluster.create();
Bucket bucket = cluster.openBucket();

List<String> ids = Arrays.asList("id1", "id2", "id3", "id4");
Observable<Long> waitTime = Observable.interval(0, 1, TimeUnit.SECONDS);

Observable
    .from(ids)
    .buffer(2)
    .zipWith(waitTime, (strings, aLong) -> strings)
    .flatMap(Observable::from)
    .flatMap(id -> bucket.async().get(id))
    .toBlocking()
    .forEach(System.out::println);
Also, you are correct: this pattern of bulk get doesn’t remove duplicate keys (it handles your list as-is).
But once again you can use the operators RxJava provides: for that case you can chain distinct() after the from(...) (or simply use a Set as the collection of IDs from the get-go).
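Following the Set suggestion above, here is a minimal sketch of deduplicating the ID list before handing it to the bulk get (the class and method names are just for illustration, not part of any SDK). A LinkedHashSet drops duplicates while keeping the original order:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;

public class DedupIds {

    // Remove duplicate IDs while preserving their first-seen order.
    static List<String> dedup(List<String> ids) {
        return new ArrayList<>(new LinkedHashSet<>(ids));
    }

    public static void main(String[] args) {
        List<String> ids = Arrays.asList("id1", "id2", "id1", "id3");
        System.out.println(dedup(ids)); // [id1, id2, id3]
    }
}
```

The deduplicated list can then be fed into Observable.from(...) exactly as in the batching examples above; chaining distinct() after from(...) achieves the same effect inside the stream itself.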
The code really helped me understand how it works.
Thanks @daschl and @simonbasle!