Retrieving missed in cache keys performance

osblinnikov · June 12, 2013, 11:14pm

Hi all,
I’m evaluating Couchbase right now and I’m not very satisfied with the getBulk performance results for values stored in Hard Drive.
The tests are performed on the machine with 10 Gb Ram.
I’m trying to increase the speed of retrieving 1200 keys from 60 Gb bucket stored mostly in Hard Drive.
Retrieving keys which are missed in cache decreasing from 300-400 down to 30-40 keys per second with increasing size of bucket. one value is approximately 6000 bytes.
Am I correct that performance of retrieving for missed in cache keys should not decrease significantly with increasing of total keys number in bucket?
What could you suggest to increase speed of fetching data from HDD/SSD?
Thank you!
OB

househippo · June 15, 2013, 11:00pm

Are you having problems with just with getting the documents the first time you ask for the keys from the hard drive only? Do you get faster result the second time you call them from memory?
What method are you using to get the data from CB? SDK ? memcached protocal?

osblinnikov · June 16, 2013, 11:00pm

Yes, I have problems with hard drive only. At the second time when I’m calling keys from memory it works fast. I do understand that Couchbase was designed as in-memory database, but when my data grows I can’t store all the data in memory.
Could you confirm/not confirm that the delays of retrieving from Hard Drive should not grow significantly with growing of total data size?
If you confirm I will try to identify my problems with hard drive.
I use getBulk method from CB Java client.

househippo · June 16, 2013, 11:04pm

Yes if you are seeing xx% of your calls for documents are coming from HD. Those calls will have delays no matter how big your cluster is.
If you are having concerns with cache misses and delay from HD. I would recommend you increase your working set of memory.
Go to this link it talks more about it. www.couchbase.com/docs/couchbase-manual-2.0/couchbase-bestpractice-sizing-ram.html
Here is a link that goes into how the working set works www.couchbase.com/docs/couchbase-manual-2.0/couchbase-introduction-architecture-ejection-eviction.html

tgrall · June 23, 2013, 11:11pm

Hello,
To add some information to HouseHippo comment, most of the time when Couchbase cluster starts to have cache misses with a high rate (based on your application) it is necessary to add more memory (to be able to deal with your working set). To achieve that you can do it in both ways

add more RAM Quota to your Bucket
and/or
add new nodes to your cluster (this has some benefits: your add more RAM but also distribute the read/write to more queues)
Regards

Topic		Replies	Views
Cache Misses Couchbase Server	4	2942	November 25, 2013
Can couchbase work if not all documents in DB can be stored in memory? Couchbase Server	8	2962	June 8, 2016
Understanding Bucket Health Metrics Couchbase Server	3	2375	October 17, 2014
15x slowness with retrieving keys from Couchbase bucket compare to our memcache setup Couchbase Server	0	699	December 26, 2018
Issue with data in cache Couchbase Server	11	4771	October 26, 2015

Retrieving missed in cache keys performance

Related topics