100% IO utilization during compaction and long request latencies

Hi all,
I am querying Couchbase over the memcached protocol via the moxi proxy.
My environment has 24 CPU cores, 196 GB of memory, and an NVMe SSD.
During compaction, iostat -dx 1 reports the following:

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
nvme0n1           0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
nvme1n1           0.00     0.00  216.00 28496.00  3236.00 113984.00     8.17   302.07    1.75    0.85    1.76   0.01  21.20
sda               0.00     6.00    0.00   37.00     0.00   172.00     9.30     0.06    1.51    0.00    1.51   0.05   0.20
sdc               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
nvme0n1           0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
nvme1n1           0.00     0.00   11.00 47315.00    48.00 189260.00     8.00 17794.98  284.04   69.00  284.09   0.02  99.90
sda               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
sdc               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
nvme0n1           0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
nvme1n1           0.00     0.00    4.00 63488.00    32.00 253952.00     8.00 12103.99  246.37  225.50  246.37   0.02 100.00
sda               0.00     0.00    0.00    3.00     0.00    16.00    10.67     0.00    0.00    0.00    0.00   0.00   0.00
sdc               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
nvme0n1           0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
nvme1n1           0.00     0.00    7.00 98304.00    32.00 393216.00     8.00 13535.54  128.20  143.43  128.20   0.01 100.00
sda               0.00    12.00    1.00    8.00     8.00    84.00    20.44     0.07    2.89   20.00    0.75   7.78   7.00
sdc               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await r_await w_await  svctm  %util
nvme0n1           0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
nvme1n1           0.00     0.00 1546.00 35234.00 82552.00 140936.00    12.15  4589.60  179.12    0.53  186.95   0.02  76.90
sda               0.00     0.00    0.00    1.00     0.00     0.00     0.00     0.01   52.00    0.00   52.00   8.00   0.80
sdc               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00    0.00    0.00   0.00   0.00

Because IO utilization is pinned at 100%, some read and write requests take a very long time, up to 3 seconds, which is unacceptable to our users.

I want to know how to limit IO usage during compaction.

Any help would be appreciated.

What Couchbase version are you using? In 4.5.1 we reduced the default number of parallel compactors (see the discussion on Disk write spikes & performance degradation), which might help you.
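
If your build already includes that setting, one way to inspect or change it per node is the unsupported /diag/eval REST endpoint, which evaluates an Erlang expression inside the cluster manager. A minimal sketch, assuming the default admin port 8091 and placeholder credentials Administrator:password:

# Read the current value; read_key_fast falls back to the default of 1 when the key is unset.
curl -u Administrator:password -X POST http://localhost:8091/diag/eval -d 'ns_config:read_key_fast(compaction_number_of_kv_workers, 1).'

# Pin the number of parallel KV compaction workers to 1.
curl -u Administrator:password -X POST http://localhost:8091/diag/eval -d 'ns_config:set(compaction_number_of_kv_workers, 1).'

Be careful with /diag/eval, since it runs arbitrary code, and note the key only has an effect on versions that actually ship it.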

@drigby Hi, I compiled the project from source. I used repo to sync the released version 4.6.1, but it shows 4.0.0-2038 Community Edition (build-2038) in both the admin UI and the SDK's output.
OK, I will check the changes in the newer version.
Thanks for your help.

I checked the source code, and the default compaction_number_of_kv_workers config value is 1:

NWorkers = ns_config:read_key_fast(compaction_number_of_kv_workers, 1),
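%% read_key_fast/2 returns the stored ns_config value for the key, or the
%% second argument (here 1) as the default when the key has never been set.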

How can I limit the compaction speed when there is only 1 compaction worker?