DCP drain rate extremely high

kshmir1 · January 15, 2016, 5:22am

Hello,

We’ve been having every now and then this issue, which reduces the stability of our cluster a lot…

Basically, the DCP drain rate of our cluster get’s increments slowly up to 10M, which is fairly high, all connected clients send an high amount of bandwidth (~1.5MBps each), and each node in the cluster (4) also sends a lot of traffic (~4MBps each, 12 in one, which is the server seems to be having the problem).

We have about 5GB of data.

Reducing the amount of replicas, reduces the constant amount of 10 to 6 for example… it looks like a bug in DCP and I don’t know how to stop it, it eventually always goes off, but it worries me since it also propagates to XDCR, which means an increased bandwidth bill.

kshmir1 · January 15, 2016, 5:24am

This is a more detailed graph showing how these values evolved.

kshmir1 · January 15, 2016, 5:28am

Also,

I did a rebalance yesterday morning after a planned maintenance from azure, but it shouldn’t interfere since this problem has happened at least three other times in the past.

kshmir1 · January 15, 2016, 5:46am

I’m glad to say you can close this, I found the issue was at the application level!

WillGardella · January 16, 2016, 9:12pm

@kshmir1 Just curious… what was the issue? Others might see the same behavior…

kshmir1 · January 17, 2016, 8:24pm

Sure!

We track information of users through a pixel, some visitors abused this and stored too many information. Just one pixel could make the document size go up so much that each request would make a whole lot of network time, none of our monitoring systems realized this was going on sadly.

The only indicator that showed this was an increased usage of network, and the replication rate in couchbase, just 5 or 6 documents were updated every some seconds, but we realized one of those must have been huge, we were right.

This was really hard to track, it took us some months, but thankfully it never made couchbase or our systems fail at all. We use node.js with express and the node sdk with couchbase, all hosted in azure.

WillGardella · January 17, 2016, 8:45pm

Cool, glad you tracked it down. Thanks for sharing the war story, always good to hear what people are up to.

Topic		Replies	Views
High intra-cluster xdcr bandwidth usage Couchbase Server 4.1	5	2764	January 14, 2016
XDCR performance drops dramatically after a period of time Couchbase Server	3	2011	July 10, 2013
/opt/couchbase/bin/goxdcr always consuming 20% CPU Couchbase Server	2	1421	October 6, 2017
Growing DCP Replication Items Remaining Couchbase Server	7	1688	December 14, 2020
Couchbase Server 6.5 is dropping my internet speed Couchbase Server connections , xdcr	3	1085	June 30, 2020

DCP drain rate extremely high

Related topics