Rebalance stuck at 0% and does not cancel

robert_hamon_ · June 3, 2016, 11:58pm

I have the exact same issue. I have a cluster of 18 nodes, and node 004 had a network issue that forced me to do a hard failover on it. After a reboot, and with the network link fixed, the node was added back with “delta-recovery” and a rebalance was started.
After 3 days of nothing progressing (stuck at rebalancing 0 nodes) I figured I needed to fix it myself before it came crashing down hard.

So I ran the following command on all the cluster nodes in parallel:
curl -X POST -u Administrator:Password http://localhost:8091/diag/eval --data 'erlang:halt().'
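For anyone wanting to script the "all nodes in parallel" part, here is a rough sketch. It assumes SSH access to every node and runs the same curl against localhost on each one, as in the command above. The host names, the NODES/CB_USER/CB_PASS/DRY_RUN variables and the halt_node/halt_all helpers are made up for illustration and are not part of Couchbase. By default it only prints what it would run.

```shell
#!/bin/sh
# Hypothetical host list: replace with the real node addresses.
NODES="${NODES:-node001 node002 node003}"
# Placeholder credentials, as in the post; substitute your own.
CB_USER="${CB_USER:-Administrator}"
CB_PASS="${CB_PASS:-Password}"
# DRY_RUN=1 (the default) prints the commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

# Send the diag/eval request to one node, over SSH, against its own localhost.
halt_node() {
    host="$1"
    if [ "$DRY_RUN" = "1" ]; then
        echo "ssh $host curl -X POST -u $CB_USER:*** http://localhost:8091/diag/eval --data 'erlang:halt().'"
    else
        # -n keeps the backgrounded ssh from reading our stdin.
        ssh -n "$host" "curl -s -X POST -u '$CB_USER:$CB_PASS' http://localhost:8091/diag/eval --data 'erlang:halt().'"
    fi
}

# Start one background job per node, then wait for all of them to finish.
halt_all() {
    for host in $NODES; do
        halt_node "$host" &
    done
    wait
}

halt_all
```

Run it once as-is to check the printed commands, then again with DRY_RUN=0 and the real node list. Since the jobs run in the background, the output order is not guaranteed.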

At that point the nodes were all in a standby state and, in the “data buckets” tab, all buckets had a yellow pie in the “data nodes” column.
I quickly hard-failed over my problematic node 004, and the cluster and all buckets went back to ready in less than a minute. I have over 9 billion items in there, so for sure no “warm up” happened.

So I rebooted node 004 again and once it came back up, I tried a full recovery this time.
Rebalance is now in progress (I can see it progressing) and I’ll update later on success or failure.
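Rebalance progress can also be watched from the command line instead of the UI. The sketch below assumes the REST endpoint /pools/default/rebalanceProgress on port 8091 and a JSON reply containing a "status" field (for example "none" or "running"). The rebalance_status and fetch_progress helpers and the CB_USER/CB_PASS variables are hypothetical names, and the sed expression is a quick extraction, not a real JSON parser.

```shell
#!/bin/sh
# Placeholder credentials; substitute your own.
CB_USER="${CB_USER:-Administrator}"
CB_PASS="${CB_PASS:-Password}"

# Read a rebalanceProgress JSON document on stdin and print its "status" value.
rebalance_status() {
    sed -n 's/.*"status"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}

# Fetch the raw progress document from the local node (needs a running cluster).
fetch_progress() {
    curl -s -u "$CB_USER:$CB_PASS" http://localhost:8091/pools/default/rebalanceProgress
}

# Example, run on a cluster node:
#   fetch_progress | rebalance_status
```

Wrapping the example line in a loop with a sleep between calls gives a simple way to notice when the status stops being "running".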

Update:
I’m not sure when the rebalance finished, but the cluster is now healthy with all 18 nodes in.
