I have the exact same issue… I have a cluster of 18 nodes and node 004 had a network issue that forced me to do a hard failover on it. After a reboot and the network link fixed, the node was added back with “delta-recovery” and a rebalance was started.
After 3 days of nothing progressing (stuck at rebalancing 0 nodes) I figured I needed to fix it myself before it came crashing down hard.
So I ran the following command on all the cluster nodes in parallel:
curl -X POST -u Administrator:Password http://localhost:8091/diag/eval --data ‘erlang:halt().’
At that point the nodes were all in a standby state and in the “data buckets” tabs, all buckets had a yellow pie in the “data nodes” column.
I quickly hard-failover my problematic node 004 and the cluster and all buckets went back to ready in less than a minute. I have over 9 billion items in there so for sure there was no “warm up” that happened.
So I rebooted node 004 again and once it came back up, I tried a full recovery this time.
Rebalance is now in progress (I can see it progressing) and I’ll update later on success or failure.
update
I’m not sure when the rebalance finished, but the cluster is now healthy with all 18 nodes in.