Is it normal that couchbase05 a doubled number of connections on port 11210 and a halved number of connection on port 9100 with respect to the other nodes?
Thanks
Service 'goxdcr' exited with status 1. Restarting. Messages: MetadataService 2017-02-23T12:41:46.072+01:00 [ERROR] metakv.ListAllChildren failed. path=/remoteCluster/, err=Get http://127.0.0.1:8091/_metakv/remoteCluster/: CBAuth database is stale. Was never updated yet., num_of_retry=2
MetadataService 2017-02-23T12:41:46.072+01:00 [ERROR] metakv.ListAllChildren failed. path=/remoteCluster/, err=Get http://127.0.0.1:8091/_metakv/remoteCluster/: CBAuth database is stale. Was never updated yet., num_of_retry=3
MetadataService 2017-02-23T12:41:46.072+01:00 [ERROR] metakv.ListAllChildren failed. path=/remoteCluster/, err=Get http://127.0.0.1:8091/_metakv/remoteCluster/: CBAuth database is stale. Was never updated yet., num_of_retry=4
RemoteClusterService 2017-02-23T12:41:46.072+01:00 [ERROR] Failed to get all entries, err=metakv failed for max number of retries = 5
[goport] 2017/02/23 12:41:46 /opt/couchbase/bin/goxdcr terminated: exit status 1
Hi Alessandro -
What version are you running? There is at least one known issue that looks like what you’re experiencing that was fixed in 4.5.0: MB-16568 .
You should also be able to fail over that node and re-add it and it should recover.
Just as an aside, you seem to have a lot of connections open on port 8093 - are you sure your applications are not leaking connections? Usually, the number of connections on 11210 per node would be about the same as the number of client objects, and that would also be about the same as the number of connections you have on port 8093.
Thank you for the reply
We are currently using version: 4.0.0-4051
We are considering upgrading to 4.5, what would be the best procedure for a cluster of 8 machines?
About to active connections, we are updating our service to have fewer active connections, in order to lighten the load on the cluster.
Is the failover operation heavy in terms of resources and time?
thank you
On how to upgrade, I recommend taking a look at the information here: https://developer.couchbase.com/documentation/server/current/install/upgrade.html and also look at the page called “Upgrade Options”. We recently re-wrote that to be easier to understand, so I hope it helps you decide which one is right for you.
Failover should not be a heavy operation, but you need to rebalance to bring back in the nodes, and rebalance is a heavier operation. Depending on your production use case, an offline upgrade can be faster. Some of our big customers would rather take the downtime than perform rebalances while the system is online.