IP address seems to have changed

gmiit · December 2, 2013, 11:41pm

Over the past couple of days, we’ve received 3 times an automatic email saying:
"IP address seems to have changed. Unable to listen on ‘ns_1@10.10.4.15’."
We’ve got a 4 node cluster, with static local IP, and all 3 alerts came from the same node. The cluster has been running for a couple of months already, with possibly an increase in traffic, but no new usage patterns.

In the logs, we see this related entry:
[ns_server:error,2013-12-01T15:58:58.861,ns_1@10.10.4.15:<0.5021.5975>:menelaus_web_alerts_srv:can_listen:345]Cannot listen due to nxdomain from inet:getaddr

The node seems to be marked as down by the other nodes for a short time, and then come back up to life, as if nothing happened.
What are the possible triggers for this error? Could it be from overload on the node?

Edit:
And additional message that happens just before the one above
[error_logger:error,2013-12-02T23:34:23.142,ns_1@10.64.4.162:error_logger<0.6.0>:ale_error_logger_handler:log_msg:76]Detected time forward jump (or too large erlang scheduling latency). Skipping 10 samples (or 8000 milliseconds) ({{1386027254845, #Ref<0.0.62.54885>}, {repeat,800, <0.16858.17>},{timer2,send,[<0.16858.17>,{cascade,minute,hour,4}]}})

tgrall · December 4, 2013, 4:09pm

Which operating system are you using? Are you on the cloud? (EC2)

May be you can force the IP address using the configuration and tools described here:
http://docs.couchbase.com/couchbase-manual-2.2/#couchbase-getting-started-hostnames
http://docs.couchbase.com/couchbase-manual-2.2/#handling-changes-in-ip-addresses

Regards
Tug
@tgrall

gmiit · December 4, 2013, 6:08pm

Thanks for the reply.
The servers are locally hosted in our internal network, with static IPs, which haven’t changed, on Centos 6.4. The nodes are already referenced by their IP as their name. We could use hostnames, but since the IPs are static I don’t think it’d make much of a difference.

TimMeade · December 7, 2013, 1:30pm

We are having this exact same issue also.

New cluster created last week. 6 nodes. All centos 6.4, static ip, private network, our datacenter.

Under the host name field at setup we put the IP 192.168.10.xxx for each node. Thinking the IP would be ok. Once the ip address change email started, we setup the hosts files to have all the ip and hostnames for the entire cluster.

Still getting the emails.

From research I don’t think I can change these to the actual hostname without redoing the cluster.

HELP! Thoughts? Our NOC freaks out every time the email comes in. About 10 a day from various nodes.

Thanks

Tim

alkondratenko · December 19, 2013, 6:36pm

Those checks can sometimes fail (false positively) if erlang is overloaded (i.e. with views and/or xdcr and/or smart clients torturing it with excessively frequent bucket metadata requests). So it can be an early sign of under-sized configuration or wrongly configured host (i.e. check recommendations for swappiness and disabling of transparent huge pages).

Another possibility is due to xdcr bug it’s possible to exhaust tcp ports space with ports “occupied” in TIME_WAIT state. In that state creating binding any socket (client or server) will not work. This condition is easy to verify by looking at netstat output and/or by examining server logs (you would see tons of eaddinuse errors in this case).

gmiit · December 19, 2013, 10:47pm

Thanks for the answer, I think you are right on the cause. The beam.smp process was quite busy (CPU and RAM intensive) at the time. We had quite a few XDCR streams going, which we consolidated to a single stream (per bucket), to a secondary cluster, where we do the XDCR stream splitting.
We also added a couple of nodes to the cluster, to be on the safe side.

Topic		Replies	Views
Couchbase 4.0 community "IP address seems to have changed. Unable to listen on 'ns_1@hostname'." Couchbase Server	4	2817	October 4, 2017
Couchbase started behaving slowing Kubernetes	4	1226	September 30, 2019
IP Address Seem to be be changed Kubernetes	4	2162	October 5, 2021
IP address seems to have changed. Unable to listen on 'ns_1@127.0.0.1' Couchbase Server	5	3349	March 2, 2020
Error message - IP address seems to have changed. Unable to listen on 'ns_1@10.xx.x.xxxx'. (Underlaying POSIX error code: 'nxdomain') Couchbase Server server	2	3225	August 14, 2019

IP address seems to have changed

Related topics