Details
-
Type:
Bug
-
Status:
Closed
-
Priority:
Critical
-
Resolution: Cannot Reproduce
-
Affects Version/s: 2.0
-
Fix Version/s: 2.0.1
-
Component/s: ns_server, view-engine
-
Security Level: Public
-
Labels:None
-
Environment:Hidebuild 1865
<manifest><remote name="couchbase" fetch="git://10.1.1.210/"/><remote name="membase" fetch="git://10.1.1.210/"/><remote name="apache" fetch="git://github.com/apache/"/><remote name="erlang" fetch="git://github.com/erlang/"/><default remote="couchbase" revision="master"/><project name="tlm" path="tlm" revision="ab70f6d42f46621ec576889e57cb37ac2d64a84b"><copyfile src="Makefile.top" dest="Makefile"/></project><project name="bucket_engine" path="bucket_engine" revision="70b3624abc697b7d18bf3d57f331b7674544e1e7"/><project name="ep-engine" path="ep-engine" revision="10b593cf4d97eaf062a6076878c5f8000d093ee9"/><project name="libconflate" path="libconflate" revision="2cc8eff8e77d497d9f03a30fafaecb85280535d6"/><project name="libmemcached" path="libmemcached" revision="ca739a890349ac36dc79447e37da7caa9ae819f5" remote="membase"/><project name="libvbucket" path="libvbucket" revision="00d3763593c116e8e5d97aa0b646c42885727398"/><project name="membase-cli" path="membase-cli" revision="7fe4121e7e83952a4cb032e25a2cb9fca1709354" remote="membase"/><project name="memcached" path="memcached" revision="06ab906e6702917c4b6b90a6b0051644719a357d" remote="membase"/><project name="moxi" path="moxi" revision="52a5fa887bfff0bf719c4ee5f29634dd8707500e"/><project name="ns_server" path="ns_server" revision="82c7c95e33445edcc6665e64216983e1027756e1"/><project name="portsigar" path="portsigar" revision="1bc865e1622fb93a3fe0d1a4cdf18eb97ed9d600"/><project name="sigar" path="sigar" revision="63a3cd1b316d2d4aa6dd31ce8fc66101b983e0b0"/><project name="couchbase-examples" path="couchbase-examples" revision="21e6161a1d064979b5c6aa99cd34ccc41c9d7aca"/><project name="couchbase-python-client" path="couchbase-python-client" revision="86b398e4fbc1f2e38d356e14df0c1bb4e3d2427b"/><project name="couchdb" path="couchdb" revision="69032c6dbcd64c056265416d2c49b1a9ee06f9c3"/><project name="couchdbx-app" path="couchdbx-app" revision="76d79be79c1454cff0f878d5a88a792270ec1b17"/><project name="couchstore" path="couchstore" revision="772f66a29a81be59d8bdaaa74b3898bb44fcc7e2"/><project name="geocouch" path="geocouch" revision="b0bd742551639c52030c070e5bf9390edbb536ba"/><project name="mccouch" path="mccouch" revision="88701cc326bc3dde4ed072bb8441be83adcfb2a5"/><project name="testrunner" path="testrunner" revision="503766d1695f9bc7c81f124dbb38a9a24ab0310f"/><project name="otp" path="otp" revision="b6dc1a844eab061d0a7153d46e7e68296f15a504" remote="erlang"/><project name="icu4c" path="icu4c" revision="26359393672c378f41f2103a8699c4357c894be7" remote="couchbase"/><project name="snappy" path="snappy" revision="5681dde156e9d07adbeeab79666c9a9d7a10ec95" remote="couchbase"/><project name="v8" path="v8" revision="447decb75060a106131ab4de934bcc374648e7f2" remote="couchbase"/><project name="gperftools" path="gperftools" revision="8f60ba949fb8576c530ef4be148bff97106ddc59" remote="couchbase"/><project name="pysqlite" path="pysqlite" revision="0ff6e32ea05037fddef1eb41a648f2a2141009ea" remote="couchbase"/></manifest>Showbuild 1865 <manifest><remote name="couchbase" fetch=" git://10.1.1.210/ "/><remote name="membase" fetch=" git://10.1.1.210/ "/><remote name="apache" fetch=" git://github.com/apache/ "/><remote name="erlang" fetch=" git://github.com/erlang/ "/><default remote="couchbase" revision="master"/><project name="tlm" path="tlm" revision="ab70f6d42f46621ec576889e57cb37ac2d64a84b"><copyfile src="Makefile.top" dest="Makefile"/></project><project name="bucket_engine" path="bucket_engine" revision="70b3624abc697b7d18bf3d57f331b7674544e1e7"/><project name="ep-engine" path="ep-engine" revision="10b593cf4d97eaf062a6076878c5f8000d093ee9"/><project name="libconflate" path="libconflate" revision="2cc8eff8e77d497d9f03a30fafaecb85280535d6"/><project name="libmemcached" path="libmemcached" revision="ca739a890349ac36dc79447e37da7caa9ae819f5" remote="membase"/><project name="libvbucket" path="libvbucket" revision="00d3763593c116e8e5d97aa0b646c42885727398"/><project name="membase-cli" path="membase-cli" revision="7fe4121e7e83952a4cb032e25a2cb9fca1709354" remote="membase"/><project name="memcached" path="memcached" revision="06ab906e6702917c4b6b90a6b0051644719a357d" remote="membase"/><project name="moxi" path="moxi" revision="52a5fa887bfff0bf719c4ee5f29634dd8707500e"/><project name="ns_server" path="ns_server" revision="82c7c95e33445edcc6665e64216983e1027756e1"/><project name="portsigar" path="portsigar" revision="1bc865e1622fb93a3fe0d1a4cdf18eb97ed9d600"/><project name="sigar" path="sigar" revision="63a3cd1b316d2d4aa6dd31ce8fc66101b983e0b0"/><project name="couchbase-examples" path="couchbase-examples" revision="21e6161a1d064979b5c6aa99cd34ccc41c9d7aca"/><project name="couchbase-python-client" path="couchbase-python-client" revision="86b398e4fbc1f2e38d356e14df0c1bb4e3d2427b"/><project name="couchdb" path="couchdb" revision="69032c6dbcd64c056265416d2c49b1a9ee06f9c3"/><project name="couchdbx-app" path="couchdbx-app" revision="76d79be79c1454cff0f878d5a88a792270ec1b17"/><project name="couchstore" path="couchstore" revision="772f66a29a81be59d8bdaaa74b3898bb44fcc7e2"/><project name="geocouch" path="geocouch" revision="b0bd742551639c52030c070e5bf9390edbb536ba"/><project name="mccouch" path="mccouch" revision="88701cc326bc3dde4ed072bb8441be83adcfb2a5"/><project name="testrunner" path="testrunner" revision="503766d1695f9bc7c81f124dbb38a9a24ab0310f"/><project name="otp" path="otp" revision="b6dc1a844eab061d0a7153d46e7e68296f15a504" remote="erlang"/><project name="icu4c" path="icu4c" revision="26359393672c378f41f2103a8699c4357c894be7" remote="couchbase"/><project name="snappy" path="snappy" revision="5681dde156e9d07adbeeab79666c9a9d7a10ec95" remote="couchbase"/><project name="v8" path="v8" revision="447decb75060a106131ab4de934bcc374648e7f2" remote="couchbase"/><project name="gperftools" path="gperftools" revision="8f60ba949fb8576c530ef4be148bff97106ddc59" remote="couchbase"/><project name="pysqlite" path="pysqlite" revision="0ff6e32ea05037fddef1eb41a648f2a2141009ea" remote="couchbase"/></manifest>
Description
-t viewquerytests.ViewQueryTests.test_employee_dataset_startkey_endkey_queries_rebalance_in,max-dupe-result-count=10,num_nodes_to_add=3,skip_rebalance=true,docs-per-day=1000
Rebalance out 4->3 is failed:
[rebalance:error,2012-10-18T19:39:33.308,ns_1@10.3.3.58:<0.26558.105>:ns_rebalancer:do_wait_buckets_shutdown:204]Failed to wait deletion of some buckets on some nodes: [{'ns_1@10.3.3.58',
{'EXIT',
{old_buckets_shutdown_wait_failed,
["default"]}}}]
[user:info,2012-10-18T19:39:33.311,ns_1@10.3.3.58:<0.4365.96>:ns_orchestrator:handle_info:319]Rebalance exited with reason {buckets_shutdown_wait_failed,
[{'ns_1@10.3.3.58',
{'EXIT',
{old_buckets_shutdown_wait_failed,
["default"]}}}]}
logs:
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.58-diag.txt.gz
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.64-diag.txt.gz
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.68-diag.txt.gz
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.71-diag.txt.gz
Rebalance out 4->3 is failed:
[rebalance:error,2012-10-18T19:39:33.308,ns_1@10.3.3.58:<0.26558.105>:ns_rebalancer:do_wait_buckets_shutdown:204]Failed to wait deletion of some buckets on some nodes: [{'ns_1@10.3.3.58',
{'EXIT',
{old_buckets_shutdown_wait_failed,
["default"]}}}]
[user:info,2012-10-18T19:39:33.311,ns_1@10.3.3.58:<0.4365.96>:ns_orchestrator:handle_info:319]Rebalance exited with reason {buckets_shutdown_wait_failed,
[{'ns_1@10.3.3.58',
{'EXIT',
{old_buckets_shutdown_wait_failed,
["default"]}}}]}
logs:
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.58-diag.txt.gz
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.64-diag.txt.gz
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.68-diag.txt.gz
http://qa.hq.northscale.net/job/centos-64-2.0-views-regression/29/artifact/logs/testrunner-12-Oct-17_23-58-20/483c326b-c52d-4555-8fc2-482a16dca098-10.3.3.71-diag.txt.gz
[ns_server:debug,2012-10-18T19:38:41.542,ns_1@10.3.3.58:'ns_memcached-default':ns_memcached:terminate:686]Proceeding into vbuckets dbs deletions
And then next message from ns_memcached is:
[ns_server:info,2012-10-18T19:39:55.646,ns_1@10.3.3.58:'ns_memcached-default':ns_storage_conf:delete_couch_database:431]Deleting database <<"default/master">>: ok
You can see that it's over 60 seconds later. There's no significant work between those two calls except actual master db deletion. So that's weirdness. And that delay about bucket deletion taking so much time is actually causing that rebalance breakage.
I.e. previous cleanup failed to complete bucket's deletion and thus next test that starts with rebalance fails due to timeout waiting for old bucket's shutdown.
[user:info,2012-10-18T19:39:13.277,ns_1@10.3.3.58:<0.4365.96>:ns_orchestrator:idle:492]Starting rebalance, KeepNodes = ['ns_1@10.3.3.58'], EjectNodes = ['ns_1@10.3.3.64',
'ns_1@10.3.3.68',
'ns_1@10.3.3.71']