[MB-5293] rebalance failed with "gen_fsm,sync_send_event," error during online upgrade from 1.7.2 to 1.8.1 on windows ( workaround is to wait 30 seconds between add_node and starting the rebalance ) Created: 11/May/12  Updated: 09/Jan/13  Resolved: 12/May/12

Status: Closed
Project: Couchbase Server
Component/s: ns_server
Affects Version/s: 1.8.1
Fix Version/s: 1.8.1
Security Level: Public

Type: Bug Priority: Major
Reporter: Thuan Nguyen Assignee: Aleksey Kondratenko
Resolution: Duplicate Votes: 0
Labels: 1.8.1-release-notes
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment: windows server 2008 R2 64 bit

Attachments: GZip Archive 10.3.121.182-8091-diag.txt.gz     GZip Archive 10.3.121.183-8091-diag.txt.gz     GZip Archive 10.3.121.200-8091-diag.txt.gz     GZip Archive 10.3.121.201-8091-diag.txt.gz     GZip Archive 10.3.121.202-8091-diag.txt.gz    

 Description   
Install membase server 1.7.2r-20 on 4 nodes windows server 2008 R2 64bit and couchbase server 1.8.0-810 on one node.
Create 4 nodes cluster of membase server 1.7.2.
Load 1000000 items to default bucket.
Add node 200 with couchbase server 1.8.1-810 to cluster.
Rebalance. Failed
Try rebalance one more time. Complete rebalance.

Error in log page
Server error during processing: ["web request failed",
{path,"/controller/rebalance"},
{type,exit},
{what,
{noproc,
{gen_fsm,sync_send_event,
[{global,ns_orchestrator},
{start_rebalance,
['ns_1@10.3.121.182',
'ns_1@10.3.121.183',
'ns_1@10.3.121.201',
'ns_1@10.3.121.202',
'ns_1@10.3.121.200'],
[],[]}]}}},
{trace,
[{gen_fsm,sync_send_event,2},
{menelaus_web,do_handle_rebalance,3},
{menelaus_web,loop,3},
{mochiweb_http,headers,5},
{proc_lib,init_p_do_apply,3}]}]

 Comments   
Comment by Aleksey Kondratenko [ 12/May/12 ]
Duplicated somewhere. You need to send rebalance request to 1.8.1 node or wait 10-15 seconds before asking older node for start rebalance.
Comment by Farshid Ghods (Inactive) [ 13/May/12 ]
MB-5108
Rolling upgrade from 172 to latest 181 fails with failed rebalance {type,exit}, {what,{noproc, {gen_fsm,sync_send_event,}}}
Comment by Dipti Borkar [ 13/May/12 ]
Trying rebalance again after waiting for 20 seconds works.
Generated at Fri Aug 22 03:34:21 CDT 2014 using JIRA 5.2.4#845-sha1:c9f4cc41abe72fb236945343a1f485c2c844dac9.