<!-- 
RSS generated by JIRA (5.2.4#845-sha1:c9f4cc41abe72fb236945343a1f485c2c844dac9) at Tue May 21 18:19:31 CDT 2013

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary, append field=key&field=summary to the URL of your request:
http://www.couchbase.com/issues/si/jira.issueviews:issue-xml/MB-7382/MB-7382.xml?field=key&field=summary
-->
<rss version="0.92" >
<channel>
    <title>Couchbase</title>
    <link>http://www.couchbase.com/issues</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>5.2.4</version>
        <build-number>845</build-number>
        <build-date>26-12-2012</build-date>
    </build-info>

<item>
            <title>[MB-7382] rebalance froze when node failed over and added back (observed mem used &gt; high water mark for bucket)</title>
                <link>http://www.couchbase.com/issues/browse/MB-7382</link>
                <project id="10010" key="MB">Couchbase Server</project>
                        <description>- 2 node cluster&lt;br/&gt;
- 2 buckets&lt;br/&gt;
- Bucket &amp;#39;bkt&amp;#39; had a very high percentage of sets in its front end load.&lt;br/&gt;
- Failed over ec2-54-252-25-132.ap-southeast-2.compute.amazonaws.com, added it back, and started a rebalance.&lt;br/&gt;
- Rebalance froze at around 98%.&lt;br/&gt;
- Stopped front end loads, disk write queue drained.&lt;br/&gt;
- Mem used on both nodes was greater than the high water mark.&lt;br/&gt;
- Restarted Couchbase Server, waited for warmup to complete, and retried the rebalance; it remained frozen at 50%.&lt;br/&gt;
- Rebooted the nodes, waited for warmup to complete, and retried the rebalance; it remained frozen at 50%.&lt;br/&gt;
&lt;br/&gt;
Cluster diags:&lt;br/&gt;
1 &lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-7382/ec2-54-252-25-132.ap-southeast-2.compute.amazonaws.com-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-7382/ec2-54-252-25-132.ap-southeast-2.compute.amazonaws.com-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
2 &lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-7382/ec2-54-252-20-171.ap-southeast-2.compute.amazonaws.com-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-7382/ec2-54-252-20-171.ap-southeast-2.compute.amazonaws.com-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
Attached the cbstats all and raw memory output for both nodes.</description>
                <environment>&lt;a href=&quot;http://builds.hq.northscale.net/latestbuilds/couchbase-server-enterprise_x86_64_2.0.0-1976-rel.deb.manifest.xml&quot;&gt;http://builds.hq.northscale.net/latestbuilds/couchbase-server-enterprise_x86_64_2.0.0-1976-rel.deb.manifest.xml&lt;/a&gt;&lt;br/&gt;
12.04 Ubuntu LTS ec2</environment>
            <key id="21218">MB-7382</key>
            <summary>rebalance froze when node failed over and added back (observed mem used &gt; high water mark for bucket)</summary>
                <type id="1" iconUrl="http://www.couchbase.com/issues/images/icons/issuetypes/bug.png">Bug</type>
                                <priority id="3" iconUrl="http://www.couchbase.com/issues/images/icons/priorities/major.png">Major</priority>
                    <status id="5" iconUrl="http://www.couchbase.com/issues/images/icons/statuses/resolved.png">Resolved</status>
                    <resolution id="2">Won&apos;t Fix</resolution>
                    <security id="10011">Public</security>
                        <assignee username="abhinav">Abhinav Dangeti</assignee>
                                <reporter username="abhinav">Abhinav Dangeti</reporter>
                        <labels>
                    </labels>
                <created>Sun, 9 Dec 2012 01:55:47 -0600</created>
                <updated>Mon, 10 Dec 2012 19:48:27 -0600</updated>
                    <resolved>Mon, 10 Dec 2012 18:49:42 -0600</resolved>
                            <version>2.0</version>
                                <fixVersion>2.0.1</fixVersion>
                                <component>couchbase-bucket</component>
                                <votes>0</votes>
                        <watches>0</watches>
                                                    <comments>
                    <comment id="45677" author="chiyoung" created="Sun, 9 Dec 2012 03:17:08 -0600"  >The load rate from clients was too high, which left the cluster heavily overloaded during rebalance. Large backlogs built up in the replication queues, which pushed memory usage for the bucket &amp;quot;bkt&amp;quot; above 90% of the bucket quota. When memory usage is above 90% of the bucket quota, replication and vbucket takeover stop.&lt;br/&gt;
&lt;br/&gt;
If we don&amp;#39;t set up the cluster with enough capacity, we can run into rebalance issues.&lt;br/&gt;
&lt;br/&gt;
Please set up the cluster with enough capacity.</comment>
                    <comment id="45678" author="chiyoung" created="Sun, 9 Dec 2012 03:19:50 -0600"  >Rebalance tests with two nodes are not a good fit for system tests. All of our customers use clusters of at least three nodes.</comment>
                    <comment id="45679" author="abhinav" created="Sun, 9 Dec 2012 04:30:09 -0600"  >This was not part of a system test. It was the cluster where I was checking the status of deleted items; I just tried failing over one of the nodes and adding it back.</comment>
                </comments>
                    <attachments>
                    <attachment id="15985" name="ec2-54-252-20-171.ap-southeast-2.compute.amazonaws.com.txt" size="13099" author="abhinav" created="Sun, 9 Dec 2012 01:55:47 -0600" />
                    <attachment id="15986" name="ec2-54-252-25-132.ap-southeast-2.compute.amazonaws.com.txt" size="13049" author="abhinav" created="Sun, 9 Dec 2012 01:55:47 -0600" />
                </attachments>
            <subtasks>
        </subtasks>
                <customfields>
                                                                        <customfield id="customfield_10180" key="com.atlassian.jira.ext.charting:firstresponsedate">
                <customfieldname>Date of First Response</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>Sun, 9 Dec 2012 03:17:08 -0600</customfieldvalue>

                </customfieldvalues>
            </customfield>
            <customfield id="customfield_10081" key="com.pyxis.greenhopper.jira:gh-global-rank">
                <customfieldname>Rank</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>3456</customfieldvalue>
                </customfieldvalues>
            </customfield>
            <customfield id="customfield_10181" key="com.atlassian.jira.ext.charting:timeinstatus">
                <customfieldname>Time In Status</customfieldname>
                <customfieldvalues>
                    
                </customfieldvalues>
            </customfield>
                                                </customfields>
    </item>
</channel>
</rss>