<!-- 
RSS generated by JIRA (5.2.4#845-sha1:c9f4cc41abe72fb236945343a1f485c2c844dac9) at Sat May 25 02:29:11 CDT 2013

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary add field=key&field=summary to the URL of your request.
For example:
http://www.couchbase.com/issues/si/jira.issueviews:issue-xml/MB-6649/MB-6649.xml?field=key&field=summary
-->
<rss version="0.92" >
<channel>
    <title>Couchbase</title>
    <link>http://www.couchbase.com/issues</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>5.2.4</version>
        <build-number>845</build-number>
        <build-date>26-12-2012</build-date>
    </build-info>

<item>
            <title>[MB-6649] beam.smp memory usage grows to 2 GB when xdcr feature is enabled and rebalancing is in progress</title>
                <link>http://www.couchbase.com/issues/browse/MB-6649</link>
                <project id="10010" key="MB">Couchbase Server</project>
                        <description>- Created default buckets on a 2:2 cluster&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;[10.1.3.235, 10.1.3.236] : [10.1.3.237, 10.1.3.238]&lt;br/&gt;
- Set up bidirectional replication for the bucket, ran load on both the buckets.&lt;br/&gt;
- Swap rebalanced a node on both clusters&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;[10.1.3.235, 10.3.2.54] : [10.1.3.237, 10.3.2.55]&lt;br/&gt;
- Upon completion of rebalance, stopped load on default buckets.&lt;br/&gt;
- Created standard buckets on both the clusters.&lt;br/&gt;
- Set up unidirectional replication for the standard bucket from cluster 1 to cluster 2, ran load on cluster 1.&lt;br/&gt;
- Stopped load after a point.&lt;br/&gt;
- With replication still in progress, rebalanced in the removed nodes on each cluster (to make it 3:3)&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;[10.1.3.235, 10.3.2.54, 10.1.3.236] : [10.1.3.237, 10.3.2.55, 10.1.3.238]&lt;br/&gt;
- During rebalance, no load is running on either cluster; however, replication is still in progress.&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Heavy swap on the orchestrators of both the clusters&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;- Erlang using up a lot of memory ( &amp;gt; 2.5G )&lt;br/&gt;
- Rebalance gradually completed on cluster 2.&lt;br/&gt;
- Rebalance fails on cluster 1:&lt;br/&gt;
&lt;br/&gt;
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - &lt;br/&gt;
Rebalance exited with reason {{badmatch,{error,timeout}},&lt;br/&gt;
{gen_server,call,&lt;br/&gt;
[{&amp;#39;ns_memcached-bucket&amp;#39;,&amp;#39;&lt;a href=&apos;mailto:ns_1@10.1.3.235&apos;&gt;ns_1@10.1.3.235&lt;/a&gt;&amp;#39;},&lt;br/&gt;
{get_vbucket,835},&lt;br/&gt;
60000]}}&lt;br/&gt;
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - &lt;br/&gt;
&lt;br/&gt;
- If rebalance is retried, it fails again:&lt;br/&gt;
&lt;br/&gt;
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - &lt;br/&gt;
Rebalance exited with reason {not_all_nodes_are_ready_yet,[&amp;#39;&lt;a href=&apos;mailto:ns_1@10.1.3.235&apos;&gt;ns_1@10.1.3.235&lt;/a&gt;&amp;#39;]}&lt;br/&gt;
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - &lt;br/&gt;
&lt;br/&gt;
- This is probably because 10.1.3.235 is in the &amp;quot;PEND&amp;quot; state on the UI.&lt;br/&gt;
&lt;br/&gt;
- Uploaded the grabbed diags to S3:&lt;br/&gt;
&lt;br/&gt;
&lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.235-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.235-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.236-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.236-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-6649/10.3.2.54-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-6649/10.3.2.54-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.237-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.237-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.238-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-6649/10.1.3.238-8091-diag.txt.gz&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://s3.amazonaws.com/bugdb/MB-6649/10.3.2.55-8091-diag.txt.gz&quot;&gt;https://s3.amazonaws.com/bugdb/MB-6649/10.3.2.55-8091-diag.txt.gz&lt;/a&gt;</description>
                <environment>2.0.0-1721-rel&lt;br/&gt;
Centos 4G RAM 64-bit machines&lt;br/&gt;
1024 vbuckets&lt;br/&gt;
</environment>
            <key id="19748">MB-6649</key>
            <summary>beam.smp memory usage grows to 2 GB when xdcr feature is enabled and rebalancing is in progress</summary>
                <type id="1" iconUrl="http://www.couchbase.com/issues/images/icons/issuetypes/bug.png">Bug</type>
                                <priority id="2" iconUrl="http://www.couchbase.com/issues/images/icons/priorities/critical.png">Critical</priority>
                    <status id="6" iconUrl="http://www.couchbase.com/issues/images/icons/statuses/closed.png">Closed</status>
                    <resolution id="5">Cannot Reproduce</resolution>
                    <security id="10011">Public</security>
                        <assignee username="abhinav">Abhinav Dangeti</assignee>
                                <reporter username="abhinav">Abhinav Dangeti</reporter>
                        <labels>
                        <label>2.0-beta-release-notes</label>
                    </labels>
                <created>Thu, 13 Sep 2012 16:10:27 -0500</created>
                <updated>Wed, 26 Sep 2012 12:08:45 -0500</updated>
                    <resolved>Wed, 19 Sep 2012 18:40:02 -0500</resolved>
                            <version>2.0-beta-2</version>
                                <fixVersion>2.0-beta-2</fixVersion>
                                <component>cross-datacenter-replication</component>
                                <votes>0</votes>
                        <watches>2</watches>
                                                    <comments>
                    <comment id="38773" author="junyi" created="Thu, 13 Sep 2012 16:25:31 -0500"  >It is good for QE to aggressively catch bugs, but IMHO it would be nice to investigate and test a little before simply dumping bugs on developers :-).&lt;br/&gt;
&lt;br/&gt;
For example, in this case, is the heavy memory and swap usage caused by XDCR or by rebalance? This is easy to verify: re-test without the rebalance and see whether the issue persists.&lt;br/&gt;
&lt;br/&gt;
Also, XDCR is known to consume resources at the destination (a few bugs have already been filed about it), so is this bug a duplicate of previous ones?</comment>
                    <comment id="38777" author="abhinav" created="Thu, 13 Sep 2012 16:35:15 -0500"  >This issue isn&amp;#39;t seen with XDCR alone, or for that matter rebalance alone.&lt;br/&gt;
Verified that with just rebalance: Memory usage is not high.&lt;br/&gt;
Verified that with just replication: Memory usage is not high.&lt;br/&gt;
&lt;br/&gt;
It&amp;#39;s the combination of the two that sometimes causes the heavy memory usage and swap.&lt;br/&gt;
&lt;br/&gt;
Also observed:&lt;br/&gt;
- Rebalance failed on cluster 1.&lt;br/&gt;
- After rebalance finished on Cluster 2, beam.smp&amp;#39;s resident memory usage is still at 1.1G (Replication is still going on)</comment>
                    <comment id="39277" author="junyi" created="Tue, 18 Sep 2012 21:01:02 -0500"  >Can you please verify with the latest build?</comment>
                    <comment id="39345" author="abhinav" created="Wed, 19 Sep 2012 18:40:02 -0500"  >Tried reproducing the same scenario on build 1744: beam.smp memory usage stayed between 300 and 500 MB, and rebalance completed successfully on both clusters.</comment>
                </comments>
                    <attachments>
                    <attachment id="15013" name="Screen Shot 2012-09-13 at 1.49.42 PM.png" size="190491" author="abhinav" created="Thu, 13 Sep 2012 16:10:27 -0500" />
                    <attachment id="15014" name="Screen Shot 2012-09-13 at 2.21.56 PM.png" size="130149" author="abhinav" created="Thu, 13 Sep 2012 16:22:59 -0500" />
                    <attachment id="15015" name="Screen Shot 2012-09-13 at 2.22.14 PM.png" size="160130" author="abhinav" created="Thu, 13 Sep 2012 16:22:59 -0500" />
                </attachments>
            <subtasks>
        </subtasks>
                <customfields>
                                                                        <customfield id="customfield_10180" key="com.atlassian.jira.ext.charting:firstresponsedate">
                <customfieldname>Date of First Response</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>Thu, 13 Sep 2012 16:25:31 -0500</customfieldvalue>

                </customfieldvalues>
            </customfield>
                                                                                                                                                                                                            <customfield id="customfield_10081" key="com.pyxis.greenhopper.jira:gh-global-rank">
                <customfieldname>Rank</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>3972</customfieldvalue>
                </customfieldvalues>
            </customfield>
                                                                                                                                                                                        <customfield id="customfield_10181" key="com.atlassian.jira.ext.charting:timeinstatus">
                <customfieldname>Time In Status</customfieldname>
                <customfieldvalues>
                    
                </customfieldvalues>
            </customfield>
                                                </customfields>
    </item>
</channel>
</rss>