<!-- 
RSS generated by JIRA (5.2.4#845-sha1:c9f4cc41abe72fb236945343a1f485c2c844dac9) at Sun May 19 07:07:18 CDT 2013

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary add field=key&field=summary to the URL of your request.
For example:
http://www.couchbase.com/issues/si/jira.issueviews:issue-xml/MB-5546/MB-5546.xml?field=key&field=summary
-->
<rss version="0.92" >
<channel>
    <title>Couchbase</title>
    <link>http://www.couchbase.com/issues</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>5.2.4</version>
        <build-number>845</build-number>
        <build-date>26-12-2012</build-date>
    </build-info>

<item>
            <title>[MB-5546] Increasing the default timeouts on ns_server to avoid rebalance failures due to ep-engine stats timeout issues in large cluster or clusters where some nodes are actively using swap</title>
                <link>http://www.couchbase.com/issues/browse/MB-5546</link>
                <project id="10010" key="MB">Couchbase Server</project>
                        <description>Related issues:- &lt;br/&gt;
&lt;a href=&quot;http://www.couchbase.com/issues/browse/MB-5360&quot; title=&quot;Rebalance failed due to timeout in gen_server,call &amp;#39;ns_memcached-default&amp;#39; &quot;&gt;&lt;strike&gt;MB-5360&lt;/strike&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;http://www.couchbase.com/issues/browse/MB-5352&quot; title=&quot;Rebalance failed due to shutdown gen_server call (Got error while trying to read close ack:{error,closed})&quot;&gt;&lt;strike&gt;MB-5352&lt;/strike&gt;&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
We have multiple bugs related to the timeouts we are hitting on ns_server :-&lt;br/&gt;
1) When in swap&lt;br/&gt;
2) On windows even on a small cluster. &lt;br/&gt;
&lt;br/&gt;
This bug is to recommend increasing the default timeouts. &lt;br/&gt;
&lt;br/&gt;
We used the following timeouts on most of the params, its not all in one solution, but hopefully would cover basic secnarios. &lt;br/&gt;
ns_memcached_outer, 60000&lt;br/&gt;
ns_memcached_open_checkpoint, 60000&lt;br/&gt;
ns_memcached_outer_heavy, 60000&lt;br/&gt;
ns_memcached_outer_very_heavy, 120000&lt;br/&gt;
ns_memcached_connected, 10000&lt;br/&gt;
ebucketmigrator_connect, 60000&lt;br/&gt;
&lt;br/&gt;
Summary, some error messages and fixes that worked:- &lt;br/&gt;
1) Rebalance exited with reason {exited}  {&amp;#39;EXIT&amp;#39;,&amp;lt;0.22700.12&amp;gt;,{timeout,{gen_server,call,[{&amp;#39;ns_memcached-default&amp;#39;,&amp;#39;&lt;a href=&apos;mailto:ns_1@10.3.2.81&apos;&gt;ns_1@10.3.2.81&lt;/a&gt;&amp;#39;},{stats,&amp;lt;&amp;lt;&amp;quot;tap&amp;quot;&amp;gt;&amp;gt;},30000]}}}}&lt;br/&gt;
Fix : adjust timeout value - 120sec - ns_memcached_outer_very_heavy&lt;br/&gt;
2) Rebalance exited with reason {exited,&lt;br/&gt;
{replicator_died,&lt;br/&gt;
&lt;br/&gt;
Fix: Adjust timeout value - 120 sec  - ns_memcached_outer_heavy&lt;br/&gt;
&lt;br/&gt;
3) Rebalance exited with reason {exited,&lt;br/&gt;
{&amp;#39;EXIT&amp;#39;,&amp;lt;0.24287.15&amp;gt;,&lt;br/&gt;
{timeout,&lt;br/&gt;
{gen_server,call,&lt;br/&gt;
[{&amp;#39;ns_memcached-default&amp;#39;,&amp;#39;&lt;a href=&apos;mailto:ns_1@10.3.2.81&apos;&gt;ns_1@10.3.2.81&lt;/a&gt;&amp;#39;},&lt;br/&gt;
{stats,&amp;lt;&amp;lt;&amp;quot;tap&amp;quot;&amp;gt;&amp;gt;},&lt;br/&gt;
30000]}}}}&lt;br/&gt;
&lt;br/&gt;
Fix : Adjust timeout to 120sec &lt;br/&gt;
4) Rebalance exited with reason {{change_filter_failed,&lt;br/&gt;
{&amp;#39;EXIT&amp;#39;,&lt;br/&gt;
{timeout,&lt;br/&gt;
&lt;br/&gt;
Fix : Adjust timeout values -&lt;br/&gt;
ebucketmigrator_connect                                 120 secs&lt;br/&gt;
ns_memcached_connected                              1 sec</description>
                <environment>Windows small/large cluster&lt;br/&gt;
Linux small/large cluster&lt;br/&gt;
&lt;br/&gt;
Bucket	        1, default&lt;br/&gt;
vbuckets	1024&lt;br/&gt;
RAM	        18.7G&lt;br/&gt;
Nodes          4 ( 2 form the base-cluster)&lt;br/&gt;
Items	        Setup for 20M items</environment>
            <key id="17761">MB-5546</key>
            <summary>Increasing the default timeouts on ns_server to avoid rebalance failures due to ep-engine stats timeout issues in large cluster or clusters where some nodes are actively using swap</summary>
                <type id="1" iconUrl="http://www.couchbase.com/issues/images/icons/issuetypes/bug.png">Bug</type>
                                <priority id="1" iconUrl="http://www.couchbase.com/issues/images/icons/priorities/blocker.png">Blocker</priority>
                    <status id="6" iconUrl="http://www.couchbase.com/issues/images/icons/statuses/closed.png">Closed</status>
                    <resolution id="1">Fixed</resolution>
                    <security id="10011">Public</security>
                        <assignee username="alkondratenko">Aleksey Kondratenko</assignee>
                                <reporter username="karan">Karan Kumar</reporter>
                        <labels>
                    </labels>
                <created>Wed, 13 Jun 2012 13:50:32 -0500</created>
                <updated>Wed, 9 Jan 2013 22:59:12 -0600</updated>
                    <resolved>Wed, 13 Jun 2012 15:35:31 -0500</resolved>
                            <version>1.8.1-release-candidate</version>
                                <fixVersion>1.8.1</fixVersion>
                                <component>ns_server</component>
                                <votes>0</votes>
                        <watches>1</watches>
                                                    <comments>
                    <comment id="29810" author="alkondratenko" created="Wed, 13 Jun 2012 14:27:28 -0500"  >I&amp;#39;m a little bit reluctant to change ns_memcached_connected timeout. It&amp;#39;s timeout we&amp;#39;re using when asking if ns_memcached is alive. It&amp;#39;ll just mark bucket as not quite healthy without failing anything. So raising timeout has some effects on autofailover and other things. Something I don&amp;#39;t want to do.</comment>
                    <comment id="29822" author="alkondratenko" created="Wed, 13 Jun 2012 15:32:17 -0500"  >Timeouts were bumped in a commit merged for branch-181 and merged up to master. Except, as noted above, ns_memcached_connected timeout</comment>
                    <comment id="29823" author="karan" created="Wed, 13 Jun 2012 15:35:31 -0500"  >Thanks Alk. &lt;br/&gt;
Duly noted the concerns. &lt;br/&gt;
&lt;a href=&quot;http://review.couchbase.org/#change,17230&quot;&gt;http://review.couchbase.org/#change,17230&lt;/a&gt;</comment>
                    <comment id="29903" author="thuan" created="Wed, 13 Jun 2012 21:05:09 -0500"  >Integrated in github-ns-server-2-0 #374 (See [&lt;a href=&quot;http://qa.hq.northscale.net/job/github-ns-server-2-0/374/&quot;&gt;http://qa.hq.northscale.net/job/github-ns-server-2-0/374/&lt;/a&gt;])&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;a href=&quot;http://www.couchbase.com/issues/browse/MB-5546&quot; title=&quot;Increasing the default timeouts on ns_server to avoid rebalance failures due to ep-engine stats timeout issues in large cluster or clusters where some nodes are actively using swap&quot;&gt;&lt;strike&gt;MB-5546&lt;/strike&gt;&lt;/a&gt;: raised some timeouts to cope with some paging (Revision 0998b9c92a78185eae31dcbdd55ad92e07e0e6a8)&lt;br/&gt;
&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Result = SUCCESS&lt;br/&gt;
Aliaksey Artamonau : &lt;br/&gt;
Files : &lt;br/&gt;
* src/ns_memcached.erl&lt;br/&gt;
* src/ebucketmigrator_srv.erl&lt;br/&gt;
</comment>
                </comments>
                    <attachments>
                </attachments>
            <subtasks>
        </subtasks>
                <customfields>
                                                                        <customfield id="customfield_10180" key="com.atlassian.jira.ext.charting:firstresponsedate">
                <customfieldname>Date of First Response</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>Wed, 13 Jun 2012 14:27:28 -0500</customfieldvalue>

                </customfieldvalues>
            </customfield>
                                                                                                                                                                                                            <customfield id="customfield_10081" key="com.pyxis.greenhopper.jira:gh-global-rank">
                <customfieldname>Rank</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>4972</customfieldvalue>
                </customfieldvalues>
            </customfield>
                                                                                                                                                                                        <customfield id="customfield_10181" key="com.atlassian.jira.ext.charting:timeinstatus">
                <customfieldname>Time In Status</customfieldname>
                <customfieldvalues>
                    
                </customfieldvalues>
            </customfield>
                                                </customfields>
    </item>
</channel>
</rss>