<!-- 
RSS generated by JIRA (5.2.4#845-sha1:c9f4cc41abe72fb236945343a1f485c2c844dac9) at Thu May 23 01:53:57 CDT 2013

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary add field=key&field=summary to the URL of your request.
For example:
http://www.couchbase.com/issues/si/jira.issueviews:issue-xml/MB-4366/MB-4366.xml?field=key&field=summary
-->
<rss version="0.92" >
<channel>
    <title>Couchbase</title>
    <link>http://www.couchbase.com/issues</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>5.2.4</version>
        <build-number>845</build-number>
        <build-date>26-12-2012</build-date>
    </build-info>

<item>
            <title>[MB-4366] ns_server is reusing tap names unsafely which causes data loss or inconsistency in replication when a node is removed and added back</title>
                <link>http://www.couchbase.com/issues/browse/MB-4366</link>
                <project id="10010" key="MB">Couchbase Server</project>
                        <description>screenshot attached&lt;br/&gt;
&lt;br/&gt;
NOTE: we&amp;#39;re converting this to main &amp;#39;named tap issues&amp;#39; ticket.&lt;br/&gt;
&lt;br/&gt;
So what&amp;#39;s not safe about reusing named taps as of 1.8.0?&lt;br/&gt;
&lt;br/&gt;
If something happened to destination node after tap was disconnected. And if that something affected data for vbuckets replicated as part of named tap, then subsequent reuse of named tap will incorrectly assume that we can continue sending stuff instead of re-negotiating which data needs to be resent.&lt;br/&gt;
</description>
                <environment></environment>
            <key id="15435">MB-4366</key>
            <summary>ns_server is reusing tap names unsafely which causes data loss or inconsistency in replication when a node is removed and added back</summary>
                <type id="1" iconUrl="http://www.couchbase.com/issues/images/icons/issuetypes/bug.png">Bug</type>
                                <priority id="1" iconUrl="http://www.couchbase.com/issues/images/icons/priorities/blocker.png">Blocker</priority>
                    <status id="6" iconUrl="http://www.couchbase.com/issues/images/icons/statuses/closed.png">Closed</status>
                    <resolution id="1">Fixed</resolution>
                    <security id="10011">Public</security>
                        <assignee username="alkondratenko">Aleksey Kondratenko</assignee>
                                <reporter username="farshid">Farshid Ghods</reporter>
                        <labels>
                        <label>1.8.1-release-notes</label>
                    </labels>
                <created>Wed, 19 Oct 2011 19:20:35 -0500</created>
                <updated>Wed, 9 Jan 2013 22:59:21 -0600</updated>
                    <resolved>Thu, 12 Apr 2012 22:44:34 -0500</resolved>
                            <version>1.7.2</version>
                <version>1.8.0</version>
                                <fixVersion>1.8.1</fixVersion>
                                <component>ns_server</component>
                                <votes>0</votes>
                        <watches>3</watches>
                                                    <comments>
                    <comment id="22419" author="farshid" created="Wed, 19 Oct 2011 19:23:48 -0500"  >another screenshot : 5 minutes after stopping the rebalance</comment>
                    <comment id="22423" author="farshid" created="Thu, 20 Oct 2011 01:33:32 -0500"  >tap stream only stops if there is no item added to the backlog&lt;br/&gt;
if the user keeps the load running this tap stream remains alive forever</comment>
                    <comment id="23143" author="alkondratenko" created="Thu, 22 Dec 2011 07:01:55 -0600"  >Farshid, cannot make sense of this screenshots. Can you elaborate?</comment>
                    <comment id="23146" author="farshid" created="Thu, 22 Dec 2011 09:41:25 -0600"  >basically that means there is still one tap_rebalance stream open and running even after rebalance was stopped.&lt;br/&gt;
&lt;br/&gt;
we seem to be stopping most of the streams except one</comment>
                    <comment id="23147" author="farshid" created="Thu, 22 Dec 2011 09:41:57 -0600"  >waiting 5 minutes will not work if there are ongoing mutuations in the cluster because this tap stream only times out after 5 minutes of inactivity</comment>
                    <comment id="23153" author="alkondratenko" created="Thu, 22 Dec 2011 09:56:56 -0600"  >so it&amp;#39;s ep-engine issue then ? I mean we close tap streams as much as possible in ns_server. Named tap streams are kept alive by ep-engine. If there&amp;#39;s anything ns_server can do to really stop those tap producers, I&amp;#39;ll be happy to do that.</comment>
                    <comment id="25370" author="steve" created="Fri, 30 Mar 2012 15:35:46 -0500"  >this is the main ticket for the named tap approach/fix</comment>
                    <comment id="25371" author="steve" created="Fri, 30 Mar 2012 15:36:18 -0500"  >is this a blocker for 1.8.1?</comment>
                    <comment id="25382" author="dipti" created="Fri, 30 Mar 2012 17:33:35 -0500"  >Yes, because this may be causing data loss in some conditions. &lt;br/&gt;
&lt;br/&gt;
Farshid, I believe there are a few other tickets where this is the underlying problem. Can you reference them here for completeness? Thanks </comment>
                    <comment id="25560" author="alkondratenko" created="Sat, 7 Apr 2012 15:58:09 -0500"  >&lt;a href=&quot;http://review.couchbase.org/14555&quot;&gt;http://review.couchbase.org/14555&lt;/a&gt; fixes it on 1.8.1.&lt;br/&gt;
&lt;br/&gt;
1.8 and master have a bit different code in this area so this work still needs some forward-porting.</comment>
                    <comment id="25600" author="steve" created="Mon, 9 Apr 2012 17:53:52 -0500"  >fix is in gerrit (but more work still needed to enable 1.8.2)</comment>
                    <comment id="25614" author="alkondratenko" created="Mon, 9 Apr 2012 21:13:45 -0500"  >let&amp;#39;s keep this open for now. While I&amp;#39;ll adapt it for 1.8.2 I may have to change 1.8.1 code to enable forward-compatibility with 1.8.2 and master</comment>
                    <comment id="25757" author="dipti" created="Wed, 11 Apr 2012 21:51:10 -0500"  >Aliaksey, code complete is friday and we need to merge everything in by then. &lt;br/&gt;
What changes need to be made to ensure forward-compatibility?</comment>
                    <comment id="25762" author="alkondratenko" created="Wed, 11 Apr 2012 22:03:54 -0500"  >Minor. I&amp;#39;ll be doing that tomorrow first-priority.</comment>
                    <comment id="25820" author="alkondratenko" created="Thu, 12 Apr 2012 22:44:34 -0500"  >I&amp;#39;ve found no further changes to 1.8.1 are needed. 1.8.2 implementation is here &lt;a href=&quot;http://review.couchbase.org/14827&quot;&gt;http://review.couchbase.org/14827&lt;/a&gt;</comment>
                    <comment id="26085" author="thuan" created="Fri, 20 Apr 2012 17:47:41 -0500"  >Integrated in github-ns-server-2-0 #333 (See [&lt;a href=&quot;http://qa.hq.northscale.net/job/github-ns-server-2-0/333/&quot;&gt;http://qa.hq.northscale.net/job/github-ns-server-2-0/333/&lt;/a&gt;])&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;only reuse tap name when changing vbucket filter.&lt;a href=&quot;http://www.couchbase.com/issues/browse/MB-4366&quot; title=&quot;ns_server is reusing tap names unsafely which causes data loss or inconsistency in replication when a node is removed and added back&quot;&gt;&lt;strike&gt;MB-4366&lt;/strike&gt;&lt;/a&gt; (Revision 61bf78355e64fff2e807939fea385862ca6919d5)&lt;br/&gt;
reimplemented named tap fix for branch-18. &lt;a href=&quot;http://www.couchbase.com/issues/browse/MB-4366&quot; title=&quot;ns_server is reusing tap names unsafely which causes data loss or inconsistency in replication when a node is removed and added back&quot;&gt;&lt;strike&gt;MB-4366&lt;/strike&gt;&lt;/a&gt; (Revision e3b833480ceb5b7832e22131ed5d3fb532e6ea83)&lt;br/&gt;
&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Result = SUCCESS&lt;br/&gt;
Aliaksey Artamonau : &lt;br/&gt;
Files : &lt;br/&gt;
* src/ns_server_cluster_sup.erl&lt;br/&gt;
* src/ebucketmigrator_srv.erl&lt;br/&gt;
* src/ns_vbm_sup.erl&lt;br/&gt;
&lt;br/&gt;
Aliaksey Artamonau : &lt;br/&gt;
Files : &lt;br/&gt;
* src/ns_vbm_new_sup.erl&lt;br/&gt;
* src/ns_vbm_sup.erl&lt;br/&gt;
* src/ebucketmigrator_srv.erl&lt;br/&gt;
* src/ns_server_cluster_sup.erl&lt;br/&gt;
* src/cb_gen_vbm_sup.erl&lt;br/&gt;
</comment>
                    <comment id="26192" author="thuan" created="Wed, 25 Apr 2012 02:47:18 -0500"  >Integrated in github-ns-server-2-0 #337 (See [&lt;a href=&quot;http://qa.hq.northscale.net/job/github-ns-server-2-0/337/&quot;&gt;http://qa.hq.northscale.net/job/github-ns-server-2-0/337/&lt;/a&gt;])&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;fixed typo in start_vbucket_filter_change. &lt;a href=&quot;http://www.couchbase.com/issues/browse/MB-4366&quot; title=&quot;ns_server is reusing tap names unsafely which causes data loss or inconsistency in replication when a node is removed and added back&quot;&gt;&lt;strike&gt;MB-4366&lt;/strike&gt;&lt;/a&gt; (Revision 5db3c35e8a5ff6a5885271df4466b30c5369fa38)&lt;br/&gt;
&lt;br/&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Result = SUCCESS&lt;br/&gt;
Steve Yen : &lt;br/&gt;
Files : &lt;br/&gt;
* src/ebucketmigrator_srv.erl&lt;br/&gt;
</comment>
                </comments>
                <issuelinks>
                        <issuelinktype id="10000">
                <name>Dependency</name>
                                                <inwardlinks description="blocks">
                                    </inwardlinks>
                            </issuelinktype>
                    </issuelinks>
                <attachments>
                    <attachment id="11768" name="Screen Shot 2011-10-19 at 11.25.13 PM.png" size="61469" author="farshid" created="Thu, 20 Oct 2011 01:25:17 -0500" />
                    <attachment id="11765" name="Screen Shot 2011-10-19 at 5.19.37 PM.png" size="13150" author="farshid" created="Wed, 19 Oct 2011 19:20:35 -0500" />
                    <attachment id="11766" name="Screen Shot 2011-10-19 at 5.23.19 PM.png" size="42395" author="farshid" created="Wed, 19 Oct 2011 19:23:57 -0500" />
                </attachments>
            <subtasks>
        </subtasks>
                <customfields>
                                                                        <customfield id="customfield_10180" key="com.atlassian.jira.ext.charting:firstresponsedate">
                <customfieldname>Date of First Response</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>Thu, 22 Dec 2011 07:01:55 -0600</customfieldvalue>

                </customfieldvalues>
            </customfield>
                                                                                                                                                                                                            <customfield id="customfield_10081" key="com.pyxis.greenhopper.jira:gh-global-rank">
                <customfieldname>Rank</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>6073</customfieldvalue>
                </customfieldvalues>
            </customfield>
                                                                                                                            <customfield id="customfield_10050" key="com.atlassian.jira.plugin.system.customfieldtypes:float">
                <customfieldname>Sprint Priority</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>7.0</customfieldvalue>
                </customfieldvalues>
            </customfield>
                                                                                    <customfield id="customfield_10181" key="com.atlassian.jira.ext.charting:timeinstatus">
                <customfieldname>Time In Status</customfieldname>
                <customfieldvalues>
                    
                </customfieldvalues>
            </customfield>
                                                </customfields>
    </item>
</channel>
</rss>