<!-- 
RSS generated by JIRA (5.2.4#845-sha1:c9f4cc41abe72fb236945343a1f485c2c844dac9) at Wed Jun 19 22:12:33 CDT 2013

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary add field=key&field=summary to the URL of your request.
For example:
http://www.couchbase.com/issues/si/jira.issueviews:issue-xml/MB-6934/MB-6934.xml?field=key&field=summary
-->
<rss version="0.92" >
<channel>
    <title>Couchbase</title>
    <link>http://www.couchbase.com/issues</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>5.2.4</version>
        <build-number>845</build-number>
        <build-date>26-12-2012</build-date>
    </build-info>

<item>
            <title>[MB-6934] Displaying XDCR Replication error messages/warnings.</title>
                <link>http://www.couchbase.com/issues/browse/MB-6934</link>
                <project id="10010" key="MB">Couchbase Server</project>
                        <description>&lt;br/&gt;
Hi, &lt;br/&gt;
&lt;br/&gt;
With the new error logging code, we now display &amp;quot;recent 10 errors&amp;quot;. Added a screenshot at end of email.&lt;br/&gt;
&lt;br/&gt;
At any point, the last 10 error are displayed on the replication - 10 errors, which may or may not be valid depending upon the current time.&lt;br/&gt;
&lt;br/&gt;
This issue needs to be addressed at two levels -&lt;br/&gt;
1. Level of error logging - Currently too much information is displayed, which also gives misleading idea on state of replication.&lt;br/&gt;
2. Classification of errors v/s warnings.&lt;br/&gt;
&lt;br/&gt;
Having lower level information on the ns_logs can help trouble shoot , but having all of that information on the web-console might just confuse and overwhelm end-user IMO.&lt;br/&gt;
&lt;br/&gt;
&lt;br/&gt;
XDCR can have an error at any of the following levels&lt;br/&gt;
- xdc vbucket replicators - timing out, checkpoint failures, db_not_found&lt;br/&gt;
- xdc replication manager &lt;br/&gt;
- ns_server level - where it is unable to talk to the other remote cluster and so on.&lt;br/&gt;
&lt;br/&gt;
With some recent trials on the new code, we see a lot of errors on the level of bucket replicators, say vbucket XXX commit_checkpoint_failure.&lt;br/&gt;
But the replication is continuing as expected. Replication has not failed, but it is continuing minus the above checkpoint failure.&lt;br/&gt;
&lt;br/&gt;
It might be nicer to classify errors v/s warnings.&lt;br/&gt;
&lt;br/&gt;
Errors - When finally xdcr has stopped working . No more data is being sent over to the destination.&lt;br/&gt;
Replication will be attempted for X number of times, and is finally given up?&lt;br/&gt;
&lt;br/&gt;
Warnings - When there are timeouts, but it is a recoverable situation.&lt;br/&gt;
&lt;br/&gt;
-Ketaki&lt;br/&gt;
&lt;br/&gt;
Screenshot&lt;br/&gt;
</description>
                <environment>2.0-1856</environment>
            <key id="20273">MB-6934</key>
            <summary>Displaying XDCR Replication error messages/warnings.</summary>
                <type id="1" iconUrl="http://www.couchbase.com/issues/images/icons/issuetypes/bug.png">Bug</type>
                                <priority id="1" iconUrl="http://www.couchbase.com/issues/images/icons/priorities/blocker.png">Blocker</priority>
                    <status id="6" iconUrl="http://www.couchbase.com/issues/images/icons/statuses/closed.png">Closed</status>
                    <resolution id="1">Fixed</resolution>
                    <security id="10011">Public</security>
                        <assignee username="junyi">Junyi Xie</assignee>
                                <reporter username="ketaki">Ketaki Gangal</reporter>
                        <labels>
                    </labels>
                <created>Tue, 16 Oct 2012 18:24:45 -0500</created>
                <updated>Thu, 10 Jan 2013 01:14:17 -0600</updated>
                    <resolved>Tue, 23 Oct 2012 19:56:46 -0500</resolved>
                            <version>2.0</version>
                                <fixVersion>2.0</fixVersion>
                                <component>cross-datacenter-replication</component>
                <component>UI</component>
                                <votes>0</votes>
                        <watches>1</watches>
                                                    <comments>
                    <comment id="41563" author="ketaki" created="Tue, 16 Oct 2012 18:26:01 -0500"  >Comments from Product Mgmt&lt;br/&gt;
Hi Junyi, &lt;br/&gt;
&lt;br/&gt;
Is there a log level for the XDCR error messages? &lt;br/&gt;
Are the last 10 errors the only errors tracked? &lt;br/&gt;
Do these include info and warning messages or only errors in this list? &lt;br/&gt;
Do we clean up this error log periodically? (there is no way for ns_server to know if the error is relevant any more) &lt;br/&gt;
&lt;br/&gt;
Aliaksey, as we discussed, at a minimum we need to change the &amp;quot;10 errors&amp;quot; link that appears the first time this message buffer gets populated to a link in aqua blue (like the IP address in cluster reference) and should say &amp;quot;Recent XDCR log messages&amp;quot;  &lt;br/&gt;
&lt;br/&gt;
Junyi, if you can provide more visibility from the replicator side about warnings vs errors vs info messages, we can do something better, if not in 2.0 sometime in the future. But this basic level of error handling doesn&amp;#39;t give users enough visibility into what is going on. </comment>
                    <comment id="41566" author="junyi" created="Tue, 16 Oct 2012 18:52:45 -0500"  >Alk, &lt;br/&gt;
&lt;br/&gt;
Change within XDCR is at &lt;br/&gt;
&lt;br/&gt;
&lt;a href=&quot;http://review.couchbase.org/#/c/21694/&quot;&gt;http://review.couchbase.org/#/c/21694/&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
Now the error returned to ns_server is a pair {Time, ErrorString} instead of a string.&lt;br/&gt;
&lt;br/&gt;
Please go ahead and modify UI code accordingly. Thanks.&lt;br/&gt;
</comment>
                    <comment id="42353" author="alkondratenko" created="Tue, 23 Oct 2012 19:43:23 -0500"  >Commit to filter out too old errors is in gerrit. I&amp;#39;ve also implemented Dipti&amp;#39;s proposal to display errors link in normal color rather than red.</comment>
                    <comment id="42354" author="junyi" created="Tue, 23 Oct 2012 19:55:13 -0500"  >All fixes are on gerrit&lt;br/&gt;
&lt;br/&gt;
&lt;a href=&quot;http://review.couchbase.org/#/c/21694/&quot;&gt;http://review.couchbase.org/#/c/21694/&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
&lt;a href=&quot;http://review.couchbase.org/#/c/21903/&quot;&gt;http://review.couchbase.org/#/c/21903/&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
&lt;a href=&quot;http://review.couchbase.org/#/c/21904/2&quot;&gt;http://review.couchbase.org/#/c/21904/2&lt;/a&gt;&lt;br/&gt;
&lt;br/&gt;
</comment>
                    <comment id="42355" author="junyi" created="Tue, 23 Oct 2012 19:56:46 -0500"  >fixes on gerrit</comment>
                </comments>
                    <attachments>
                    <attachment id="15449" name="Screen Shot 2012-10-16 at 4.24.53 PM.png" size="66831" author="ketaki" created="Tue, 16 Oct 2012 18:25:41 -0500" />
                    <attachment id="15450" name="Screen Shot 2012-10-16 at 4.24.58 PM.png" size="186091" author="ketaki" created="Tue, 16 Oct 2012 18:25:41 -0500" />
                </attachments>
            <subtasks>
        </subtasks>
                <customfields>
                                                                        <customfield id="customfield_10180" key="com.atlassian.jira.ext.charting:firstresponsedate">
                <customfieldname>Date of First Response</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>Tue, 16 Oct 2012 18:52:45 -0500</customfieldvalue>

                </customfieldvalues>
            </customfield>
                                                                                                                                                                                                                                <customfield id="customfield_10081" key="com.pyxis.greenhopper.jira:gh-global-rank">
                <customfieldname>Rank</customfieldname>
                <customfieldvalues>
                    <customfieldvalue>3764</customfieldvalue>
                </customfieldvalues>
            </customfield>
                                                                                                                                                <customfield id="customfield_10052" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                <customfieldname>Sprint Status</customfieldname>
                <customfieldvalues>
                        <customfieldvalue key="10027"><![CDATA[Current Sprint]]></customfieldvalue>

                </customfieldvalues>
            </customfield>
                                                                <customfield id="customfield_10181" key="com.atlassian.jira.ext.charting:timeinstatus">
                <customfieldname>Time In Status</customfieldname>
                <customfieldvalues>
                    
                </customfieldvalues>
            </customfield>
                                                                    </customfields>
    </item>
</channel>
</rss>