Details
Description
Our integration testing is showing irregular operations failing during tests where a node is failed over, then added back and rebalanced. This is not expected, as there should be no failures during rebalance.
Assigning to Saakshi to further fill out the description.
Assigning to Saakshi to further fill out the description.
Activity
Matt Ingenthron
made changes -
| Field | Original Value | New Value |
|---|---|---|
| Planned Start | 2013-02-13 12:00 (initialized without having effort estimation) | |
| Planned End | 2013-02-13 12:00 (initialized without having effort estimation) |
Matt Ingenthron
made changes -
| Affects Version/s | 1.2.1 [ 10441 ] |
Matt Ingenthron
made changes -
| Fix Version/s | 1.2.2 [ 10472 ] |
Matt Ingenthron
made changes -
| Planned Start | 2013-02-13 12:00 | 2013-02-13 12:00 (set to new fixed version's start date) |
| Planned End | 2013-02-13 12:00 | 2013-02-13 12:00 (set to new fixed version's start date) |
Matt Ingenthron
made changes -
| Component/s | library [ 10122 ] |
Matt Ingenthron
made changes -
| Assignee | Mark Nunberg [ mnunberg ] | Saakshi Manocha [ saakshi.manocha ] |
Matt Ingenthron
made changes -
| Planned End | 2013-02-13 12:00 | (re-schedule end date based on new assignee) |
Matt Ingenthron
made changes -
| Description |
Our integration testing is showing irregular operations failing during tests where a node is failed over, then added back and rebalanced. This is not expected, as there should be no failures during rebalance.
Assigning to Mark to further fill out the description. |
Our integration testing is showing irregular operations failing during tests where a node is failed over, then added back and rebalanced. This is not expected, as there should be no failures during rebalance.
Assigning to Saakshi to further fill out the description. |
Saakshi Manocha
made changes -
| Status | Open [ 1 ] | In Progress [ 3 ] |
John Zablocki
made changes -
| Fix Version/s | 1.2.3 [ 10473 ] | |
| Fix Version/s | 1.2.2 [ 10472 ] |
John Zablocki
made changes -
| Planned Start | 2013-02-13 12:00 | 2013-02-13 12:00 (set to new fixed version's start date) |
| Planned End | (set to new fixed version's start date) |
John Zablocki
made changes -
| Fix Version/s | 1.2.5 [ 10525 ] | |
| Fix Version/s | 1.2.4 [ 10473 ] |
Saakshi Manocha
made changes -
| Status | In Progress [ 3 ] | Resolved [ 5 ] |
| Resolution | Fixed [ 1 ] |
- Ran the command:
python .\brun -C Sdkd.args -S dotnet-1.2-release -V 2.0.0-1976 -i cluster_config.ini -T HYBRID_readd-2
(This command will fail two nodes, add them back and then rebalance)
- Cluster_config.ini comprise of 4 nodes:
10.3.121.134 10.3.121.135 10.3.121.136 10.3.3.206
- Output is here:
http://sdk-testresults.couchbase.com.s3.amazonaws.com/sdkd/HWIN-335SPEPOCGT-IHYBRID_readd-2-Sdotnet-1.2-release-T2013-02-14-03.49.12-LV_CB_BASIC.txt
http://sdk-testresults.couchbase.com.s3.amazonaws.com/sdkd/HWIN-335SPEPOCGT-IHYBRID_readd-2-Sdotnet-1.2-release-T2013-02-14-03.49.12-LV_MC_BASIC.txt
http://sdk-testresults.couchbase.com.s3.amazonaws.com/sdkd/HWIN-335SPEPOCGT-IHYBRID_readd-2-Sdotnet-1.2-release-T2013-02-14-03.49.12-LV_HTTP_BASIC.txt
- Observations:
(a) Following errors occur continuously during CHANGE phase while the rebalance operation is undergoing:
[Enyim.Caching.Memcached.MemcachedNode|Error] System.IO.IOException: Failed to read from the socket '10.3.121.136:11210'. Error: SocketError value was Success, but 0 bytes were received
[Enyim.Caching.Memcached.MemcachedNode.InternalPoolImpl|Error] Could not init pool. System.NullReferenceException Object reference not set to an instance of an object.
[Sdkd.ViewQuery|Warn] Unrecognized error System.Net.WebException The operation has timed out
(b) No Errors occur during REBOUND phase, which is a good sign. This is the time when Rebalance operation is complete and no more topology changes occur.