added a comment - - edited
I tried the same setting as yours (1 -> 1 replication,
default@node1 ->
default@node2, and
default@node1 ->
default2@node2), and it seems there is nothing wrong.
From the log below, XDCR replication manager got notified from ns_server instantly after I deleted the replication doc from UI, it instantly shutdown all ongoing bucket replication process, with no delay. And all XDCR activity stopped at source right after that. However, there could be some activity on destination cluster even after XDCR stopped replication on source side, because it may take a while to persist all items in memory to storage. I am not sure if there is any delay between UI stats and the real activity. Also, if both nodes in your test are on the local machine with 1024 vbuckets, it may take longer to finish. I think the delay should be much shorter if we use VMs to conduct the test.
At this time I am not sure what to fix. I merged some logs for timing purpose, and will ask Ketaki to do the same test on VM. If it is really an issue, we will reopen this bug and investigate the logs from VM.
[couchdb:info,2012-08-28T14:43:47.255,
n_0@127.0.0.1:<0.742.0>:couch_log:info:39]127.0.0.1 - - DELETE /_replicator/1d38c26cdc5c5bb0e6be126e8ae272be%2Fdefault%2Fdefault?rev=1-9ee1a1c9 200
[xdcr:debug,2012-08-28T14:43:47.257,
n_0@127.0.0.1:xdc_rep_manager:xdc_rep_manager:process_update:174]replication doc deleted (docId: <<"1d38c26cdc5c5bb0e6be126e8ae272be/default/default">>), stop all replications
[xdcr:debug,2012-08-28T14:43:47.258,
n_0@127.0.0.1:xdc_rep_manager:xdc_replication_sup:stop_replication:49]all replications for DocId <<"1d38c26cdc5c5bb0e6be126e8ae272be/default/default">> have been stopped
[ns_server:debug,2012-08-28T14:43:47.259,
n_0@127.0.0.1:<0.2113.0>:ns_pubsub:do_subscribe_link:134]Parent process of subscription {ns_config_events,<0.2112.0>} exited with reason shutdown
[ns_server:debug,2012-08-28T14:43:47.260,
n_0@127.0.0.1:<0.2113.0>:ns_pubsub:do_subscribe_link:149]Deleting {ns_config_events,<0.2112.0>} event handler: ok
[xdcr:debug,2012-08-28T14:43:47.296,
n_0@127.0.0.1:<0.11655.0>:xdc_vbucket_rep_worker:find_missing:121]after conflict resolution at target ("
http://Administrator:asdasd@127.0.0.1:9501/default%2f87%3b5816\
f256233b9dffc119c2c32325a512/"), out of all 396 docs the number of docs we need to replicate is: 396
[couchdb:info,2012-08-28T14:43:47.304,
n_0@127.0.0.1:<0.1858.0>:couch_log:info:39]checkpointing view update at seq 5 for _replicator _design/_replicator_info
[couchdb:info,2012-08-28T14:43:47.320,
n_0@127.0.0.1:<0.1852.0>:couch_log:info:39]127.0.0.1 - - GET /_replicator/_design/_replicator_info/_view/infos?group_level=1&_=1346179427278 200
[ns_server:debug,2012-08-28T14:44:00.037,
n_0@127.0.0.1:compaction_daemon:compaction_daemon:handle_info:269]Starting compaction for the following buckets:
[<<"default">>]
[ns_server:info,2012-08-28T14:44:00.074,
n_0@127.0.0.1:<0.13612.0>:compaction_daemon:try_to_cleanup_indexes:439]Cleaning up indexes for bucket `default`
[ns_server:info,2012-08-28T14:44:00.164,
n_0@127.0.0.1:<0.13612.0>:compaction_daemon:spawn_bucket_compactor:404]Compacting bucket default with config:
[{database_fragmentation_threshold,{30,undefined}},
MB-6041: add logs to time replication stop (Revision 1b1cf1f99f6e84b0baaa90a9ac2504b46e1d583a)Result = SUCCESS
Junyi Xie :
Files :
* src/xdc_rep_manager.erl
* src/xdc_replication_sup.erl