Moving Data from couchbase to hadoop
I have a couchbase cluster running in the amazon cloud which have about 300 million documents(~ 150 Gb).
I want to migrate the entire data to hadoop cluster,i.e, I want to move the data from the couchbase to hadoop cluster not just copy.
I tried using the sqoop but it doesn't seams to copy the data and also all the data doesn't seams to copied.
Also it was giving me NPE when i used it on password protected bucket.
Sqoop will copy the data and not move them this is its goals. I am susprised about the NPE.
Note that when I am using Sqoop I am using a Cloudera distribution are you?
Maybe you can also test with Talend ETL that will allow you to copy/move the data (you have more option the the job you are creating)
(I have only used Talend with Couchbase+RDBMS but as you can see you have all the connectors needed for Hadoop too)