Couchsqoop to Hadoop problem
I created a vm centos 6.2 on windows azure platform. Installed Cloudera Hadoop Distribution 4.0 and CouchSqoop plugin
I execute this command
sqoop import --connect http://10.76.158.26:8091/pools --table garbage DUMP
but it says
12/07/05 09:56:48 ERROR tool.BaseSqoopTool: Error parsing arguments for import:
12/07/05 09:56:48 ERROR tool.BaseSqoopTool: Unrecognized argument: DUMP
how can i connect to my couchbase bucket and import data to hadoop?
sqoop import --connect http://168.63.52.78:8091/pools --table DUMP
this gives me the following error
12/07/09 08:09:08 ERROR sqoop.ConnFactory: Error loading ManagerFactory information from file /usr/lib/sqoop/conf/managers.d/couchsqoop-manager: java.io.IOException: Could not load jar lib/couchsqoop-plugin-1.0.jar into JVM. (Could not find class com.couchbase.sqoop.manager.CouchbaseFactory.)
at org.apache.sqoop.util.ClassLoaderStack.addJarFile(ClassLoaderStack.java:92)
at com.cloudera.sqoop.util.ClassLoaderStack.addJarFile(ClassLoaderStack.java:36)
at org.apache.sqoop.ConnFactory.addManagersFromFile(ConnFactory.java:159)
at org.apache.sqoop.ConnFactory.loadManagersFromConfDir(ConnFactory.java:218)
at org.apache.sqoop.ConnFactory.instantiateFactories(ConnFactory.java:83)
at org.apache.sqoop.ConnFactory.(ConnFactory.java:60)
at com.cloudera.sqoop.ConnFactory.(ConnFactory.java:36)
at org.apache.sqoop.tool.BaseSqoopTool.init(BaseSqoopTool.java:202)
at org.apache.sqoop.tool.ImportTool.init(ImportTool.java:83)
at org.apache.sqoop.tool.ImportTool.run(ImportTool.java:464)
at org.apache.sqoop.Sqoop.run(Sqoop.java:145)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:181)
at org.apache.sqoop.Sqoop.runTool(Sqoop.java:220)
at org.apache.sqoop.Sqoop.runTool(Sqoop.java:229)
at org.apache.sqoop.Sqoop.main(Sqoop.java:238)
at com.cloudera.sqoop.Sqoop.main(Sqoop.java:57)
Caused by: java.lang.ClassNotFoundException: com.couchbase.sqoop.manager.CouchbaseFactory
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at java.net.FactoryURLClassLoader.loadClass(URLClassLoader.java:627)
at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:247)
at org.apache.sqoop.util.ClassLoaderStack.addJarFile(ClassLoaderStack.java:88)
... 16 more
12/07/09 08:09:08 ERROR sqoop.ConnFactory: Could not load ManagerFactory com.couchbase.sqoop.manager.CouchbaseFactory (not found)
12/07/09 08:09:08 ERROR tool.BaseSqoopTool: Got error creating database manager: java.io.IOException: No manager for connect string: http://168.63.52.78:8091/pools
at org.apache.sqoop.ConnFactory.getManager(ConnFactory.java:119)
at org.apache.sqoop.tool.BaseSqoopTool.init(BaseSqoopTool.java:202)
at org.apache.sqoop.tool.ImportTool.init(ImportTool.java:83)
at org.apache.sqoop.tool.ImportTool.run(ImportTool.java:464)
at org.apache.sqoop.Sqoop.run(Sqoop.java:145)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:181)
at org.apache.sqoop.Sqoop.runTool(Sqoop.java:220)
at org.apache.sqoop.Sqoop.runTool(Sqoop.java:229)
at org.apache.sqoop.Sqoop.main(Sqoop.java:238)
at com.cloudera.sqoop.Sqoop.main(Sqoop.java:57)
You need to specify your table as either DUMP or BACKFILL_ where num is a time in minutes. The plugin is complaining because you specified the table as garbage and this is an unrecognized type of stream. See below for the documentation on importing.
http://www.couchbase.com/docs/hadoop-plugin/hadoop-plugin-sqoop-import.html
Try this:
sqoop import --connect http://10.76.158.26:8091/pools --table DUMP