{"id":2208,"date":"2016-03-29T15:00:00","date_gmt":"2016-03-29T15:00:00","guid":{"rendered":"https:\/\/www.couchbase.com\/blog\/?p=2208"},"modified":"2023-06-23T05:20:40","modified_gmt":"2023-06-23T12:20:40","slug":"load-csv-data-into-couchbase-using-apache-spark","status":"publish","type":"post","link":"https:\/\/www.couchbase.com\/blog\/pt\/load-csv-data-into-couchbase-using-apache-spark\/","title":{"rendered":"Carregar dados CSV no Couchbase usando o Apache Spark"},"content":{"rendered":"<p>Ultimamente, tenho passado muito tempo trabalhando com ferramentas de Big Data, em especial com o Apache Spark. Caso voc\u00ea n\u00e3o esteja familiarizado, o Apache<br \/>\nO Spark \u00e9 uma ferramenta incrivelmente eficiente para processar grandes quantidades de dados. Seu desempenho \u00e9 significativamente melhor do que o do MapReduce, e em<br \/>\nNa realidade, ele n\u00e3o \u00e9 muito dif\u00edcil de usar.<\/p>\n<p>O Apache Spark funciona muito bem em combina\u00e7\u00e3o com o Couchbase por meio do Couchbase Spark Connector. Veremos o que \u00e9 necess\u00e1rio para<br \/>\ncarregar alguns dados brutos de valores separados por v\u00edrgula (CSV) no Couchbase usando o Apache Spark.<\/p>\n<h2>Os requisitos<\/h2>\n<p>N\u00e3o h\u00e1 muitos requisitos para colocar esse projeto em funcionamento. No m\u00ednimo, voc\u00ea precisar\u00e1 do seguinte:<\/p>\n<ul>\n<li><a href=\"https:\/\/spark.apache.org\/downloads.html\">Apache Spark<\/a> 1.6.1<\/li>\n<li>JDK 1.8+<\/li>\n<li>Apache Maven 3.3+<\/li>\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/downloads\/\">Servidor Couchbase<\/a> 4.1+<\/li>\n<\/ul>\n<p>A maior parte do desenvolvimento ocorrer\u00e1 com o JDK 1.8 e o Maven, mas quando chegar a hora de executar o aplicativo, o Apache Spark<br \/>\nseja necess\u00e1rio, seja por meio de uma inst\u00e2ncia local ou remota.<\/p>\n<h2>Entendendo o conjunto de dados e o modelo de dados<\/h2>\n<p>Uma \u00f3tima maneira de se familiarizar com o Apache Spark \u00e9 obter um conjunto de dados de amostra no site de ci\u00eancia de dados,<br \/>\n<a href=\"https:\/\/www.kaggle.com\/\">Kaggle<\/a>. Para este exemplo, vamos dar uma olhada no conjunto de dados de amostra chamado<br \/>\n<a href=\"https:\/\/www.kaggle.com\/kaggle\/sf-salaries\">Sal\u00e1rios de SF<\/a> que cont\u00e9m informa\u00e7\u00f5es sobre a quantidade de dinheiro que o governo<br \/>\nos funcion\u00e1rios de S\u00e3o Francisco est\u00e3o ganhando.<\/p>\n<p>Do ponto de vista dos dados, h\u00e1 um \u00fanico arquivo de valor separado por v\u00edrgula (CSV) chamado <strong>sal\u00e1rios.csv<\/strong> com o seguinte<br \/>\ncolunas nele:<\/p>\n<ol>\n<li>Id<\/li>\n<li>Nome do funcion\u00e1rio<\/li>\n<li>Cargo<\/li>\n<li>BasePay<\/li>\n<li>Pagamento de horas extras<\/li>\n<li>Outros pagamentos<\/li>\n<li>Benef\u00edcios<\/li>\n<li>TotalPay<\/li>\n<li>TotalPayBenefits<\/li>\n<li>Ano<\/li>\n<li>Notas<\/li>\n<li>Ag\u00eancia<\/li>\n<li>Status<\/li>\n<\/ol>\n<p>Trabalhar com os dados no formato CSV \u00e9 quase imposs\u00edvel. Ainda mais quando se trata de grandes quantidades de dados. Em vez disso, esses dados ser\u00e3o<br \/>\narmazenados como dados NoSQL para que possam ser processados posteriormente. N\u00e3o entraremos em detalhes sobre o processamento de n\u00fameros e consultas aqui, mas isso vir\u00e1 em um<br \/>\nartigo futuro. No momento, queremos apenas coloc\u00e1-lo no formato NoSQL.<\/p>\n<p>Quando carregado no Couchbase, cada linha do CSV ter\u00e1 uma apar\u00eancia semelhante \u00e0 seguinte:<\/p>\n<pre><code>\r\n{\r\n    \"Id\": \"10029\",\r\n    \"EmployeeName\": \"FERGAL CLANCY\",\r\n    \"JobTitle\": \"BUILDING INSPECTOR\",\r\n    \"BasePay\": \"94529.22\",\r\n    \"OvertimePay\": \"0\",\r\n    \"OtherPay\": \"2502.6\",\r\n    \"Benefits\": \"\",\r\n    \"TotalPay\": \"97031.82\",\r\n    \"TotalPayBenefits\": \"97031.82\",\r\n    \"Year\": \"2011\",\r\n    \"Notes\": \"\",\r\n    \"Agency\": \"San Francisco\",\r\n    \"Status\": \"\"\r\n}\r\n<\/code><\/pre>\n<p>Sim, o bloco de dados acima \u00e9 um documento JSON, que \u00e9 o que o Couchbase suporta. Agora que conhecemos os objetivos dos dados, podemos come\u00e7ar<br \/>\ncarregar os dados CSV no Couchbase com o Apache Spark.<\/p>\n<h2>Transformando os dados brutos e gravando no Couchbase<\/h2>\n<p>Para usar o Apache Spark em um aplicativo Java, algumas depend\u00eancias devem ser inclu\u00eddas. Precisamos incluir o Spark Core, o Spark SQL, o Spark CSV e o<br \/>\nConector do Couchbase Spark. Como estamos usando o Maven, tudo pode ser inclu\u00eddo por meio do Maven <strong>pom.xml<\/strong> arquivo. Para incluir<br \/>\nSpark Core, inclua a seguinte depend\u00eancia em seu arquivo Maven:<\/p>\n<pre><code>\r\n\r\n    org.apache.spark\r\n    spark-core_2.10\r\n    1.6.1\r\n\r\n<\/code><\/pre>\n<p>Como os dados brutos estar\u00e3o na forma de CSV, podemos usar o pacote de conveni\u00eancia do Spark chamado Spark CSV. A depend\u00eancia do Maven<br \/>\npara o Spark CSV pode ser adicionado desta forma:<\/p>\n<pre><code>\r\n\r\n    com.databricks\r\n    spark-csv_2.10\r\n    1.4.0\r\n\r\n<\/code><\/pre>\n<p>Os dados CSV ser\u00e3o carregados em um Apache Spark DataFrame. Se voc\u00ea n\u00e3o estiver familiarizado com DataFrames, eles podem ser consultados usando o Spark<br \/>\nSQL. Isso faz parte de como colocaremos os dados no Couchbase. Para incluir o Spark SQL em seu projeto, adicione a depend\u00eancia Maven<br \/>\nassim:<\/p>\n<pre><code>\r\n\r\n    org.apache.spark\r\n    spark-sql_2.10\r\n    1.6.1\r\n    provided\r\n\r\n<\/code><\/pre>\n<p>Por fim, o Apache Spark precisa ser conectado ao Couchbase Server. Isso pode ser feito por meio do Couchbase Connector for Spark. Para<br \/>\nadicione essa depend\u00eancia em seu projeto Maven, adicione o seguinte ao seu <strong>pom.xml<\/strong> file:<\/p>\n<pre><code>\r\n\r\n    com.couchbase.client\r\n    spark-connector_2.10\r\n    1.1.0\r\n\r\n<\/code><\/pre>\n<p>Todas as depend\u00eancias do projeto est\u00e3o prontas para funcionar!<\/p>\n<p>Para come\u00e7ar a carregar dados CSV por meio de c\u00f3digo Java, o Apache Spark deve primeiro ser configurado em nosso projeto. Isso inclui definir o que o Spark<br \/>\na ser usada e em qual bucket do Couchbase os dados ser\u00e3o armazenados.<\/p>\n<pre><code>\r\nSparkConf conf = new SparkConf()\r\n        .setAppName(\"SF Salaries\")\r\n        .setMaster(\"local[*]\")\r\n        .set(\"com.couchbase.bucket.default\", \"\");\r\nJavaSparkContext javaSparkContext = new JavaSparkContext(conf);\r\n<\/code><\/pre>\n<p>O nome do aplicativo ser\u00e1 <strong>Sal\u00e1rios de SF<\/strong> e o cluster mestre do Spark ser\u00e1 a m\u00e1quina local<br \/>\nj\u00e1 que o Spark ser\u00e1 executado localmente neste exemplo. O bucket do Couchbase a ser usado \u00e9, mais uma vez, o bucket padr\u00e3o.<\/p>\n<p>Para criar um Spark DataFrame, um <code>SQLContexto<\/code> deve ser criado a partir do <code>JavaSparkContext<\/code>.<\/p>\n<pre><code>\r\nSQLContext sqlContext = new SQLContext(javaSparkContext);\r\n<\/code><\/pre>\n<p>Usando o <strong>SQLContexto<\/strong> os dados CSV podem ser lidos dessa forma:<\/p>\n<pre><code>\r\nDataFrame dataFrame = sqlContext.read()\r\n    .format(\"com.databricks.spark.csv\")\r\n    .option(\"inferSchema\", \"true\")\r\n    .option(\"header\", \"true\")\r\n    .load(\"PATH_TO_CSV_FILE\");\r\n<\/code><\/pre>\n<p>O processo de leitura usar\u00e1 o pacote Spark CSV e preservar\u00e1 as informa\u00e7\u00f5es de cabe\u00e7alho que existem na parte superior do arquivo CSV.<br \/>\nQuando lidos em um DataFrame, os dados CSV agora s\u00e3o algo que o Couchbase pode entender.<\/p>\n<p>Deve ser feito um ajuste nos dados de id. O Spark o reconhecer\u00e1 como um n\u00famero inteiro ou num\u00e9rico porque esse conjunto de dados tem apenas<br \/>\nvalores num\u00e9ricos como a coluna. O Couchbase espera uma string id.<\/p>\n<pre><code>\r\ndataFrame = dataFrame.withColumn(\"Id\", df.col(\"Id\").cast(\"string\"));\r\n<\/code><\/pre>\n<p>O DataFrame agora pode ser preparado para ser salvo no Couchbase.<\/p>\n<pre><code>\r\nDataFrameWriterFunctions dataFrameWriterFunctions = new DataFrameWriterFunctions(dataFrame.write());\r\nMap<\/code><\/pre>\n<p>Com os dados do DataFrame canalizados para o <code>Fun\u00e7\u00f5es do DataFrameWriter<\/code> o valor de id pode ser mapeado para um objeto<br \/>\nid do documento. Os dados nesse ponto podem ser salvos.<\/p>\n<pre><code>\r\ndataFrameWriterFunctions.couchbase(options);\r\n<\/code><\/pre>\n<p>Grandes quantidades de documentos do Couchbase ser\u00e3o salvas no bucket.<\/p>\n<h2>Executando o projeto com o Apache Spark<\/h2>\n<p>Empacote o projeto em um JAR execut\u00e1vel usando o Maven. O projeto pode ser executado ap\u00f3s ser empacotado, fazendo o seguinte<br \/>\nassim:<\/p>\n<pre><code>\r\n\/path\/to\/apache\/spark\/bin\/spark-submit --class \"com.app.Main\" target\/project-jar-with-dependencies.jar\r\n<\/code><\/pre>\n<p>Dependendo do tamanho do conjunto de dados e da velocidade de seu computador ou servidor, o processo de carregamento pode demorar um pouco.<\/p>\n<h2>Conclus\u00e3o<\/h2>\n<p>Voc\u00ea acabou de experimentar o carregamento de dados CSV sujos no Couchbase usando o Apache Spark e o Couchbase Spark Connector. O Spark foi<br \/>\nprojetado para poder processar rapidamente grandes quantidades de dados em tempo real. Combine-o com o Couchbase e seu sistema centrado na mem\u00f3ria<br \/>\ne voc\u00ea tem um \u00f3timo pacote de software.<\/p>","protected":false},"excerpt":{"rendered":"<p>I&#8217;ve been spending a lot of time working with Big Data tools lately, in particular Apache Spark. In case you&#8217;re unfamiliar, Apache Spark is an incredibly efficient tool for processing massive amounts of data. It performs significantly better than MapReduce, [&hellip;]<\/p>","protected":false},"author":63,"featured_media":13873,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[1816,1818],"tags":[1613,1236,1614,1610],"ppma_author":[9032],"class_list":["post-2208","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-couchbase-server","category-java","tag-apache","tag-big-data","tag-csv","tag-spark"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.7.1 (Yoast SEO v25.7) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Load CSV Data into Couchbase using Apache Spark<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.couchbase.com\/blog\/pt\/load-csv-data-into-couchbase-using-apache-spark\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Load CSV Data into Couchbase using Apache Spark\" \/>\n<meta property=\"og:description\" content=\"I&#8217;ve been spending a lot of time working with Big Data tools lately, in particular Apache Spark. In case you&#8217;re unfamiliar, Apache Spark is an incredibly efficient tool for processing massive amounts of data. It performs significantly better than MapReduce, [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.couchbase.com\/blog\/pt\/load-csv-data-into-couchbase-using-apache-spark\/\" \/>\n<meta property=\"og:site_name\" content=\"The Couchbase Blog\" \/>\n<meta property=\"article:author\" content=\"https:\/\/www.facebook.com\/thepolyglotdeveloper\" \/>\n<meta property=\"article:published_time\" content=\"2016-03-29T15:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-06-23T12:20:40+00:00\" \/>\n<meta name=\"author\" content=\"Nic Raboy, Developer Advocate, Couchbase\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@nraboy\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Nic Raboy, Developer Advocate, Couchbase\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/\"},\"author\":{\"name\":\"Nic Raboy, Developer Advocate, Couchbase\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/bb545ebe83bb2d12f91095811d0a72e1\"},\"headline\":\"Load CSV Data into Couchbase using Apache Spark\",\"datePublished\":\"2016-03-29T15:00:00+00:00\",\"dateModified\":\"2023-06-23T12:20:40+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/\"},\"wordCount\":879,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png\",\"keywords\":[\"apache\",\"Big Data\",\"csv\",\"spark\"],\"articleSection\":[\"Couchbase Server\",\"Java\"],\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/\",\"url\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/\",\"name\":\"Load CSV Data into Couchbase using Apache Spark\",\"isPartOf\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png\",\"datePublished\":\"2016-03-29T15:00:00+00:00\",\"dateModified\":\"2023-06-23T12:20:40+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png\",\"width\":1800,\"height\":630},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.couchbase.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Load CSV Data into Couchbase using Apache Spark\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#website\",\"url\":\"https:\/\/www.couchbase.com\/blog\/\",\"name\":\"The Couchbase Blog\",\"description\":\"Couchbase, the NoSQL Database\",\"publisher\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\",\"name\":\"The Couchbase Blog\",\"url\":\"https:\/\/www.couchbase.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png\",\"width\":218,\"height\":34,\"caption\":\"The Couchbase Blog\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/bb545ebe83bb2d12f91095811d0a72e1\",\"name\":\"Nic Raboy, Developer Advocate, Couchbase\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/image\/8863514d8bed0cf6080f23db40e00354\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/bedeb68368d4681aca4c74fe5f697f0c423b80d498ec50fd915ba018b72c101f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/bedeb68368d4681aca4c74fe5f697f0c423b80d498ec50fd915ba018b72c101f?s=96&d=mm&r=g\",\"caption\":\"Nic Raboy, Developer Advocate, Couchbase\"},\"description\":\"Nic Raboy is an advocate of modern web and mobile development technologies. He has experience in Java, JavaScript, Golang and a variety of frameworks such as Angular, NativeScript, and Apache Cordova. Nic writes about his development experiences related to making web and mobile development easier to understand.\",\"sameAs\":[\"https:\/\/www.thepolyglotdeveloper.com\",\"https:\/\/www.facebook.com\/thepolyglotdeveloper\",\"https:\/\/x.com\/nraboy\"],\"url\":\"https:\/\/www.couchbase.com\/blog\/pt\/author\/nic-raboy-2\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Carregar dados CSV no Couchbase usando o Apache Spark","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.couchbase.com\/blog\/pt\/load-csv-data-into-couchbase-using-apache-spark\/","og_locale":"pt_BR","og_type":"article","og_title":"Load CSV Data into Couchbase using Apache Spark","og_description":"I&#8217;ve been spending a lot of time working with Big Data tools lately, in particular Apache Spark. In case you&#8217;re unfamiliar, Apache Spark is an incredibly efficient tool for processing massive amounts of data. It performs significantly better than MapReduce, [&hellip;]","og_url":"https:\/\/www.couchbase.com\/blog\/pt\/load-csv-data-into-couchbase-using-apache-spark\/","og_site_name":"The Couchbase Blog","article_author":"https:\/\/www.facebook.com\/thepolyglotdeveloper","article_published_time":"2016-03-29T15:00:00+00:00","article_modified_time":"2023-06-23T12:20:40+00:00","author":"Nic Raboy, Developer Advocate, Couchbase","twitter_card":"summary_large_image","twitter_creator":"@nraboy","twitter_misc":{"Written by":"Nic Raboy, Developer Advocate, Couchbase","Est. reading time":"5 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#article","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/"},"author":{"name":"Nic Raboy, Developer Advocate, Couchbase","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/bb545ebe83bb2d12f91095811d0a72e1"},"headline":"Load CSV Data into Couchbase using Apache Spark","datePublished":"2016-03-29T15:00:00+00:00","dateModified":"2023-06-23T12:20:40+00:00","mainEntityOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/"},"wordCount":879,"commentCount":0,"publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png","keywords":["apache","Big Data","csv","spark"],"articleSection":["Couchbase Server","Java"],"inLanguage":"pt-BR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/","url":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/","name":"Carregar dados CSV no Couchbase usando o Apache Spark","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png","datePublished":"2016-03-29T15:00:00+00:00","dateModified":"2023-06-23T12:20:40+00:00","breadcrumb":{"@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#primaryimage","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2022\/11\/couchbase-nosql-dbaas.png","width":1800,"height":630},{"@type":"BreadcrumbList","@id":"https:\/\/www.couchbase.com\/blog\/load-csv-data-into-couchbase-using-apache-spark\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.couchbase.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Load CSV Data into Couchbase using Apache Spark"}]},{"@type":"WebSite","@id":"https:\/\/www.couchbase.com\/blog\/#website","url":"https:\/\/www.couchbase.com\/blog\/","name":"Blog do Couchbase","description":"Couchbase, o banco de dados NoSQL","publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Organization","@id":"https:\/\/www.couchbase.com\/blog\/#organization","name":"Blog do Couchbase","url":"https:\/\/www.couchbase.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","width":218,"height":34,"caption":"The Couchbase Blog"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/bb545ebe83bb2d12f91095811d0a72e1","name":"Nic Raboy, defensor dos desenvolvedores, Couchbase","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/image\/8863514d8bed0cf6080f23db40e00354","url":"https:\/\/secure.gravatar.com\/avatar\/bedeb68368d4681aca4c74fe5f697f0c423b80d498ec50fd915ba018b72c101f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/bedeb68368d4681aca4c74fe5f697f0c423b80d498ec50fd915ba018b72c101f?s=96&d=mm&r=g","caption":"Nic Raboy, Developer Advocate, Couchbase"},"description":"Nic Raboy \u00e9 um defensor das modernas tecnologias de desenvolvimento m\u00f3vel e da Web. Ele tem experi\u00eancia em Java, JavaScript, Golang e uma variedade de estruturas, como Angular, NativeScript e Apache Cordova. Nic escreve sobre suas experi\u00eancias de desenvolvimento relacionadas a tornar o desenvolvimento m\u00f3vel e da Web mais f\u00e1cil de entender.","sameAs":["https:\/\/www.thepolyglotdeveloper.com","https:\/\/www.facebook.com\/thepolyglotdeveloper","https:\/\/x.com\/nraboy"],"url":"https:\/\/www.couchbase.com\/blog\/pt\/author\/nic-raboy-2\/"}]}},"authors":[{"term_id":9032,"user_id":63,"is_guest":0,"slug":"nic-raboy-2","display_name":"Nic Raboy, Developer Advocate, Couchbase","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/bedeb68368d4681aca4c74fe5f697f0c423b80d498ec50fd915ba018b72c101f?s=96&d=mm&r=g","first_name":"Nic","last_name":"Raboy","user_url":"https:\/\/www.thepolyglotdeveloper.com","author_category":"","description":"Nic Raboy \u00e9 um defensor das modernas tecnologias de desenvolvimento m\u00f3vel e da Web. Ele tem experi\u00eancia em Java, JavaScript, Golang e uma variedade de estruturas, como Angular, NativeScript e Apache Cordova. Nic escreve sobre suas experi\u00eancias de desenvolvimento relacionadas a tornar o desenvolvimento m\u00f3vel e da Web mais f\u00e1cil de entender."}],"_links":{"self":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/2208","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/users\/63"}],"replies":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/comments?post=2208"}],"version-history":[{"count":0,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/2208\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media\/13873"}],"wp:attachment":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media?parent=2208"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/categories?post=2208"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/tags?post=2208"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/ppma_author?post=2208"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}