{"id":16518,"date":"2024-10-29T05:50:01","date_gmt":"2024-10-29T12:50:01","guid":{"rendered":"https:\/\/www.couchbase.com\/blog\/?p=16518"},"modified":"2025-06-16T10:43:45","modified_gmt":"2025-06-16T17:43:45","slug":"supercharge-rag-couchbase-vector-unstructured-io","status":"publish","type":"post","link":"https:\/\/www.couchbase.com\/blog\/pt\/supercharge-rag-couchbase-vector-unstructured-io\/","title":{"rendered":"Turbine seu aplicativo RAG com o Couchbase Vector Search e o Unstructured.io"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">Hoje, temos o prazer de anunciar o lan\u00e7amento do Couchbase e do <a href=\"https:\/\/unstructured.io\">Unstructured.io<\/a> que agiliza o processo de ingest\u00e3o de dados n\u00e3o estruturados em seu pipeline RAG criado com base no Couchbase como o armazenamento de vetores. Com esse conector, agora voc\u00ea pode converter documentos n\u00e3o estruturados e pouco estruturados em arquivos JSON e prepar\u00e1-los para o consumo por aplicativos RAG por meio da gera\u00e7\u00e3o de embeddings vetoriais em apenas algumas linhas de c\u00f3digo.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Por que a ingest\u00e3o de dados n\u00e3o estruturados \u00e9 importante para os desenvolvedores?\u00a0<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Uma quantidade esmagadora de dados corporativos n\u00e3o \u00e9 estruturada e \u00e9 improv\u00e1vel que isso mude em um futuro pr\u00f3ximo. A presen\u00e7a de dados em formatos n\u00e3o estruturados tem implica\u00e7\u00f5es para os desenvolvedores que v\u00e3o al\u00e9m do tempo e do custo. Isso significa que a tomada de decis\u00f5es nas empresas \u00e9 baseada na quantidade limitada de dados estruturados e consum\u00edveis, em vez de todos os dados que residem neles. Al\u00e9m disso, isso significa que uma grande variedade de fluxos de trabalho empresariais (internos e voltados para o cliente) exige interven\u00e7\u00e3o manual, o que os torna mais caros, mais lentos e mais propensos a erros. \u00c9 prov\u00e1vel que esse problema se torne mais grave \u00e0 medida que as pegadas de dados corporativos aumentem.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Como os dados n\u00e3o estruturados s\u00e3o aproveitados pelos desenvolvedores?<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Uma das maneiras mais eficazes de aproveitar dados n\u00e3o estruturados \u00e9 ingeri-los em um pipeline RAG, tornando os dados dispon\u00edveis para recupera\u00e7\u00e3o por meio de <a href=\"https:\/\/www.couchbase.com\/blog\/pt\/what-is-vector-search\/\">pesquisas de vetores<\/a>. Isso tem uma ampla gama de aplica\u00e7\u00f5es em v\u00e1rios setores. Os aplicativos RAG podem ser aproveitados para aumentar a efici\u00eancia operacional, facilitando o acesso a documentos mais relevantes, o que resulta em tempos de resolu\u00e7\u00e3o mais r\u00e1pidos e custos mais baixos. Alguns dos casos de uso que podem ser resolvidos s\u00e3o:\u00a0<\/span><\/p>\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Permitir que as equipes de suporte ao cliente de todos os setores encontrem documentos relevantes para a solu\u00e7\u00e3o de problemas<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Permitir que os profissionais da \u00e1rea m\u00e9dica extraiam artigos relevantes e registros de pacientes armazenados em bancos de dados de documentos para auxiliar no diagn\u00f3stico e no planejamento do tratamento<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Sistemas de recomenda\u00e7\u00e3o que aproveitam os dados do cliente para sugerir o produto mais adequado<\/span><\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<div id=\"attachment_16519\" style=\"width: 910px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-16519\" class=\"wp-image-16519 size-large\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8-1024x506.png\" alt=\"\" width=\"900\" height=\"445\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8-1024x506.png 1024w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8-300x148.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8-768x380.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8-1536x760.png 1536w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8-1320x653.png 1320w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image2-8.png 1990w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><\/a><p id=\"caption-attachment-16519\" class=\"wp-caption-text\">Figura 1. Pipeline de ingest\u00e3o de dados n\u00e3o estruturados com unstructured.io e Capella VectorDB<\/p><\/div>\n<h2><span style=\"font-weight: 400;\">Qual \u00e9 a maneira atual de processar dados n\u00e3o estruturados?<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">A maneira atual de realizar isso (ingest\u00e3o de dados n\u00e3o estruturados para aplicativos RAG) com o Couchbase Capella exigiria que os desenvolvedores escrevessem aplicativos para se conectar a um extrator de dados n\u00e3o estruturados, analisassem sua sa\u00edda, dividissem-na em partes e a enviassem para um modelo de incorpora\u00e7\u00e3o para gerar vetores que, em seguida, teriam de ser enviados para um banco de dados de vetores no Couchbase Capella.\u00a0<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Como o nosso conector aprimora o m\u00e9todo atual de ingest\u00e3o de dados n\u00e3o estruturados?\u00a0<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Os conectores unstructured.io - Couchbase simplificam o processo de conex\u00e3o dos dois elementos prim\u00e1rios do pipeline de ingest\u00e3o mencionados anteriormente, facilitando:<\/span><\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li><span style=\"font-weight: 400;\">Converta dados de texto n\u00e3o estruturados em documentos JSON estruturados<\/span><\/li>\n<li><span style=\"font-weight: 400;\">Gerar os vetores correspondentes<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> Insira-os no Couchbase Capella<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">O <\/span><a href=\"https:\/\/docs.unstructured.io\/api-reference\/ingest\/source-connectors\/couchbase\"><span style=\"font-weight: 400;\">conector de fonte<\/span><\/a><span style=\"font-weight: 400;\"> ajuda a buscar dados do Couchbase Capella antes de serem divididos em peda\u00e7os (e, opcionalmente, vetorizados), enquanto o <\/span><a href=\"https:\/\/docs.unstructured.io\/api-reference\/ingest\/destination-connector\/couchbase\"><span style=\"font-weight: 400;\">conector de destino<\/span><\/a><span style=\"font-weight: 400;\"> ajuda a ingerir dados processados do unstructured.io no Couchbase Capella.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">O Capella \u00e9 um banco de dados vetorial de alto desempenho que permite configurar, indexar e consultar rapidamente um banco de dados vetorial. Veja como voc\u00ea pode aproveitar os conectores para come\u00e7ar a processar seus documentos com apenas algumas linhas de c\u00f3digo.\u00a0<\/span><\/p>\n<h3><span style=\"font-weight: 400;\">Etapa 1: Pr\u00e9-requisitos<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Antes de come\u00e7ar a usar o conector, voc\u00ea precisar\u00e1 atender a alguns pr\u00e9-requisitos. Voc\u00ea precisar\u00e1 de:<\/span><\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Um <\/span><a href=\"https:\/\/docs.unstructured.io\/api-reference\/api-services\/saas-api-development-guide\"><span style=\"font-weight: 400;\">Chave de API de unstructured.io<\/span><\/a><span style=\"font-weight: 400;\"> que pode ser obtido com a cria\u00e7\u00e3o de uma conta no site unstructured.io<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Uma conta Capella ativa com um cluster e um banco de dados configurados, bem como escopo e cole\u00e7\u00f5es definidos no banco de dados<\/span>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><a href=\"https:\/\/docs.couchbase.com\/cloud\/get-started\/create-account.html\"><span style=\"font-weight: 400;\">Criar uma conta gratuita e um banco de dados<\/span><\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><a href=\"https:\/\/docs.couchbase.com\/cloud\/clusters\/data-service\/about-buckets-scopes-collections.html\"><span style=\"font-weight: 400;\">Configurar uma cole\u00e7\u00e3o<\/span><\/a><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/docs.couchbase.com\/cloud\/clusters\/allow-ip-address.html\"><span style=\"font-weight: 400;\">Para configurar o cluster para usar seu endere\u00e7o IP<\/span><\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/docs.couchbase.com\/cloud\/clusters\/manage-database-users.html\"><span style=\"font-weight: 400;\">Para configurar as credenciais do banco de dados<\/span><\/a><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h3><span style=\"font-weight: 400;\">Etapa 2: Defina a origem de seus dados n\u00e3o estruturados e o destino<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Depois que os pr\u00e9-requisitos estiverem estabelecidos, voc\u00ea poder\u00e1 definir a origem dos documentos que deseja processar e usar como entradas para o pipeline RAG de produ\u00e7\u00e3o. O conector oferece suporte \u00e0 ingest\u00e3o de v\u00e1rias fontes: Couchbase, diret\u00f3rios locais, buckets S3 e outros servi\u00e7os de armazenamento. Unstructured.io <\/span><a href=\"https:\/\/docs.unstructured.io\/open-source\/introduction\/supported-file-types\"><span style=\"font-weight: 400;\">suporta uma ampla variedade de formatos de documentos n\u00e3o estruturados<\/span><\/a><span style=\"font-weight: 400;\"> incluindo PDFs, arquivos de imagem (JPEG, PNG), documentos de texto (DOCX, DOC), e-mails, planilhas e formatos de arquivo de apresenta\u00e7\u00e3o (PPT).\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Da mesma forma, defina o local intermedi\u00e1rio que ser\u00e1 usado para armazenar a sa\u00edda gerada pelo unstructured.io antes que o texto seja vetorizado. Pode ser uma cole\u00e7\u00e3o em um banco de dados escal\u00e1vel e de alto desempenho no Couchbase ou em qualquer outro servi\u00e7o de armazenamento que voc\u00ea esteja usando atualmente. Em seguida, voc\u00ea pode definir a cole\u00e7\u00e3o do banco de dados Vector no Couchbase, onde os documentos JSON que cont\u00eam o texto original, os metadados e o vetor de incorpora\u00e7\u00e3o correspondente ser\u00e3o armazenados.\u00a0<\/span><\/p>\n<h3><span style=\"font-weight: 400;\">Etapa 3: Defina sua estrat\u00e9gia de fragmenta\u00e7\u00e3o e selecione um modelo de incorpora\u00e7\u00e3o para a gera\u00e7\u00e3o de incorpora\u00e7\u00e3o de vetores<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Depois que os locais de entrada e sa\u00edda forem definidos, voc\u00ea poder\u00e1 <\/span><a href=\"https:\/\/docs.unstructured.io\/api-reference\/api-services\/chunking\"><span style=\"font-weight: 400;\">selecionar uma das estrat\u00e9gias de fragmenta\u00e7\u00e3o<\/span><\/a><span style=\"font-weight: 400;\"> suportado por unstructured.io e <\/span><a href=\"https:\/\/docs.unstructured.io\/open-source\/core-functionality\/embedding\"><span style=\"font-weight: 400;\">escolher um modelo de incorpora\u00e7\u00e3o<\/span><\/a><span style=\"font-weight: 400;\"> de sua escolha. O Unstructured.io suporta modelos de incorpora\u00e7\u00e3o de v\u00e1rios provedores, como Huggingface, OpenAI e Bedrock, entre outros.\u00a0<\/span><\/p>\n<h3><span style=\"font-weight: 400;\">Etapa 4: Execute seu aplicativo!<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Teste seu aplicativo. Voc\u00ea dever\u00e1 ser capaz de visualizar os novos documentos JSON estruturados inseridos em sua cole\u00e7\u00e3o Capella ap\u00f3s todas as etapas de processamento executadas via unstructured.io. Abaixo est\u00e1 um exemplo dos arquivos que convertemos de um PDF para JSON e ingerimos em uma cole\u00e7\u00e3o do Couchbase Capella. Para obter um guia passo a passo, juntamente com o c\u00f3digo sobre como fazer isso, confira nosso <\/span><a href=\"https:\/\/docs.unstructured.io\/api-reference\/ingest\/destination-connector\/couchbase\"><span style=\"font-weight: 400;\">tutorial completo aqui<\/span><\/a><span style=\"font-weight: 400;\">. Voc\u00ea tamb\u00e9m pode usar nosso notebook para acompanhar o processo.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Exemplo de documento n\u00e3o estruturado:<\/span><\/p>\n<p><a href=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image3-5.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-16520\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image3-5.png\" alt=\"\" width=\"596\" height=\"756\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image3-5.png 596w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image3-5-237x300.png 237w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image3-5-300x381.png 300w\" sizes=\"auto, (max-width: 596px) 100vw, 596px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Sa\u00edda de unstructured.io:<\/span><\/p>\n<p><a href=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image4-6.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-16521\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image4-6.png\" alt=\"\" width=\"836\" height=\"551\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image4-6.png 836w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image4-6-300x198.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image4-6-768x506.png 768w\" sizes=\"auto, (max-width: 836px) 100vw, 836px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Documentos ingeridos no Capella:<\/span><\/p>\n<p><a href=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-16522\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6-1024x323.png\" alt=\"\" width=\"900\" height=\"284\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6-1024x323.png 1024w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6-300x95.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6-768x242.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6-1536x485.png 1536w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6-1320x417.png 1320w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/image5-6.png 1999w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Agora, voc\u00ea pode executar seu aplicativo para processar documentos de texto n\u00e3o estruturados, identificar os componentes, extra\u00ed-los como documentos JSON e gerar embeddings vetoriais antes de inseri-los em sua cole\u00e7\u00e3o Capella.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Recursos<\/span><\/h2>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/rag-applications-with-vector-search-and-couchbase\/\"><span style=\"font-weight: 400;\">Cria\u00e7\u00e3o de aplicativos RAG de ponta a ponta com o Couchbase Vector Search<\/span><\/a><\/li>\n<li><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/couchbase-bedrock-rag-applications\/\"><span style=\"font-weight: 400;\">Crie aplicativos RAG de alto desempenho usando o Couchbase Vector Search e o Amazon Bedrock<\/span><\/a><\/li>\n<li><a href=\"https:\/\/info.couchbase.com\/webinar_Coding_With_AI_Vector_Search_RAG_2024M4_LP.html\"><span style=\"font-weight: 400;\">Codifica\u00e7\u00e3o com IA: pesquisa vetorial e RAG<\/span><\/a><span style=\"font-weight: 400;\">\u00a0(Webcast)<\/span><\/li>\n<li><a href=\"https:\/\/cloud.couchbase.com\/sign-up\"><span style=\"font-weight: 400;\">Experimente o Couchbase Capella gratuitamente hoje mesmo<\/span><\/a><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><br style=\"font-weight: 400;\" \/><br style=\"font-weight: 400;\" \/><\/p>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>Today we\u2019re excited to announce the launch of the Couchbase and Unstructured.io connector which streamlines the process of ingesting unstructured data into your RAG pipeline built on top of Couchbase as the vector store. Using this connector, you can now [&hellip;]<\/p>","protected":false},"author":85541,"featured_media":16557,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[1814,3917,2242,2225,9973,9921,9937],"tags":[10049,9924,10048],"ppma_author":[10050,10051],"class_list":["post-16518","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-application-design","category-company","category-connectors","category-cloud","category-generative-ai-genai","category-partners","category-vector-search","tag-data-prep","tag-rag-retrieval-augmented-generation","tag-unstructured-io"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.1 (Yoast SEO v26.1.1) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io - The Couchbase Blog<\/title>\n<meta name=\"description\" content=\"Announcing the Couchbase and Unstructured.io connector\u2014quickly convert unstructured data into JSON and vector embeddings for seamless integration into your RAG pipeline.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.couchbase.com\/blog\/pt\/supercharge-rag-couchbase-vector-unstructured-io\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io\" \/>\n<meta property=\"og:description\" content=\"Announcing the Couchbase and Unstructured.io connector\u2014quickly convert unstructured data into JSON and vector embeddings for seamless integration into your RAG pipeline.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.couchbase.com\/blog\/pt\/supercharge-rag-couchbase-vector-unstructured-io\/\" \/>\n<meta property=\"og:site_name\" content=\"The Couchbase Blog\" \/>\n<meta property=\"article:published_time\" content=\"2024-10-29T12:50:01+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-16T17:43:45+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png\" \/>\n\t<meta property=\"og:image:width\" content=\"2400\" \/>\n\t<meta property=\"og:image:height\" content=\"1256\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Vishwa Yeruru - Sr. Product Manager, Maria Khalusova - Staff Developer Advocate, Unstructured.io\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Vishwa Yeruru - Sr. Product Manager\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/\"},\"author\":{\"name\":\"Vishwa Yeruru - Sr. Product Manager\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/0670782b8878056390b6a256511c8858\"},\"headline\":\"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io\",\"datePublished\":\"2024-10-29T12:50:01+00:00\",\"dateModified\":\"2025-06-16T17:43:45+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/\"},\"wordCount\":970,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png\",\"keywords\":[\"data prep\",\"RAG retrieval-augmented generation\",\"unstructured.io\"],\"articleSection\":[\"Application Design\",\"Company\",\"Connectors\",\"Couchbase Capella\",\"Generative AI (GenAI)\",\"Partners\",\"Vector Search\"],\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/\",\"url\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/\",\"name\":\"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io - The Couchbase Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png\",\"datePublished\":\"2024-10-29T12:50:01+00:00\",\"dateModified\":\"2025-06-16T17:43:45+00:00\",\"description\":\"Announcing the Couchbase and Unstructured.io connector\u2014quickly convert unstructured data into JSON and vector embeddings for seamless integration into your RAG pipeline.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png\",\"width\":2400,\"height\":1256},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.couchbase.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#website\",\"url\":\"https:\/\/www.couchbase.com\/blog\/\",\"name\":\"The Couchbase Blog\",\"description\":\"Couchbase, the NoSQL Database\",\"publisher\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\",\"name\":\"The Couchbase Blog\",\"url\":\"https:\/\/www.couchbase.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png\",\"width\":218,\"height\":34,\"caption\":\"The Couchbase Blog\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/0670782b8878056390b6a256511c8858\",\"name\":\"Vishwa Yeruru - Sr. Product Manager\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/image\/a7609300b8d22762330c56f24bc36684\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/vishwa-yeruru.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/vishwa-yeruru.png\",\"caption\":\"Vishwa Yeruru - Sr. Product Manager\"},\"url\":\"https:\/\/www.couchbase.com\/blog\/pt\/author\/vishwayeruru\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io - The Couchbase Blog","description":"Anunciando o conector Couchbase e Unstructured.io - converta rapidamente dados n\u00e3o estruturados em JSON e incorpora\u00e7\u00f5es vetoriais para uma integra\u00e7\u00e3o perfeita em seu pipeline RAG.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.couchbase.com\/blog\/pt\/supercharge-rag-couchbase-vector-unstructured-io\/","og_locale":"pt_BR","og_type":"article","og_title":"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io","og_description":"Announcing the Couchbase and Unstructured.io connector\u2014quickly convert unstructured data into JSON and vector embeddings for seamless integration into your RAG pipeline.","og_url":"https:\/\/www.couchbase.com\/blog\/pt\/supercharge-rag-couchbase-vector-unstructured-io\/","og_site_name":"The Couchbase Blog","article_published_time":"2024-10-29T12:50:01+00:00","article_modified_time":"2025-06-16T17:43:45+00:00","og_image":[{"width":2400,"height":1256,"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png","type":"image\/png"}],"author":"Vishwa Yeruru - Sr. Product Manager, Maria Khalusova - Staff Developer Advocate, Unstructured.io","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Vishwa Yeruru - Sr. Product Manager","Est. reading time":"6 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#article","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/"},"author":{"name":"Vishwa Yeruru - Sr. Product Manager","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/0670782b8878056390b6a256511c8858"},"headline":"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io","datePublished":"2024-10-29T12:50:01+00:00","dateModified":"2025-06-16T17:43:45+00:00","mainEntityOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/"},"wordCount":970,"commentCount":0,"publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png","keywords":["data prep","RAG retrieval-augmented generation","unstructured.io"],"articleSection":["Application Design","Company","Connectors","Couchbase Capella","Generative AI (GenAI)","Partners","Vector Search"],"inLanguage":"pt-BR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/","url":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/","name":"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io - The Couchbase Blog","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png","datePublished":"2024-10-29T12:50:01+00:00","dateModified":"2025-06-16T17:43:45+00:00","description":"Anunciando o conector Couchbase e Unstructured.io - converta rapidamente dados n\u00e3o estruturados em JSON e incorpora\u00e7\u00f5es vetoriais para uma integra\u00e7\u00e3o perfeita em seu pipeline RAG.","breadcrumb":{"@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#primaryimage","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/Unstructured-data-ingestion-pipeline-Diagram_3.png","width":2400,"height":1256},{"@type":"BreadcrumbList","@id":"https:\/\/www.couchbase.com\/blog\/supercharge-rag-couchbase-vector-unstructured-io\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.couchbase.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Supercharge Your RAG application With Couchbase Vector Search and Unstructured.io"}]},{"@type":"WebSite","@id":"https:\/\/www.couchbase.com\/blog\/#website","url":"https:\/\/www.couchbase.com\/blog\/","name":"Blog do Couchbase","description":"Couchbase, o banco de dados NoSQL","publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Organization","@id":"https:\/\/www.couchbase.com\/blog\/#organization","name":"Blog do Couchbase","url":"https:\/\/www.couchbase.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","width":218,"height":34,"caption":"The Couchbase Blog"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/0670782b8878056390b6a256511c8858","name":"Vishwa Yeruru - Gerente s\u00eanior de produtos","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/image\/a7609300b8d22762330c56f24bc36684","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/vishwa-yeruru.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/vishwa-yeruru.png","caption":"Vishwa Yeruru - Sr. Product Manager"},"url":"https:\/\/www.couchbase.com\/blog\/pt\/author\/vishwayeruru\/"}]}},"authors":[{"term_id":10050,"user_id":85541,"is_guest":0,"slug":"vishwayeruru","display_name":"Vishwa Yeruru - Sr. Product Manager","avatar_url":{"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/vishwa-yeruru.png","url2x":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/vishwa-yeruru.png"},"author_category":"","last_name":"Yeruru - Sr. Product Manager","first_name":"Vishwa","job_title":"Sr. Product Manager","user_url":"","description":""},{"term_id":10051,"user_id":85542,"is_guest":0,"slug":"mariakhalusova","display_name":"Maria Khalusova - Staff Developer Advocate, Unstructured.io","avatar_url":{"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/maria-khalusova.jpeg","url2x":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/10\/maria-khalusova.jpeg"},"author_category":"","last_name":"Khalusova - Staff Developer Advocate, Unstructured.io","first_name":"Maria","job_title":"","user_url":"","description":""}],"_links":{"self":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/16518","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/users\/85541"}],"replies":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/comments?post=16518"}],"version-history":[{"count":0,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/16518\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media\/16557"}],"wp:attachment":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media?parent=16518"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/categories?post=16518"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/tags?post=16518"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/ppma_author?post=16518"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}