{"id":16002,"date":"2024-07-04T09:49:03","date_gmt":"2024-07-04T16:49:03","guid":{"rendered":"https:\/\/www.couchbase.com\/blog\/?p=16002"},"modified":"2025-06-13T16:36:50","modified_gmt":"2025-06-13T23:36:50","slug":"accelerate-rag-ai-couchbase-nvidia","status":"publish","type":"post","link":"https:\/\/www.couchbase.com\/blog\/pt\/accelerate-rag-ai-couchbase-nvidia\/","title":{"rendered":"Acelere o aplicativo de IA RAG com base em Couchbase com NVIDIA NIM\/NeMo e LangChain"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">Hoje, temos o prazer de anunciar nossa nova integra\u00e7\u00e3o com o NVIDIA NIM\/NeMo. Nesta postagem do blog, apresentamos um conceito de solu\u00e7\u00e3o de um chatbot interativo baseado em um <em>Recupera\u00e7\u00e3o Gera\u00e7\u00e3o aumentada<\/em> (RAG)<\/span><span style=\"font-weight: 400;\">\u00a0<\/span><span style=\"font-weight: 400;\">com o Couchbase Capella como um banco de dados vetorial. As fases de recupera\u00e7\u00e3o e gera\u00e7\u00e3o do pipeline RAG s\u00e3o aceleradas pelo NVIDIA NIM\/NeMo com <\/span><span style=\"font-weight: 400;\">apenas algumas linhas de c\u00f3digo.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Empresas de v\u00e1rios setores verticais se esfor\u00e7am para oferecer o melhor atendimento aos seus clientes. Para isso, elas est\u00e3o equipando seus funcion\u00e1rios da linha de frente, como enfermeiros de pronto-socorro, vendedores de lojas e representantes de help desk, com chatbots interativos de perguntas e respostas (QA) com tecnologia de IA para recuperar rapidamente informa\u00e7\u00f5es relevantes e atualizadas. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Os chatbots geralmente s\u00e3o baseados em <\/span><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/an-overview-of-retrieval-augmented-generation\/\"><span style=\"font-weight: 400;\">RAG<\/span><\/a><span style=\"font-weight: 400;\">O LLM \u00e9 uma estrutura de IA usada para recuperar fatos da base de conhecimento da empresa para fundamentar as respostas do LLM nas informa\u00e7\u00f5es mais precisas e recentes. Ela envolve tr\u00eas fases distintas, que come\u00e7am com a recupera\u00e7\u00e3o do contexto mais relevante usando <\/span><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/products\/vector-search\/\"><span style=\"font-weight: 400;\">pesquisa vetorial<\/span><\/a><span style=\"font-weight: 400;\">A consulta do usu\u00e1rio pode ser aumentada com o contexto e, por fim, gerar respostas relevantes usando um LLM.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">O problema com os pipelines de RAG existentes \u00e9 que as chamadas para o servi\u00e7o de incorpora\u00e7\u00e3o na fase de recupera\u00e7\u00e3o para converter os prompts do usu\u00e1rio em vetores podem adicionar uma lat\u00eancia significativa, tornando mais lentos os aplicativos que exigem interatividade. A vetoriza\u00e7\u00e3o de um corpus de documentos que consiste em milh\u00f5es de PDFs, documentos e outras bases de conhecimento pode levar muito tempo para ser vetorizada, aumentando a probabilidade de uso de dados obsoletos para o RAG. Al\u00e9m disso, os usu\u00e1rios acham dif\u00edcil acelerar a infer\u00eancia (tokens\/seg.) de forma econ\u00f4mica para reduzir o tempo de resposta dos aplicativos de chatbot.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">A Figura 1 mostra uma pilha de desempenho que permitir\u00e1 que voc\u00ea desenvolva facilmente um <\/span><span style=\"font-weight: 400;\">chatbot interativo de atendimento ao cliente. Ele consiste na estrutura de aplicativo StreamLit, LangChain para orquestra\u00e7\u00e3o, Couchbase Capella para indexa\u00e7\u00e3o e pesquisa de vetores e NVIDIA NIM\/NeMo para acelerar os est\u00e1gios de recupera\u00e7\u00e3o e gera\u00e7\u00e3o.<\/span><\/p>\n<div id=\"attachment_16003\" style=\"width: 910px\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2024\/07\/image1-1.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-16003\" class=\"wp-image-16003 size-large\" style=\"border: solid black 1px;\" src=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2024\/07\/image1-1-1024x518.png\" alt=\"NVIDIA NIM\/NeMo and LangChain\" width=\"900\" height=\"455\" srcset=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1-1024x518.png 1024w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1-300x152.png 300w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1-768x389.png 768w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1-1320x668.png 1320w, https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png 1345w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><\/a><p id=\"caption-attachment-16003\" class=\"wp-caption-text\">Figura 1: Arquitetura conceitual de um chatbot de controle de qualidade criado com o Capella e o NVIDIA NIM\/NeMo<\/p><\/div>\n<p><span style=\"font-weight: 400;\">O Couchbase Capella, um banco de dados como servi\u00e7o (DBaaS) de alto desempenho, permite que voc\u00ea comece rapidamente a armazenar, indexar e consultar dados operacionais, vetoriais, de texto, de s\u00e9ries temporais e geoespaciais, aproveitando a flexibilidade do JSON. Voc\u00ea pode integrar facilmente o Capella para <\/span><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/products\/vector-search\/\"><span style=\"font-weight: 400;\">pesquisa vetorial<\/span><\/a><span style=\"font-weight: 400;\"> ou pesquisa sem\u00e2ntica sem a necessidade de um banco de dados vetorial separado, integrando uma estrutura de orquestra\u00e7\u00e3o, como o <\/span><a href=\"https:\/\/www.langchain.com\/\"><span style=\"font-weight: 400;\">LangChain<\/span><\/a><span style=\"font-weight: 400;\"> ou <\/span><a href=\"https:\/\/www.llamaindex.ai\/\"><span style=\"font-weight: 400;\">LlamaIndex<\/span><\/a><span style=\"font-weight: 400;\"> em seu pipeline RAG de produ\u00e7\u00e3o. Ele oferece os seguintes recursos <\/span><a href=\"https:\/\/www.couchbase.com\/blog\/pt\/hybrid-search\/\"><span style=\"font-weight: 400;\">pesquisa h\u00edbrida<\/span><\/a><span style=\"font-weight: 400;\"> que combina a pesquisa vetorial com a pesquisa tradicional para melhorar significativamente o desempenho da pesquisa. Al\u00e9m disso, voc\u00ea pode estender a pesquisa vetorial para a borda usando o Couchbase mobile para casos de uso de IA de borda.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Depois de configurar o Capella Vector Search, voc\u00ea pode continuar a escolher um modelo de desempenho na se\u00e7\u00e3o <\/span><a href=\"https:\/\/build.nvidia.com\/explore\/discover\"><span style=\"font-weight: 400;\">Cat\u00e1logo de APIs da NVIDIA<\/span><\/a><span style=\"font-weight: 400;\">que oferece um amplo espectro de modelos de funda\u00e7\u00e3o que abrangem modelos de c\u00f3digo aberto, funda\u00e7\u00e3o NVIDIA AI e modelos personalizados, otimizados para oferecer o melhor desempenho na infraestrutura acelerada da NVIDIA. Esses modelos s\u00e3o implantados como <\/span><a href=\"https:\/\/developer.nvidia.com\/blog\/nvidia-nim-offers-optimized-inference-microservices-for-deploying-ai-models-at-scale\/?ref=blog.langchain.dev\"><span style=\"font-weight: 400;\">NVIDIA NIM<\/span><\/a><span style=\"font-weight: 400;\"> no local ou na nuvem usando cont\u00eaineres pr\u00e9-constru\u00eddos f\u00e1ceis de usar por meio de um \u00fanico comando. NeMo Retriever, <\/span><span style=\"font-weight: 400;\">uma parte do NVIDIA NeMo,<\/span><span style=\"font-weight: 400;\"> oferece recupera\u00e7\u00e3o de informa\u00e7\u00f5es com a menor lat\u00eancia, a maior taxa de transfer\u00eancia e a m\u00e1xima privacidade de dados.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">O chatbot que desenvolvemos usando a pilha mencionada acima permitir\u00e1 que voc\u00ea <\/span><span style=\"font-weight: 400;\">carregue seus documentos PDF e fa\u00e7a perguntas de forma interativa. Ele usa <em>NV-QA-Embed<\/em>um modelo de incorpora\u00e7\u00e3o de texto acelerado por GPU usado para recupera\u00e7\u00e3o de perguntas e respostas, e <\/span><a href=\"https:\/\/build.nvidia.com\/meta\/llama3-70b\"><span style=\"font-weight: 400;\">Lhama 3 - 70B<\/span><\/a><span style=\"font-weight: 400;\">que \u00e9 empacotado como um NIM e acelerado na infraestrutura da NVIDIA. O <\/span><a href=\"https:\/\/python.langchain.com\/v0.2\/docs\/integrations\/chat\/nvidia_ai_endpoints\/\"><span style=\"font-weight: 400;\">langchain-nvidia-ai-endpoints<\/span><\/a><span style=\"font-weight: 400;\"> cont\u00e9m integra\u00e7\u00f5es LangChain para a cria\u00e7\u00e3o de aplicativos com modelos no NVIDIA NIM. <\/span><span style=\"font-weight: 400;\">Embora tenhamos usado endpoints hospedados pela NVIDIA para fins de prototipagem, recomendamos que voc\u00ea considere o uso do NIM auto-hospedado, consultando o <\/span><a href=\"https:\/\/docs.nvidia.com\/nim\/large-language-models\/latest\/introduction.html?nvid=nv-int-tblg-432774\"><span style=\"font-weight: 400;\">Documenta\u00e7\u00e3o do NIM<\/span><\/a><span style=\"font-weight: 400;\"> para implementa\u00e7\u00f5es de produ\u00e7\u00e3o.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Voc\u00ea pode usar essa solu\u00e7\u00e3o para dar suporte a casos de uso que exigem recupera\u00e7\u00e3o r\u00e1pida de informa\u00e7\u00f5es, como:<\/span><\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Permitir que os enfermeiros do pronto-socorro acelerem a triagem por meio do acesso r\u00e1pido a informa\u00e7\u00f5es relevantes sobre sa\u00fade para aliviar a superlota\u00e7\u00e3o, as longas esperas por atendimento e a baixa satisfa\u00e7\u00e3o dos pacientes.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Ajudar os agentes de atendimento ao cliente a descobrir rapidamente o conhecimento relevante por meio de um chatbot de base de conhecimento interno para reduzir o tempo de espera das chamadas. Isso n\u00e3o apenas ajudar\u00e1 a aumentar as pontua\u00e7\u00f5es de CSAT, mas tamb\u00e9m permitir\u00e1 o gerenciamento de grandes volumes de chamadas.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Ajudar os vendedores em uma loja a descobrir e recomendar rapidamente itens em um cat\u00e1logo de produtos semelhantes \u00e0 imagem ou \u00e0 descri\u00e7\u00e3o do item solicitado por um comprador, mas que est\u00e1 atualmente fora de estoque (falta de estoque), para melhorar a experi\u00eancia de compra.<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Em conclus\u00e3o, voc\u00ea pode desenvolver um aplicativo GenAI interativo, como um chatbot, com respostas fundamentadas e relevantes usando o RAG baseado no Couchbase Capella e aceler\u00e1-lo usando o NVIDIA NIM\/NeMo. <\/span><span style=\"font-weight: 400;\">Essa combina\u00e7\u00e3o oferece escalabilidade, confiabilidade e facilidade de uso. Al\u00e9m de ser implementado junto com o Capella para uma experi\u00eancia de DBaaS, o NIM\/NeMo pode ser implementado com o Couchbase local ou autogerenciado em nuvens p\u00fablicas dentro de sua VPC para casos de uso que tenham requisitos mais rigorosos de seguran\u00e7a e privacidade. Al\u00e9m disso, voc\u00ea pode usar <\/span><a href=\"https:\/\/developer.nvidia.com\/blog\/building-safer-llm-apps-with-langchain-templates-and-nvidia-nemo-guardrails\/\"><span style=\"font-weight: 400;\">Guardrails NeMo<\/span><\/a><span style=\"font-weight: 400;\"> para controlar a sa\u00edda de seu LLM em rela\u00e7\u00e3o ao conte\u00fado que sua empresa considera question\u00e1vel. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Os detalhes do aplicativo de chatbot podem ser encontrados no arquivo Couchbase <\/span><a href=\"https:\/\/github.com\/couchbase-examples\/couchbase-tutorials\/blob\/141424e68c18233c4ed47cc6321d38540ab4ca54\/tutorial\/markdown\/python\/nvidia-nim-llama3-pdf-chat\/nvidia-nim-llama3-pdf-chat.md\"><span style=\"font-weight: 400;\">Portal do desenvolvedor<\/span><\/a><span style=\"font-weight: 400;\"> juntamente com o <\/span><a href=\"https:\/\/github.com\/couchbase-examples\/nvidia-rag-demo\/blob\/main\/chat_with_pdf.py\"><span style=\"font-weight: 400;\">c\u00f3digo completo<\/span><\/a><span style=\"font-weight: 400;\">. Por favor, inscreva-se para um <\/span><a href=\"https:\/\/cloud.couchbase.com\/sign-up\"><span style=\"font-weight: 400;\">Conta de avalia\u00e7\u00e3o Capella<\/span><\/a><span style=\"font-weight: 400;\">, gratuito <\/span><a href=\"https:\/\/build.nvidia.com\/explore\/discover?signin_corporate=false&amp;signin=false\"><span style=\"font-weight: 400;\">Conta NVIDIA NIM<\/span><\/a><span style=\"font-weight: 400;\">e comece a desenvolver seu aplicativo GenAI.\u00a0<\/span><\/p>\n<p><br style=\"font-weight: 400;\" \/><br style=\"font-weight: 400;\" \/><\/p>","protected":false},"excerpt":{"rendered":"<p>Today, we&#8217;re excited to announce our new integration with NVIDIA NIM\/NeMo. In this blog post, we present a solution concept of an interactive chatbot based on a Retrieval Augmented Generation (RAG)\u00a0architecture with Couchbase Capella as a Vector database. The retrieval [&hellip;]<\/p>\n","protected":false},"author":84768,"featured_media":16003,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[10122,2242,2225,1816,7666,9973,2389,9937],"tags":[9963,9989],"ppma_author":[9977,9981],"class_list":["post-16002","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence-ai","category-connectors","category-cloud","category-couchbase-server","category-edge-computing","category-generative-ai-genai","category-solutions","category-vector-search","tag-langchain","tag-nvidia"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.3 (Yoast SEO v27.3) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain - The Couchbase Blog<\/title>\n<meta name=\"description\" content=\"Develop an interactive GenAI application with grounded and relevant responses using Couchbase Capella-based RAG and accelerate it using NVIDIA NIM\/NeMo\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.couchbase.com\/blog\/pt\/accelerate-rag-ai-couchbase-nvidia\/\" \/>\n<meta property=\"og:locale\" content=\"pt_BR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain\" \/>\n<meta property=\"og:description\" content=\"Develop an interactive GenAI application with grounded and relevant responses using Couchbase Capella-based RAG and accelerate it using NVIDIA NIM\/NeMo\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.couchbase.com\/blog\/pt\/accelerate-rag-ai-couchbase-nvidia\/\" \/>\n<meta property=\"og:site_name\" content=\"The Couchbase Blog\" \/>\n<meta property=\"article:published_time\" content=\"2024-07-04T16:49:03+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-13T23:36:50+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1345\" \/>\n\t<meta property=\"og:image:height\" content=\"681\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Lokesh Goel, Software Engineer, Kiran Matty, Lead Product Manager AI\/ML\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Lokesh Goel, Software Engineer\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/\"},\"author\":{\"name\":\"Lokesh Goel, Developer Experience Engineer\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/person\\\/a918924898a24c1cbcf6712bb6d62b4e\"},\"headline\":\"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\\\/NeMo and LangChain\",\"datePublished\":\"2024-07-04T16:49:03+00:00\",\"dateModified\":\"2025-06-13T23:36:50+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/\"},\"wordCount\":859,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2024\\\/07\\\/image1-1.png\",\"keywords\":[\"langchain\",\"NVIDIA\"],\"articleSection\":[\"Artificial Intelligence (AI)\",\"Connectors\",\"Couchbase Capella\",\"Couchbase Server\",\"Edge computing\",\"Generative AI (GenAI)\",\"Solutions\",\"Vector Search\"],\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/\",\"name\":\"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\\\/NeMo and LangChain - The Couchbase Blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2024\\\/07\\\/image1-1.png\",\"datePublished\":\"2024-07-04T16:49:03+00:00\",\"dateModified\":\"2025-06-13T23:36:50+00:00\",\"description\":\"Develop an interactive GenAI application with grounded and relevant responses using Couchbase Capella-based RAG and accelerate it using NVIDIA NIM\\\/NeMo\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#breadcrumb\"},\"inLanguage\":\"pt-BR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2024\\\/07\\\/image1-1.png\",\"contentUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/sites\\\/1\\\/2024\\\/07\\\/image1-1.png\",\"width\":1345,\"height\":681,\"caption\":\"NVIDIA NIM\\\/NeMo and LangChain\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/accelerate-rag-ai-couchbase-nvidia\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\\\/NeMo and LangChain\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\",\"name\":\"The Couchbase Blog\",\"description\":\"Couchbase, the NoSQL Database\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-BR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#organization\",\"name\":\"The Couchbase Blog\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/04\\\/admin-logo.png\",\"contentUrl\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/04\\\/admin-logo.png\",\"width\":218,\"height\":34,\"caption\":\"The Couchbase Blog\"},\"image\":{\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/#\\\/schema\\\/person\\\/a918924898a24c1cbcf6712bb6d62b4e\",\"name\":\"Lokesh Goel, Developer Experience Engineer\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-BR\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g28f42fa6eaa9ec33a742151714d1f0cb\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g\",\"caption\":\"Lokesh Goel, Developer Experience Engineer\"},\"url\":\"https:\\\/\\\/www.couchbase.com\\\/blog\\\/pt\\\/author\\\/lokeshgoel\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain - The Couchbase Blog","description":"Desenvolva um aplicativo GenAI interativo com respostas fundamentadas e relevantes usando o RAG baseado no Couchbase Capella e acelere-o usando o NVIDIA NIM\/NeMo","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.couchbase.com\/blog\/pt\/accelerate-rag-ai-couchbase-nvidia\/","og_locale":"pt_BR","og_type":"article","og_title":"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain","og_description":"Develop an interactive GenAI application with grounded and relevant responses using Couchbase Capella-based RAG and accelerate it using NVIDIA NIM\/NeMo","og_url":"https:\/\/www.couchbase.com\/blog\/pt\/accelerate-rag-ai-couchbase-nvidia\/","og_site_name":"The Couchbase Blog","article_published_time":"2024-07-04T16:49:03+00:00","article_modified_time":"2025-06-13T23:36:50+00:00","og_image":[{"width":1345,"height":681,"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png","type":"image\/png"}],"author":"Lokesh Goel, Software Engineer, Kiran Matty, Lead Product Manager AI\/ML","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Lokesh Goel, Software Engineer","Est. reading time":"5 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#article","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/"},"author":{"name":"Lokesh Goel, Developer Experience Engineer","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/a918924898a24c1cbcf6712bb6d62b4e"},"headline":"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain","datePublished":"2024-07-04T16:49:03+00:00","dateModified":"2025-06-13T23:36:50+00:00","mainEntityOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/"},"wordCount":859,"commentCount":0,"publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png","keywords":["langchain","NVIDIA"],"articleSection":["Artificial Intelligence (AI)","Connectors","Couchbase Capella","Couchbase Server","Edge computing","Generative AI (GenAI)","Solutions","Vector Search"],"inLanguage":"pt-BR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/","url":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/","name":"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain - The Couchbase Blog","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#primaryimage"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png","datePublished":"2024-07-04T16:49:03+00:00","dateModified":"2025-06-13T23:36:50+00:00","description":"Desenvolva um aplicativo GenAI interativo com respostas fundamentadas e relevantes usando o RAG baseado no Couchbase Capella e acelere-o usando o NVIDIA NIM\/NeMo","breadcrumb":{"@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#breadcrumb"},"inLanguage":"pt-BR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/"]}]},{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#primaryimage","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/07\/image1-1.png","width":1345,"height":681,"caption":"NVIDIA NIM\/NeMo and LangChain"},{"@type":"BreadcrumbList","@id":"https:\/\/www.couchbase.com\/blog\/accelerate-rag-ai-couchbase-nvidia\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.couchbase.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Accelerate Couchbase-Powered RAG AI Application With NVIDIA NIM\/NeMo and LangChain"}]},{"@type":"WebSite","@id":"https:\/\/www.couchbase.com\/blog\/#website","url":"https:\/\/www.couchbase.com\/blog\/","name":"Blog do Couchbase","description":"Couchbase, o banco de dados NoSQL","publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-BR"},{"@type":"Organization","@id":"https:\/\/www.couchbase.com\/blog\/#organization","name":"Blog do Couchbase","url":"https:\/\/www.couchbase.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","width":218,"height":34,"caption":"The Couchbase Blog"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/a918924898a24c1cbcf6712bb6d62b4e","name":"Lokesh Goel, engenheiro de experi\u00eancia do desenvolvedor","image":{"@type":"ImageObject","inLanguage":"pt-BR","@id":"https:\/\/secure.gravatar.com\/avatar\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g28f42fa6eaa9ec33a742151714d1f0cb","url":"https:\/\/secure.gravatar.com\/avatar\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g","caption":"Lokesh Goel, Developer Experience Engineer"},"url":"https:\/\/www.couchbase.com\/blog\/pt\/author\/lokeshgoel\/"}]}},"acf":[],"authors":[{"term_id":9977,"user_id":84768,"is_guest":0,"slug":"lokeshgoel","display_name":"Lokesh Goel, Software Engineer","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/8f2cb3333278f50e81806e9c068732cf57d7268c2b1ed80cc3dc9645151df405?s=96&d=mm&r=g","0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""},{"term_id":9981,"user_id":85346,"is_guest":0,"slug":"kiranmatty","display_name":"Kiran Matty, Lead Product Manager AI\/ML","avatar_url":{"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/06\/T024FJS4M-U064W1AETPD-456e21a66cf5-512.png","url2x":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/06\/T024FJS4M-U064W1AETPD-456e21a66cf5-512.png"},"0":null,"1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/16002","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/users\/84768"}],"replies":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/comments?post=16002"}],"version-history":[{"count":0,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/posts\/16002\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media\/16003"}],"wp:attachment":[{"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/media?parent=16002"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/categories?post=16002"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/tags?post=16002"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/pt\/wp-json\/wp\/v2\/ppma_author?post=16002"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}