{"id":17010,"date":"2025-04-07T10:48:50","date_gmt":"2025-04-07T17:48:50","guid":{"rendered":"https:\/\/www.couchbase.com\/blog\/?p=17010"},"modified":"2025-07-08T09:15:31","modified_gmt":"2025-07-08T16:15:31","slug":"pyspark-ga-couchbase-spark-connector","status":"publish","type":"post","link":"https:\/\/www.couchbase.com\/blog\/ko\/pyspark-ga-couchbase-spark-connector\/","title":{"rendered":"Couchbase\uc640 PySpark\ub85c \ud655\uc7a5\uc131\uc774 \ub6f0\uc5b4\ub09c AI\/ML \uc560\ud50c\ub9ac\ucf00\uc774\uc158 \uad6c\ucd95\ud558\uae30"},"content":{"rendered":"<p>\ub2e4\uc74c\uc5d0 \ub300\ud55c Python \uc9c0\uc6d0\uc758 \uc815\uc2dd \ubc84\uc804(GA)\uc744 \ubc1c\ud45c\ud558\uac8c \ub418\uc5b4 \uae30\uc058\uac8c \uc0dd\uac01\ud569\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/pyspark.html\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uc2a4\ud30c\ud06c \ucee4\ub125\ud130<\/a>\ub97c \ud1b5\ud574 Couchbase \uc11c\ubc84\uc640 <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/index.html\" target=\"_blank\" rel=\"noopener\">\uc544\ud30c\uce58 \uc2a4\ud30c\ud06c<\/a> \ub97c \ud30c\uc774\uc36c \ub370\uc774\ud130 \uc5d4\uc9c0\ub2c8\uc5b4\uc5d0\uac8c \uacf5\uac1c\ud569\ub2c8\ub2e4. \uc774\ubc88 GA \ub9b4\ub9ac\uc988\ub294 \uc774 \ucee4\ub125\ud130\uac00 \ud504\ub85c\ub355\uc158\uc5d0 \ubc14\ub85c \uc0ac\uc6a9\ud560 \uc218 \uc788\uace0 \uc644\ubcbd\ud558\uac8c \uc9c0\uc6d0\ub41c\ub2e4\ub294 \uac83\uc744 \uc758\ubbf8\ud558\uba70, PySpark \uc560\ud50c\ub9ac\ucf00\uc774\uc158\uc774 Couchbase\uc5d0\uc11c \uc6d0\ud65c\ud558\uac8c \uc77d\uace0 \uc4f8 \uc218 \uc788\uac8c \ud574\uc90d\ub2c8\ub2e4. \uc774\uc81c \ub370\uc774\ud130 \uc5d4\uc9c0\ub2c8\uc5b4\ub294 Couchbase\uc758 \uace0\uc131\ub2a5 NoSQL \ub370\uc774\ud130\ubca0\uc774\uc2a4(SQL++\/SQL++ \ucffc\ub9ac \uc5b8\uc5b4 \ud3ec\ud568)\uc640 Spark\uc758 \ubd84\uc0b0 \ucc98\ub9ac \uc5d4\uc9c4\uc744 \ud1b5\ud574 \uc774\ub7ec\ud55c \uae30\uc220\uc744 \uc190\uc27d\uac8c \uacb0\ud569\ud558\uc5ec \ube60\ub974\uace0 \ud655\uc7a5 \uac00\ub2a5\ud55c \ub370\uc774\ud130 \ud30c\uc774\ud504\ub77c\uc778\uacfc \ubd84\uc11d \uc6cc\ud06c\ud50c\ub85c\uc6b0\ub97c \uad6c\ucd95\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc694\ucee8\ub300, PySpark\uc6a9 Couchbase Spark \ucee4\ub125\ud130\ub294 \ud6a8\uc728\uc801\uc778 \ubcd1\ub82c \ub370\uc774\ud130 \ud1b5\ud569\uc744 \uc2e4\ud604\ud558\uc5ec Couchbase\uc5d0 \uc800\uc7a5\ub41c \ub370\uc774\ud130\uc5d0\uc11c ETL\/ELT, \uc2e4\uc2dc\uac04 \ubd84\uc11d, \uba38\uc2e0 \ub7ec\ub2dd \ub4f1\uc744 \uc704\ud574 Spark\ub97c \ud65c\uc6a9\ud560 \uc218 \uc788\uac8c \ud574\uc90d\ub2c8\ub2e4.<\/p>\n<p>\uc774 \uae00\uc5d0\uc11c\ub294 PySpark \ucee4\ub125\ud130\ub97c \uc2dc\uc791\ud558\ub294 \ubc29\ubc95, Couchbase \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4\uc640 Capella \uceec\ub7fc\ud615 \ub370\uc774\ud130\ubca0\uc774\uc2a4 \ubaa8\ub450\uc5d0 \ub300\ud55c \uae30\ubcf8\uc801\uc778 \uc77d\uae30\/\uc4f0\uae30 \uc791\uc5c5(\ud0a4 \uac12 \ubc0f \ucffc\ub9ac \uae30\ubc18 \ubaa8\ub450)\uc744 \uc2dc\uc5f0\ud558\uace0, \ucd5c\uace0\uc758 \ucc98\ub9ac\ub7c9\uc744 \uc5bb\uae30 \uc704\ud55c \uc131\ub2a5 \ud29c\ub2dd \ud301\uc744 \uacf5\uc720\ud569\ub2c8\ub2e4. Scala\uc5d0\uc11c Couchbase Spark \ucee4\ub125\ud130\ub97c \uc0ac\uc6a9\ud574 \uc654\ub4e0, Couchbase-Spark \ud1b5\ud569\uc744 \ucc98\uc74c \uc0ac\uc6a9\ud558\ub4e0, \uc774 \uac00\uc774\ub4dc\ub294 \ub370\uc774\ud130 \uc5d4\uc9c0\ub2c8\uc5b4\ub9c1 \uc694\uad6c\uc0ac\ud56d\uc5d0 \ub9de\uac8c PySpark\ub97c \ube60\ub974\uac8c \uc0ac\uc6a9\ud558\ub294 \ub370 \ub3c4\uc6c0\uc774 \ub420 \uac83\uc785\ub2c8\ub2e4.<\/p>\n<h2>\uc65c \ud30c\uc774\uc2a4\ud30c\ud06c\uc778\uac00?<\/h2>\n<p>\ub370\uc774\ud130 \uacfc\ud559 \ubc0f \uc5d4\uc9c0\ub2c8\uc5b4\ub9c1 \uc6cc\ud06c\ud50c\ub85c\uc6b0\uc5d0\uc11c Spark\uc758 \ub2e8\uc21c\uc131\uacfc \ubc29\ub300\ud55c Python ML \uc5d0\ucf54\uc2dc\uc2a4\ud15c\uc73c\ub85c \uc778\ud574 Python\uc744 \uc120\ud638\ud558\ub294 \ub370\uc774\ud130 \uc5d4\uc9c0\ub2c8\uc5b4\uc640 \uac1c\ubc1c\uc790\ub4e4\uc758 \uc218\uc694\uac00 \uc99d\uac00\ud568\uc5d0 \ub530\ub77c \uae30\uc874 Couchbase Spark \ucee4\ub125\ud130\uc5d0 PySpark \uc9c0\uc6d0\uc744 \ucd94\uac00\ud558\uac8c \ub418\uc5c8\uc2b5\ub2c8\ub2e4. \uc774 \uc9c0\uc6d0\uc744 \ud1b5\ud574 \uc774\ubbf8 Python\uc744 \uc0ac\uc6a9 \uc911\uc778 \ud300\ub3c4 \uc774\uc81c Couchbase\ub97c \ud1b5\ud569\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/capella\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c(DBaaS)<\/a>, <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/a> \ub610\ub294 <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/analytics\/\" target=\"_blank\" rel=\"noopener\">\uce74\ud3a0\ub77c \uce7c\ub7fc<\/a> \ub370\uc774\ud130\ubca0\uc774\uc2a4)\ub97c Python \uae30\ubc18 Spark \uc6cc\ud06c\ud50c\ub85c\uc6b0\uc5d0 \ud1b5\ud569\ud558\uc5ec \ub354 \ud3ed\ub113\uac8c \ucc44\ud0dd\ud558\uace0 \ub370\uc774\ud130 \ud504\ub85c\uc138\uc2a4\ub97c \uac04\uc18c\ud654\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<p>AI\/ML \uc0ac\uc6a9 \uc0ac\ub840\uc5d0\uc11c Python\uc774 \uc555\ub3c4\uc801\uc778 \uc6b0\uc704\ub97c \uc810\ud558\uace0 \uc788\uc73c\uba70, SparkML, PyTorch, TensorFlow, H2O, DataRobot, scikit-learn, SageMaker\uc640 \uac19\uc740 \ud504\ub808\uc784\uc6cc\ud06c\uc640 Matplotlib \ubc0f Plotly \uac19\uc740 \uc778\uae30 \ud0d0\uc0c9 \ub370\uc774\ud130 \ubd84\uc11d \ub3c4\uad6c\uc5d0\uc11c \uc9c0\uc6d0\ub418\ub294 \uac83\uc740 PySpark \ud1b5\ud569\uc758 \ud544\uc694\uc131\uc744 \ub354\uc6b1 \uac15\uc870\ud558\uace0 \uc788\uc2b5\ub2c8\ub2e4. \ub610\ud55c, PySpark \ud638\ud658\uc131\uc744 \ud1b5\ud574 GPU \uac00\uc18d(Spark RAPIDS)\uc744 \ud65c\uc6a9\ud55c \uac00\uc18d\ud654\ub41c ETL \ubc0f ML \ud30c\uc774\ud504\ub77c\uc778\uc744 \uc774\uc6a9\ud560 \uc218 \uc788\uc73c\uba70, Pandas, NumPy, Spark\uc758 \ub0b4\uc7a5 \uae30\ub2a5 \uc5d4\uc9c0\ub2c8\uc5b4\ub9c1 API \ub4f1 \ub110\ub9ac \ucc44\ud0dd\ub41c \ub77c\uc774\ube0c\ub7ec\ub9ac\ub97c \uc0ac\uc6a9\ud574 \uc815\uad50\ud55c \uae30\ub2a5 \uc5d4\uc9c0\ub2c8\uc5b4\ub9c1 \ubc0f \ub370\uc774\ud130 \ub7ad\uae00\ub9c1 \uc791\uc5c5\uc744 \uc6a9\uc774\ud558\uac8c \ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc774 \uc0c8\ub85c\uc6b4 \uc9c0\uc6d0\uc740 \ub370\uc774\ud130 \ud504\ub85c\uc138\uc2a4\ub97c \ud06c\uac8c \uac04\uc18c\ud654\ud558\uace0 \ub370\uc774\ud130 \uacfc\ud559 \ubc0f \uc5d4\uc9c0\ub2c8\uc5b4\ub9c1 \ud300\uc5d0\uc11c Couchbase\ub97c \ucc44\ud0dd\ud560 \uc218 \uc788\ub294 \uae30\ud68c\ub97c \ud655\ub300\ud569\ub2c8\ub2e4.<\/p>\n<h2>\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud30c\uc774\uc2a4\ud30c\ud06c \uc2dc\uc791\ud558\uae30<\/h2>\n<p>\uc2dc\uc791\uc740 \uac04\ub2e8\ud569\ub2c8\ub2e4. Couchbase Spark \ucee4\ub125\ud130\ub294 Spark \ud658\uacbd\uc5d0 \ucd94\uac00\ud558\ub294 \ub2e8\uc77c JAR(Java \uc544\uce74\uc774\ube0c)\ub85c \ubc30\ud3ec\ub429\ub2c8\ub2e4. \ucee4\ub125\ud130\ub294 \uacf5\uc2dd <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/download-links.html\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ub2e4\uc6b4\ub85c\ub4dc \uc0ac\uc774\ud2b8<\/a> \ub610\ub294 <a href=\"https:\/\/mvnrepository.com\/artifact\/com.couchbase.client\/spark-connector\" target=\"_blank\" rel=\"noopener\">\uba54\uc774\ube10 \uc88c\ud45c<\/a>. JAR\uc774 \uc788\uc73c\uba74 \ucee4\ub125\ud130 \ubc0f Couchbase \uc5f0\uacb0 \uc124\uc815\uc73c\ub85c Spark \uc138\uc158\uc744 \uad6c\uc131\ud558\ub294 \uac83\ub9cc\ud07c\uc774\ub098 \uac04\ub2e8\ud558\uac8c PySpark\uc5d0\uc11c \uc0ac\uc6a9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<p><b>1. Couchbase \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4 \ub610\ub294 Capella Columnar \ub370\uc774\ud130\ubca0\uc774\uc2a4\ub97c \uac00\uc838\uc624\uac70\ub098 \ub9cc\ub4ed\ub2c8\ub2e4.<\/b><\/p>\n<p>Couchbase\ub97c \uc2dc\uc791\ud558\ub294 \uac00\uc7a5 \ube60\ub978 \ubc29\ubc95\uc740 \uc800\ud76c\uc758 <a href=\"https:\/\/cloud.couchbase.com\/sign-in\" target=\"_blank\" rel=\"noopener\">\uce74\ud3a0\ub77c DBaaS<\/a>. \uac70\uae30\uc5d0\uc11c \uae30\uc874 \ub370\uc774\ud130\ubca0\uc774\uc2a4\ub97c \ucc3e\uac70\ub098 \uc0c8 \ub370\uc774\ud130\ubca0\uc774\uc2a4\ub97c \ub9cc\ub4e4 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/cloud\/clusters\/databases.html\" target=\"_blank\" rel=\"noopener\">\uc6b4\uc601<\/a> \ub610\ub294 <a href=\"https:\/\/docs.couchbase.com\/columnar\/admin\/prepare-project.html\" target=\"_blank\" rel=\"noopener\">\uae30\ub465\ud615<\/a> (\ubd84\uc11d\uc6a9) \ub370\uc774\ud130\ubca0\uc774\uc2a4\uc5d0 \uc800\uc7a5\ud569\ub2c8\ub2e4. \ub610\ub294 <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac\ud615 \uce74\uc6b0\uce58\ubca0\uc774\uc2a4<\/a>.<\/p>\n<p><b>2. \ud30c\uc774\uc2a4\ud30c\ud06c \uc124\uce58(\uc544\uc9c1 \uc124\uce58\ud558\uc9c0 \uc54a\uc740 \uacbd\uc6b0)<\/b><\/p>\n<p>Python \ud658\uacbd\uc5d0\uc11c \uc791\uc5c5\ud558\ub294 \uacbd\uc6b0 pip\ub97c \uc0ac\uc6a9\ud558\uc5ec PySpark\ub97c \uc124\uce58\ud558\uc138\uc694. \uc608\ub97c \ub4e4\uc5b4 \uac00\uc0c1 \ud658\uacbd\uc5d0\uc11c:<\/p>\n<pre class=\"nums:false lang:default decode:true\">pip \uc124\uce58 \ud30c\uc774\uc2a4\ud30c\ud06c<\/pre>\n<p>\ud30c\uc774\uc36c\uacfc \ud568\uaed8 \uc0ac\uc6a9\ud560 \uc218 \uc788\ub3c4\ub85d Apache Spark\uac00 \uc124\uce58\ub429\ub2c8\ub2e4. \uae30\uc874 Spark \ud074\ub7ec\uc2a4\ud130 \ub610\ub294 \ub370\uc774\ud130\ube0c\ub9ad\uc2a4\uc5d0\uc11c \uc2e4\ud589 \uc911\uc778 \uacbd\uc6b0 PySpark\ub97c \uc774\ubbf8 \uc0ac\uc6a9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<p><b>3. \uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uc2a4\ud30c\ud06c \ucee4\ub125\ud130 JAR\uc744 \ud3ec\ud568\ud569\ub2c8\ub2e4.<\/b><\/p>\n<p><a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/download-links.html#using-from-pyspark\" target=\"_blank\" rel=\"noopener\">\ub2e4\uc6b4\ub85c\ub4dc<\/a> \uc758 <code>\uc2a4\ud30c\ud06c-\ucee4\ub125\ud130-\uc5b4\uc148\ube14\ub9ac-.jar<\/code> \uc5d0\uc11c \ucd5c\uc2e0 \ucee4\ub125\ud130 \ub9b4\ub9ac\uc2a4\ub97c \ub2e4\uc6b4\ub85c\ub4dc\ud558\uc138\uc694. \uadf8\ub7f0 \ub2e4\uc74c Spark \uc138\uc158\uc744 \ub9cc\ub4e4\uac70\ub098 \uc791\uc5c5\uc744 \uc81c\ucd9c\ud560 \ub54c \uad6c\uc131\uc5d0\uc11c \uc774 JAR\uc744 \uc81c\uacf5\ud558\uc138\uc694. \uc774 \uc791\uc5c5\uc744 \uc218\ud589\ud558\ub824\uba74 <code>--jars<\/code> \uc635\uc158\uc758 <code>\uc2a4\ud30c\ud06c \uc81c\ucd9c<\/code> \ub610\ub294 \uc544\ub798 \uadf8\ub9bc\uacfc \uac19\uc774 \ucf54\ub4dc\uc758 SparkSession \ube4c\ub354\ub97c \ud1b5\ud574 \uc2e4\ud589\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<p><b>4. Couchbase \uc5f0\uacb0 \uad6c\uc131<\/b><\/p>\n<p>Couchbase \ud074\ub7ec\uc2a4\ud130 \uc5f0\uacb0 \ubb38\uc790\uc5f4\uacfc \uc790\uaca9 \uc99d\uba85(\uc0ac\uc6a9\uc790 \uc774\ub984 \ubc0f \ube44\ubc00\ubc88\ud638)\uc744 \uc9c0\uc815\ud574\uc57c \ud569\ub2c8\ub2e4. Capella\uc5d0\uc11c\ub294 '\uc5f0\uacb0' \ud0ed\uc5d0\uc11c \uc6b4\uc601 \ubc0f <strong>\uc124\uc815-&gt;\uc5f0\uacb0 \ubb38\uc790\uc5f4<\/strong> \ub97c \uc785\ub825\ud569\ub2c8\ub2e4. \ud544\uc694\ud55c \uacbd\uc6b0 \uae30\ubcf8 \ubc84\ud0b7 \ub610\ub294 \ubc94\uc704\ub97c \uc9c0\uc815\ud560 \uc218\ub3c4 \uc788\uc2b5\ub2c8\ub2e4(\uc791\uc5c5\ubcc4\ub85c \ubc84\ud0b7\/\ubc94\uc704\ub97c \uc9c0\uc815\ud560 \uc218\ub3c4 \uc788\uc74c).<\/p>\n<p>\uc544\ub798\ub294 <b>\uac04\ub2e8\ud55c PySpark \uc608\uc81c<\/b> \ub97c \uc124\uc815\ud558\ub294 <code>SparkSession<\/code> \ub97c \uc0ac\uc6a9\ud558\uc5ec Couchbase \ud074\ub7ec\uc2a4\ud130\uc5d0 \uc5f0\uacb0\ud55c \ub2e4\uc74c \uc77c\ubd80 \ub370\uc774\ud130\ub97c \uc77d\uc2b5\ub2c8\ub2e4:<\/p>\n<pre class=\"nums:false lang:default decode:true\">pyspark.sql\uc5d0\uc11c SparkSession\uc744 \uac00\uc838\uc635\ub2c8\ub2e4.\r\n# Couchbase \ucee4\ub125\ud130 \ubc0f \uc5f0\uacb0 \uc124\uc815\uc73c\ub85c SparkSession \ucd08\uae30\ud654\ud558\uae30\r\n\r\nspark = SparkSession.builder \\.\r\n    .appName(\"CouchbaseIntegrationExample\") \\.\r\n    .master(\"local[*]\") \\ #(\uc608: \ub85c\uceec \uc2a4\ud30c\ud06c \uc0ac\uc6a9); \uc2a4\ud30c\ud06c \ud074\ub7ec\uc2a4\ud130\uc758 \uacbd\uc6b0 \uc0dd\ub7b5 \ub610\ub294 \uc870\uc815\ud569\ub2c8\ub2e4.\r\n    .config(\"spark.jars\", \"\/path\/to\/spark-connector-assembly-.jar\") \\.\r\n    .config(\"spark.couchbase.connectionString\", \"couchbases:\/\/\") \\.\r\n    .config(\"spark.couchbase.username\", \"\") \\.\r\n    .config(\"spark.couchbase.password\", \"\") \\.\r\n    .getOrCreate()\r\n\r\n# \uc0d8\ud50c \ubc84\ud0b7\uc744 \uc0ac\uc6a9\ud558\uc5ec Couchbase\uc5d0\uc11c \uba87 \uac1c\uc758 \ubb38\uc11c\ub97c \uc77d\uc5b4 \uc5f0\uacb0\uc744 \ud14c\uc2a4\ud2b8\ud569\ub2c8\ub2e4.\r\ndf = spark.read.format(\"couchbase.query\") \\.\r\n    .option(\"\ubc84\ud0b7\", \"\ubc84\ud0b7_\uc774\ub984\") \\.\r\n    .option(\"scope\", \"scope_name\") \\.\r\n    .option(\"collection\", \"collection_name\") \\.\r\n    .load()\r\n\r\ndf.printSchema()\r\ndf.show(5)<\/pre>\n<p>\uc704\uc758 \ucf54\ub4dc\uc5d0\uc11c\ub294 Couchbase \ucee4\ub125\ud130 JAR\uc744 \ud3ec\ud568\ud558\ub3c4\ub85d Spark \uc138\uc158\uc744 \uad6c\uc131\ud558\uace0 Couchbase \ud074\ub7ec\uc2a4\ud130\ub97c \uac00\ub9ac\ud0a4\ub3c4\ub85d \ud569\ub2c8\ub2e4. \uadf8\ub7f0 \ub2e4\uc74c \ub370\uc774\ud130 \ud504\ub808\uc784 <code>df<\/code> \uc5d0\uc11c \uc77d\uc5b4\uc11c <code>\ubc84\ud0b7_\uc774\ub984<\/code> \ubc84\ud0b7 (\ud2b9\ud788<code>\ub300\ucc98_\uc774\ub984.\uceec\ub809\uc158_\uc774\ub984<\/code> \uceec\ub809\uc158)\uc744 \ud1b5\ud574 \ucffc\ub9ac \uc11c\ube44\uc2a4\ub97c \ud1b5\ud574 \ud655\uc778\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<p>\uc774 \ubb38\uc11c\uc758 \ub098\uba38\uc9c0 \ubd80\ubd84\uc5d0\uc11c\ub294 \uc0d8\ud50c \ub370\uc774\ud130 \uc9d1\ud569\uc744 \ub85c\ub4dc\ud588\ub2e4\uace0 \uac00\uc815\ud569\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/scala-sdk\/current\/ref\/travel-app-data-model.html\" target=\"_blank\" rel=\"noopener\">\uc5ec\ud589 \uc0d8\ud50c <\/a>\uce74\uc6b0\uce58\ubca0\uc774\uc2a4\uc5d0 \ub300\ud574 \uc218\ud589\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/cloud\/get-started\/run-first-queries.html\" target=\"_blank\" rel=\"noopener\">\uce74\ud3a0\ub77c \uc6b4\uc601<\/a> \ub610\ub294 <a href=\"https:\/\/docs.couchbase.com\/columnar\/intro\/examples.html#travel-sample\" target=\"_blank\" rel=\"noopener\">\uae30\ub465\ud615<\/a> \uc544\uc8fc \uc27d\uac8c.<\/p>\n<h2>PySpark\ub97c \uc0ac\uc6a9\ud558\uc5ec \uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uc77d\uae30\/\uc4f0\uae30<\/h2>\n<p>Spark \uc138\uc158\uc774 Couchbase\uc5d0 \uc5f0\uacb0\ub418\uba74 \ub2e4\uc74c \ub450 \uac00\uc9c0 \uc791\uc5c5\uc744 \uc218\ud589\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <b>\ud0a4-\uac12 \uc5f0\uc0b0<\/b> (\uc4f0\uae30\uc6a9) \ubc0f <b>\ucffc\ub9ac \uc791\uc5c5<\/b> (\uc77d\uae30 \ubc0f \uc4f0\uae30 \ubaa8\ub450\uc5d0 SQL++ \uc0ac\uc6a9) \ub370\uc774\ud130 \ud504\ub808\uc784\uc744 \ud1b5\ud574.<\/p>\n<p>\ub2e4\uc74c \ud45c\ub294 Sparks \ucee4\ub125\ud130\uac00 Couchbase \ubc0f \uceec\ub7fc\ud615 \ub370\uc774\ud130\ubca0\uc774\uc2a4 \uc77d\uae30 \ubc0f \uc4f0\uae30\ub97c \uc704\ud574 \uc9c0\uc6d0\ud558\ub294 \ud615\uc2dd\uc744 \ubcf4\uc5ec\uc90d\ub2c8\ub2e4:<\/p>\n<table style=\"border: 1px solid Gainsboro;\">\n<tbody>\n<tr style=\"border: 1px solid Gainsboro;\">\n<td style=\"border: 1px solid Gainsboro; width: 15%;\"><\/td>\n<td style=\"border: 1px solid Gainsboro;\"><b>\uce74\uc6b0\uce58\ubca0\uc774\uc2a4\/\uce74\ud3a0\ub77c \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/b><\/td>\n<td style=\"border: 1px solid Gainsboro;\"><b>\uce74\ud3a0\ub77c \uceec\ub7fc \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/b><\/td>\n<\/tr>\n<tr style=\"border: 1px solid Gainsboro;\">\n<td style=\"border: 1px solid Gainsboro; background-color: white;\">\uc77d\uae30 \uc791\uc5c5<\/td>\n<td style=\"border: 1px solid Gainsboro; background-color: white;\"><code>read.format(\"couchbase.query\")<\/code><\/td>\n<td style=\"border: 1px solid Gainsboro; background-color: white;\"><code>read.format(\"couchbase.columnar\")<\/code><\/td>\n<\/tr>\n<tr style=\"border: 1px solid Gainsboro;\">\n<td style=\"border: 1px solid Gainsboro;\">\uc4f0\uae30 \uc791\uc5c5<\/td>\n<td style=\"border: 1px solid Gainsboro;\">(\ub370\uc774\ud130 \uc11c\ube44\uc2a4 \uc0ac\uc6a9 \uad8c\uc7a5)<\/p>\n<p><code>write.format(\"couchbase.kv\")<\/code><\/p>\n<p><code>write.format(\"couchbase.query\")<\/code><\/td>\n<td><code>write.format(\"couchbase.columnar\")<\/code><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>\ucffc\ub9ac \ub370\uc774\ud130 \ud504\ub808\uc784\uc744 \uc0ac\uc6a9\ud558\uc5ec \uce74\uc6b0\uce58\ubca0\uc774\uc2a4\uc5d0\uc11c \uc77d\uae30<\/h3>\n<p>Couchbase Spark \ucee4\ub125\ud130\ub97c \uc0ac\uc6a9\ud558\uba74 SQL++ \ucffc\ub9ac\ub97c \ud1b5\ud574 Couchbase \ubc84\ud0b7\uc758 \ub370\uc774\ud130\ub97c Spark \ub370\uc774\ud130\ud504\ub808\uc784\uc73c\ub85c \ub85c\ub4dc\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \ud615\uc2dd\uc774 \uc788\ub294 \ub370\uc774\ud130 \ud504\ub808\uc784 \ub9ac\ub354 \uc0ac\uc6a9 <code>couchbase.query<\/code>\ub97c \uc0ac\uc6a9\ud558\uc5ec \ubc84\ud0b7(\ubc0f \ubc94\uc704\/\uceec\ub809\uc158)\uacfc \uc120\ud0dd\uc801 \ucffc\ub9ac \ub9e4\uac1c\ubcc0\uc218\ub97c \uc9c0\uc815\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc608\ub97c \ub4e4\uc5b4 \uceec\ub809\uc158 \ub610\ub294 \ud544\ud130\ub85c \uc815\uc758\ub41c \ud558\uc704 \uc9d1\ud569\uc5d0\uc11c \ubaa8\ub4e0 \ubb38\uc11c\ub97c \uc77d\uc73c\ub824\uace0 \ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4:<\/p>\n<pre class=\"nums:false lang:default decode:true\"># \ucffc\ub9ac \uc11c\ube44\uc2a4\ub97c \uc0ac\uc6a9\ud558\uc5ec Couchbase \uceec\ub809\uc158\uc758 \ubaa8\ub4e0 \ubb38\uc11c \uc77d\uae30\r\nairlines_df = spark.read.format(\"couchbase.query\") \\.\r\n    .option(\"bucket\", \"travel-sample\") \\.\r\n    .option(\"scope\", \"inventory\") \\.\r\n    .option(\"collection\", \"airline\") \\.\r\n    .load()\r\n\r\n# \uc608\uc81c: Spark\ub97c \uc0ac\uc6a9\ud558\uc5ec \ub370\uc774\ud130 \ud504\ub808\uc784 \ud544\ud130\ub9c1(\uac00\ub2a5\ud55c \uacbd\uc6b0 Couchbase\ub85c \ud478\uc2dc \ub2e4\uc6b4)\r\nusa_airlines_df = airlines_df.filter(\"country = 'United States'\")\r\nusa_airlines_df.show(5)<\/pre>\n<p>\uc774 \uc608\uc81c\uc5d0\uc11c\ub294 <code>airlines_df<\/code> \uc5d0\uc11c \ubaa8\ub4e0 \ubb38\uc11c\ub97c \ub85c\ub4dc\ud569\ub2c8\ub2e4. <code>travel-sample.inventory.airline<\/code> \uceec\ub809\uc158\uc744 Spark \ub370\uc774\ud130 \ud504\ub808\uc784\uc5d0 \ub123\uc2b5\ub2c8\ub2e4. \uadf8\ub7f0 \ub2e4\uc74c \ud544\ud130\ub97c \uc801\uc6a9\ud558\uc5ec \ubbf8\uad6d\uc5d0 \ubcf8\uc0ac\ub97c \ub454 \ud56d\uacf5\uc0ac\ub97c \ucc3e\uc2b5\ub2c8\ub2e4. \ucee4\ub125\ud130\ub294 \ub2e4\uc74c\uc744 \uc2dc\ub3c4\ud569\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/spark-sql.html#aggregate-push-down\" target=\"_blank\" rel=\"noopener\">\uc544\ub798\ub85c \ubc00\uae30<\/a> \ud544\ud130\ub97c \ucd94\uac00\ud558\uc5ec \ubd88\ud544\uc694\ud55c \ub370\uc774\ud130\uac00 \uc804\uc1a1\ub418\uc9c0 \uc54a\ub3c4\ub85d \ud569\ub2c8\ub2e4(\uc608 <code>WHERE \uad6d\uac00 = '\ubbf8\uad6d'<\/code> \uc808\uc744 \uc2e4\ud589\ud569\ub2c8\ub2e4(\uac00\ub2a5\ud558\uba74 \uc2e4\ud589\ud558\ub294 SQL++ \ucffc\ub9ac\uc5d0\uc11c). \uacb0\uacfc\uc785\ub2c8\ub2e4, <code>\ubbf8\uad6d_\ud56d\uacf5_df<\/code>\ub294 Spark\uc758 \ub2e4\ub978 \ub370\uc774\ud130 \ud504\ub808\uc784\ucc98\ub7fc \uc0ac\uc6a9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4(\uc608: \ub2e4\ub978 \ub370\uc774\ud130 \ud504\ub808\uc784\uacfc \uc870\uc778\ud558\uac70\ub098 \uc9d1\uacc4\ub97c \uc801\uc6a9\ud558\ub294 \ub4f1).<\/p>\n<p>\ub0b4\ubd80\uc801\uc73c\ub85c\ub294 <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/spark-sql.html#_dataframe_partitioning\" target=\"_blank\" rel=\"noopener\">\ucee4\ub125\ud130 \ud30c\ud2f0\uc158<\/a> \ub97c \uc124\uc815\ud558\uba74 \ucffc\ub9ac \uacb0\uacfc\uac00 \uc5ec\ub7ec \uc791\uc5c5\uc73c\ub85c \ub098\ub269\ub2c8\ub2e4(\uc790\uc138\ud55c \ub0b4\uc6a9\uc740 <i>\uc131\ub2a5 \ud29c\ub2dd<\/i> \uc544\ub798 \ucc38\uc870), \uadf8\ub9ac\uace0 Couchbase\uc758 \ucffc\ub9ac \uc11c\ube44\uc2a4(SQL++ \uc5d4\uc9c4 \uae30\ubc18)\ub97c \uc0ac\uc6a9\ud574 \ub370\uc774\ud130\ub97c \uac80\uc0c9\ud569\ub2c8\ub2e4. \uac01 Spark \ud30c\ud2f0\uc158\uc740 \ub3d9\ub4f1\ud55c SQL++ \ucffc\ub9ac\ub85c \uac80\uc0c9\ub41c \ub370\uc774\ud130\uc758 \ud558\uc704 \uc9d1\ud569\uc5d0 \ud574\ub2f9\ud569\ub2c8\ub2e4. \uc774\ub807\uac8c \ud558\uba74 Spark\uc640 Couchbase\uc758 \ubd84\uc0b0 \ud2b9\uc131\uc744 \ud65c\uc6a9\ud558\uc5ec Couchbase\uc5d0\uc11c \ubcd1\ub82c \uc77d\uae30\uac00 \uac00\ub2a5\ud569\ub2c8\ub2e4.<\/p>\n<h3>\ud0a4-\uac12(KV) \uc5f0\uc0b0\uc744 \uc0ac\uc6a9\ud558\uc5ec Couchbase\uc5d0 \uc4f0\uae30(\uad8c\uc7a5)<\/h3>\n<p>\ucee4\ub125\ud130\ub294 \ub610\ud55c \ub2e4\uc74c\uc744 \ud1b5\ud574 Couchbase\uc5d0 \ub370\uc774\ud130 \uc4f0\uae30\ub97c \uc9c0\uc6d0\ud569\ub2c8\ub2e4. <b>\ub370\uc774\ud130 \uc11c\ube44\uc2a4(KV)<\/b> \ub610\ub294 \ucffc\ub9ac \uc11c\ube44\uc2a4\ub97c \ud1b5\ud574(SQL++ \uc2e4\ud589) <code>\uc0bd\uc785\/\uc5c5\uc11c\ud2b8<\/code> \uba85\ub839\uc5b4\ub97c \uc0ac\uc6a9\ud558\uc138\uc694). \uadf8\ub9ac\uace0 <b>\ucd94\ucc9c<\/b> \ub300\ubd80\ubd84\uc758 \uc0ac\uc6a9 \uc0ac\ub840\uc5d0\uc11c \uc0ac\uc6a9\ud558\ub294 \ubc29\ubc95\uc740 <b>\ud0a4-\uac12 \ub370\uc774\ud130 \uc18c\uc2a4<\/b> (<code>format(\"couchbase.kv\")<\/code>) <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/spark-sql.html#_dataframe_persistence\" target=\"_blank\" rel=\"noopener\">\ub354 \ub098\uc740 \uc131\ub2a5<\/a>. \ud0a4-\uac12 \ubaa8\ub4dc\uc5d0\uc11c \uac01 Spark \uc791\uc5c5\uc740 Couchbase \ub370\uc774\ud130 \ub178\ub4dc\uc5d0 \uc9c1\uc811 \ubb38\uc11c\ub97c \uc791\uc131\ud569\ub2c8\ub2e4.<\/p>\n<p>Couchbase\uc5d0 \ub370\uc774\ud130 \ud504\ub808\uc784\uc744 \uc791\uc131\ud560 \ub54c\ub294 \uac01 \ubb38\uc11c\uc5d0 \uace0\uc720\ud55c ID\uac00 \uc788\ub294\uc9c0 \ud655\uc778\ud574\uc57c \ud569\ub2c8\ub2e4(Couchbase\uc5d0\ub294 \ubb38\uc11c ID\uac00 \ud544\uc694\ud558\ubbc0\ub85c). \uae30\ubcf8\uc801\uc73c\ub85c \ucee4\ub125\ud130\ub294 \ub2e4\uc74c\uacfc \uac19\uc740 \uc774\ub984\uc758 \uc5f4\uc744 \ucc3e\uc2b5\ub2c8\ub2e4. <code>__META_ID<\/code> (\ub610\ub294 <code>META_ID<\/code> )\ub97c \ubb38\uc11c ID\uc5d0 \ub300\ud55c \ub370\uc774\ud130\ud504\ub808\uc784\uc5d0 \ucd94\uac00\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc0ac\uc6a9\uc790 \uc815\uc758 ID \ud544\ub4dc\ub97c \uc9c0\uc815\ud560 \uc218\ub3c4 \uc788\uc2b5\ub2c8\ub2e4. <code>IdFieldName<\/code> \uc635\uc158\uc744 \uc120\ud0dd\ud569\ub2c8\ub2e4.<\/p>\n<p>\uc608\ub97c \ub4e4\uc5b4, Spark \ub370\uc774\ud130\ud504\ub808\uc784\uc774 \uc788\ub2e4\uace0 \uac00\uc815\ud574 \ubcf4\uaca0\uc2b5\ub2c8\ub2e4. <code>new_airlines_df<\/code> \ub97c \uc791\uc131\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc5ec\uae30\uc5d0\ub294 \uc5f4\uc774 \uc788\uc2b5\ub2c8\ub2e4. <code>\ud56d\uacf5\uc0ac_ID<\/code> \uc5f4\uc740 Couchbase \ubb38\uc11c \ud0a4 \uc5ed\ud560\uc744 \ud558\uace0 \ub098\uba38\uc9c0 \uc5f4\uc740 \ubb38\uc11c \ucf58\ud150\uce20\uc785\ub2c8\ub2e4:<\/p>\n<pre class=\"nums:false lang:default decode:true\"># new_airlines_df\uac00 Couchbase\uc5d0 \uc791\uc131\ud558\ub824\ub294 \ub370\uc774\ud130\ud504\ub808\uc784\uc774\ub77c\uace0 \uac00\uc815\ud569\ub2c8\ub2e4.\r\n# \ubb38\uc11c ID\ub85c \uc0ac\uc6a9\ud560 \"airline_id\" \uc5f4\uc774 \ud3ec\ud568\ub418\uc5b4 \uc788\uc2b5\ub2c8\ub2e4.\r\nnew_airlines_df.write.format(\"couchbase.kv\") \\.\r\n    .option(\"bucket\", \"mybucket\") \\.\r\n    .option(\"scope\", \"myscope\") \\.\r\n    .option(\"collection\", \"airlines\") \\.\r\n    .option(\"idFieldName\", \"airline_id\") \\.\r\n    .save()<\/pre>\n<h3>\ucffc\ub9ac(SQL++) \uc791\uc5c5\uc744 \uc0ac\uc6a9\ud558\uc5ec Couchbase\uc5d0 \uc4f0\uae30<\/h3>\n<p>\uc77c\ubc18\uc801\uc73c\ub85c \ucffc\ub9ac \uc11c\ube44\uc2a4\ubcf4\ub2e4 \ube60\ub974\ubbc0\ub85c \uc704\uc640 \uac19\uc774 \ub370\uc774\ud130 \uc11c\ube44\uc2a4(KV)\ub97c \uc0ac\uc6a9\ud558\ub294 \uac83\uc774 \uc88b\uc9c0\ub9cc, \uc6d0\ud558\ub294 \uacbd\uc6b0 \ub2e4\uc74c\uc744 \uc0ac\uc6a9\ud558\uc5ec \ucffc\ub9ac \uc11c\ube44\uc2a4\ub97c \ud1b5\ud574 \uc791\uc131\ud560 \uc218\ub3c4 \uc788\uc2b5\ub2c8\ub2e4. <code>format(\"couchbase.query\")<\/code> \ub97c \ucd94\uac00\ud569\ub2c8\ub2e4. \uc774\ub807\uac8c \ud558\uba74 \ub0b4\ubd80\uc801\uc73c\ub85c \uac01 \ud589\uc5d0 \ub300\ud574 SQL++ UPSERT \ubb38\uc774 \uc2e4\ud589\ub429\ub2c8\ub2e4. \uc774 \ubc29\ubc95\uc740 SQL++ \uae30\ub2a5(\uc608: \uc11c\ubc84 \uce21 \ubcc0\ud658)\uc744 \ud65c\uc6a9\ud574\uc57c \ud558\ub294 \uacbd\uc6b0 \uc720\uc6a9\ud560 \uc218 \uc788\uc9c0\ub9cc, \uac04\ub2e8\ud55c \uc0bd\uc785\/\uc5c5\ub370\uc774\ud2b8\uc758 \uacbd\uc6b0 KV \uc811\uadfc \ubc29\uc2dd\uc774 \ub354 \ud6a8\uc728\uc801\uc785\ub2c8\ub2e4.<\/p>\n<pre class=\"nums:false lang:default decode:true\">df.write.format(\"couchbase.query\") \\.\r\n    .option(\"bucket\", \"mybucket\") \\.\r\n    .option(\"scope\", \"myscope\") \\.\r\n    .option(\"collection\", \"airlines\") \\.\r\n    .mode(\"\ub36e\uc5b4\uc4f0\uae30\") \\.\r\n    .save()<\/pre>\n<p>\ub2e4\uc74c \uc139\uc158\uc5d0\uc11c\ub294 \uc774\ub7ec\ud55c \uae30\ubcf8 \uc77d\uae30\/\uc4f0\uae30 \uc0ac\ub840\ub97c Couchbase\uc758 \ucd5c\uc2e0 \ubd84\uc11d \uc81c\ud488\uc778 Capella Columnar\uc5d0 \ub9de\uac8c \uc218\uc815\ud574 \ubcf4\uaca0\uc2b5\ub2c8\ub2e4.<\/p>\n<h2>\uce74\ud3a0\ub77c \uceec\ub7fc\uc5d0 \ub300\ud55c PySpark \uc9c0\uc6d0<\/h2>\n<p>Couchbase Spark Connector GA\uc758 \uc8fc\uc694 \uc0c8 \uae30\ub2a5 \uc911 \ud558\ub098\ub294 Capella Columnar \uc9c0\uc6d0\uc785\ub2c8\ub2e4. Capella Columnar\ub294 \uace0\uc131\ub2a5 \ubd84\uc11d\uc744 \uc704\ud574 \ub370\uc774\ud130\ub97c \uc5f4 \uc911\uc2ec \ud615\uc2dd\uc73c\ub85c \uc800\uc7a5\ud558\ub294 Couchbase Capella\uc758 JSON \ub124\uc774\ud2f0\ube0c \ubd84\uc11d \ub370\uc774\ud130\ubca0\uc774\uc2a4 \uc11c\ube44\uc2a4\uc785\ub2c8\ub2e4.<\/p>\n<h3>PySpark\ub85c \uc5f4 \ud615\uc2dd\uc758 \ub370\uc774\ud130 \uc77d\uae30<\/h3>\n<p>PySpark\uc5d0\uc11c Couchbase Capella Columnar \ud074\ub7ec\uc2a4\ud130\uc758 \ub370\uc774\ud130\ub97c \uc77d\ub294 \uac83\uc740 \uc138 \uac00\uc9c0 \ubcc0\uacbd \uc0ac\ud56d\uc744 \uc81c\uc678\ud558\uba74 Couchbase \uc6b4\uc601 \ud074\ub7ec\uc2a4\ud130\uc640 \uc720\uc0ac\ud569\ub2c8\ub2e4:<\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\">\uc0ac\uc6a9 <code>format(\"couchbase.columnar\")<\/code> \ub97c \uc0ac\uc6a9\ud558\uc5ec \uc5f0\uacb0\uc744 \uc9c0\uc815\ud558\ub294 \uac83\uc740 \uceec\ub7fc\ud615 \uc11c\ube44\uc2a4\uc6a9\uc785\ub2c8\ub2e4.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">\uceec\ub7fc\ud615\uc5d0 \ub300\ud55c \uc5f0\uacb0 \ubb38\uc790\uc5f4\uc740 Capella UI\uc5d0\uc11c \uac80\uc0c9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">\ub610\ud55c \uc635\uc158\uc73c\ub85c \ub370\uc774\ud130\ubca0\uc774\uc2a4, \ubc94\uc704 \ubc0f \uceec\ub809\uc158 \uc774\ub984(Couchbase\uc758 \ubc84\ud0b7\/\ubc94\uc704\/\uceec\ub809\uc158\uacfc \uc720\uc0ac)\uc744 \uc81c\uacf5\ud558\uc5ec \ub85c\ub4dc\ud560 \ub370\uc774\ud130 \uc9d1\ud569\uc744 \uc9c0\uc815\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/li>\n<\/ol>\n<p>Spark\uac00 \uad6c\uc131\ub418\uba74 Spark \ub370\uc774\ud130\ud504\ub808\uc784 \ub9ac\ub354 API\ub97c \uc0ac\uc6a9\ud558\uc5ec \uc5f4 \ud615\uc2dd \uc11c\ube44\uc2a4\uc5d0\uc11c \ub370\uc774\ud130\ub97c \ub85c\ub4dc\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4:<\/p>\n<pre class=\"nums:false lang:default decode:true\">pyspark.sql\uc5d0\uc11c SparkSession\uc744 \uac00\uc838\uc635\ub2c8\ub2e4.\r\n# Couchbase \uad6c\uc131\uc73c\ub85c SparkSession \ucd08\uae30\ud654(\ucee4\ub125\ud130 jar\ub97c \uc0ac\uc6a9\ud560 \uc218 \uc788\ub2e4\uace0 \uac00\uc815)\r\nspark = SparkSession.builder \\.\r\n    .appName(\"\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uc2a4\ud30c\ud06c \ucee4\ub125\ud130 \uceec\ub7fc\ud615 \uc608\uc81c\") \\.\r\n    .config(\"spark.couchbase.connectionString\", \"couchbases:\/\/your.columnar.connection.string\") \\.\r\n    .config(\"spark.couchbase.username\", \"YourColumnarUsername\") \\.\r\n    .config(\"spark.couchbase.password\", \"YourColumnarPassword\") \\.\r\n    .getOrCreate()\r\n\r\n# \uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c \uceec\ub7fc\ud615(travel-sample.inventory.airline \uceec\ub809\uc158)\uc5d0\uc11c \ub370\uc774\ud130 \ud504\ub808\uc784 \uc77d\uae30\r\nairlines_df = spark.read.format(\"couchbase.columnar\") \\.\r\n    .option(\"database\", \"travel-sample\") \\.\r\n    .option(\"\ubc94\uc704\", \"\uc778\ubca4\ud1a0\ub9ac\") \\.\r\n    .option(\"collection\", \"airline\") \\.\r\n    .load()<\/pre>\n<p>\uc774 \uc608\uc81c\uc5d0\uc11c\ub294 \uacb0\uacfc <code>airlines_df<\/code> \uc740 \uc77c\ubc18 Spark \ub370\uc774\ud130\ud504\ub808\uc784\uc73c\ub85c, \uc774\ub97c \uac80\uc0ac\ud558\uace0, \ubcc0\ud658\uc744 \uc2e4\ud589\ud558\uace0, \ub2e4\uc74c\uacfc \uac19\uc740 \uc791\uc5c5\uc744 \uc218\ud589\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <code>.count()<\/code> \ub610\ub294 <code>.show()<\/code> \ub97c \uc0ac\uc6a9\ud569\ub2c8\ub2e4. \uc608\ub97c \ub4e4\uc5b4 <code>airlines_df.show(5)<\/code> \ub294 \uba87 \uac00\uc9c0 \ud56d\uacf5\uc0ac \ubb38\uc11c\ub97c \uc778\uc1c4\ud569\ub2c8\ub2e4. <code>airlines_df.count()<\/code> \ub294 \uceec\ub809\uc158\uc758 \ubb38\uc11c \uc218\ub97c \ubc18\ud658\ud569\ub2c8\ub2e4. \ub0b4\ubd80\uc801\uc73c\ub85c \ucee4\ub125\ud130\ub294 \ucd5c\ub300 \ud2b9\uc815 \uc218\uc758 \ub808\ucf54\ub4dc(\uae30\ubcf8\uac12\uc740 1000\uac1c)\uae4c\uc9c0 \uc0d8\ud50c\ub9c1\ud558\uc5ec JSON \ubb38\uc11c\uc758 \uc2a4\ud0a4\ub9c8\ub97c \uc790\ub3d9\uc73c\ub85c \ucd94\ub860\ud569\ub2c8\ub2e4. \uc0d8\ud50c\ub9c1\ub41c \ubb38\uc11c\uc5d0 \uc77c\uad00\ub418\uac8c \ub098\ud0c0\ub098\ub294 \ubaa8\ub4e0 \ud544\ub4dc\ub294 \uc801\uc808\ud55c Spark \ub370\uc774\ud130 \uc720\ud615\uacfc \ud568\uaed8 \ub370\uc774\ud130 \ud504\ub808\uc784\uc758 \uc5f4\uc774 \ub429\ub2c8\ub2e4.<\/p>\n<p>\ubb38\uc11c\uc5d0 \ub2e4\uc591\ud55c \uc2a4\ud0a4\ub9c8\uac00 \uc788\ub294 \uacbd\uc6b0 \ucd94\ub860 \uc2dc \ubaa8\ub4e0 \ud544\ub4dc\uc758 \ud569\uc9d1\uc744 \ud3ec\ud568\ud558\ub294 \uc2a4\ud0a4\ub9c8\uac00 \uc0dd\uc131\ub420 \uc218 \uc788\uc2b5\ub2c8\ub2e4(\uc77c\ubd80 \ubb38\uc11c\uc5d0 \uc5c6\ub294 \ud544\ub4dc\ub294 \ud574\ub2f9 \ud589\uc5d0\uc11c null\uc774 \ub428). \uc2a4\ud0a4\ub9c8\uac00 \uc9c4\ud654 \uc911\uc774\uac70\ub098 \uace0\ub824\ub418\ub294 \ub808\ucf54\ub4dc\ub97c \uc81c\ud55c\ud558\ub824\ub294 \uacbd\uc6b0 \ub2e4\uc74c\uc5d0 \uc124\uba85\ub41c \ub300\ub85c \ub9ac\ub354\uc5d0 \uba85\uc2dc\uc801 \ud544\ud130(\uc220\uc5b4)\ub97c \uc81c\uacf5\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<h3>Spark\ub97c \ud1b5\ud574 \uce74\uc6b0\uce58\ubca0\uc774\uc2a4\uc5d0\uc11c \uc5f4 \ud615\uc2dd \ub370\uc774\ud130 \uc9d1\ud569 \ucffc\ub9ac\ud558\uae30<\/h3>\n<p>\ud2b9\ud788 \uceec\ub809\uc158\uc774 \ud070 \uacbd\uc6b0 \uc804\uccb4 \uceec\ub809\uc158\uc744 \ub85c\ub4dc\ud558\uace0 \uc2f6\uc9c0 \uc54a\uc740 \uacbd\uc6b0\uac00 \ub9ce\uc2b5\ub2c8\ub2e4. \ub370\uc774\ud130\ub97c \ub85c\ub4dc\ud560 \ub54c \ud544\ud130 \uc220\uc5b4\ub97c Capella Columnar \uc11c\ube44\uc2a4\ub85c \uc9c1\uc811 \ud478\uc2dc\ub2e4\uc6b4\ud558\uc5ec \ubd88\ud544\uc694\ud55c \ub370\uc774\ud130 \uc804\uc1a1\uc744 \ud53c\ud568\uc73c\ub85c\uc368 \uc131\ub2a5\uc744 \ucd5c\uc801\ud654\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc0ac\uc6a9 <code>.option(\"filter\", \"\")<\/code> \ub97c \uc0ac\uc6a9\ud558\uc5ec \uc77d\uae30 \uc791\uc5c5 \uc911\uc5d0 SQL++ WHERE \uc808\uc744 \uc801\uc6a9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc608\ub97c \ub4e4\uc5b4 \ubbf8\uad6d\uc5d0 \ubcf8\uc0ac\ub97c \ub454 \ud56d\uacf5\uc0ac\ub9cc \ub85c\ub4dc\ud558\ub294 \uacbd\uc6b0\uc785\ub2c8\ub2e4:<\/p>\n<pre class=\"nums:false lang:default decode:true\">usa_airlines_df = spark.read.format(\"couchbase.columnar\") \\.\r\n    .option(\"database\", \"travel-sample\") \\.\r\n    .option(\"scope\", \"inventory\") \\.\r\n    .option(\"collection\", \"airline\") \\.\r\n    .option(\"filter\", \"country = 'United States'\") \\.\r\n    .load()\r\n\r\nprint(usa_airlines_df.count()) # \uad6d\uac00\uac00 '\ubbf8\uad6d'\uc778 \ud56d\uacf5\uc0ac\ub9cc \ud574\ub2f9\ub428<\/pre>\n<p>\ucee4\ub125\ud130\ub294 \uc18c\uc2a4\uc5d0\uc11c \uc9c1\uc811 \uc774 \ud544\ud130\ub97c \uc2e4\ud589\ud558\uc5ec \uad00\ub828 \ubb38\uc11c\ub9cc \uac80\uc0c9\ud569\ub2c8\ub2e4. \ub610\ud55c \uacbd\uc6b0\uc5d0 \ub530\ub77c \ud22c\uc601(\ud2b9\uc815 \ud544\ub4dc \uc120\ud0dd) \ubc0f \uc9d1\uacc4\ub97c \ud478\uc2dc\ub2e4\uc6b4\ud560 \uc218\ub3c4 \uc788\uc2b5\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/spark-sql.html#aggregate-push-down\" target=\"_blank\" rel=\"noopener\">\ub2e8\uc21c \uc9d1\uacc4<\/a> \uac19\uc740 <code>COUNT<\/code>, <code>MIN<\/code>, <code>MAX<\/code>\ubc0f <code>SUM<\/code> \ub97c \uc2a4\ud30c\ud06c\uc5d0\uc11c \uacc4\uc0b0\ud558\uc9c0 \uc54a\uace0 \uac00\ub2a5\ud55c \ud55c \uceec\ub7fc \uc5d4\uc9c4\uc73c\ub85c \uc804\uc1a1\ud558\uc5ec \uc131\ub2a5\uc744 \ud5a5\uc0c1\uc2dc\ud0b5\ub2c8\ub2e4.<\/p>\n<p>\ub370\uc774\ud130\uac00 \ub370\uc774\ud130\ud504\ub808\uc784\uc5d0 \ub85c\ub4dc\ub418\uba74 \ud45c\uc900 \uc791\uc5c5\uc744 \uc218\ud589\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/columnar.html#spark-sql\" target=\"_blank\" rel=\"noopener\">\uc2a4\ud30c\ud06c \ubcc0\ud658<\/a>, \uc870\uc778 \ubc0f \uc9d1\uacc4\ub97c \uc0ac\uc6a9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc608\ub97c \ub4e4\uc5b4, Spark SQL\uc744 \uc0ac\uc6a9\ud558\uc5ec \uad6d\uac00\ubcc4 \ud56d\uacf5\uc0ac \uc218\ub97c \uacc4\uc0b0\ud558\ub824\uba74 \ub2e4\uc74c\uacfc \uac19\uc774 \uc784\uc2dc \ubdf0\ub97c \ub9cc\ub4e4\uc5b4 \ub370\uc774\ud130\uc5d0 \ub300\ud574 Spark SQL \ucffc\ub9ac\ub97c \uc2e4\ud589\ud560 \uc218\ub3c4 \uc788\uc2b5\ub2c8\ub2e4:<\/p>\n<pre class=\"nums:false lang:default decode:true\">airlines_df.createOrReplaceTempView(\"airlines_view\")\r\nresult_df = spark.sql(\"\"\"\r\n    SELECT \uad6d\uac00, COUNT(*) AS airline_count\r\n    FROM airlines_view\r\n    GROUP BY \uad6d\uac00\r\n    ORDER BY airline_count DESC\r\n\"\"\")\r\nresult_df.show(10)<\/pre>\n<p>\uc774 \ucffc\ub9ac\ub294 \uc804\uc801\uc73c\ub85c Spark \uc5d4\uc9c4 \ub0b4\uc5d0\uc11c \uc2e4\ud589\ub418\ubbc0\ub85c \ubcf5\uc7a1\ud55c \ubd84\uc11d \uc6cc\ud06c\ud50c\ub85c\uc6b0\uc5d0 Couchbase \ub370\uc774\ud130\ub97c \uc6d0\ud65c\ud558\uac8c \ud1b5\ud569\ud560 \uc218 \uc788\ub294 \uc720\uc5f0\uc131\uc744 \uc81c\uacf5\ud569\ub2c8\ub2e4.<\/p>\n<p>\uae30\ubcf8\uc801\uc778 \uc77d\uae30 \ubc0f \uc4f0\uae30\uc5d0 \ub300\ud574 \uc0b4\ud3b4\ubcf4\uc558\uc73c\ub2c8, \uc774\uc81c Couchbase\uc640 Spark \uac04\uc5d0 \ub300\ub7c9\uc758 \ub370\uc774\ud130\ub97c \uc774\ub3d9\ud560 \ub54c \uc131\ub2a5\uc744 \uc870\uc815\ud558\ub294 \ubc29\ubc95\uc5d0 \ub300\ud574 \uc54c\uc544\ubcf4\uaca0\uc2b5\ub2c8\ub2e4.<\/p>\n<h2>\uc131\ub2a5 \ud29c\ub2dd \ud301<\/h2>\n<p>\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud30c\uc774\uc2a4\ud30c\ud06c \ucee4\ub125\ud130\ub97c \uc0ac\uc6a9\ud560 \ub54c \ucc98\ub9ac\ub7c9\uacfc \ud6a8\uc728\uc131\uc744 \uadf9\ub300\ud654\ud558\ub824\uba74 \ub2e4\uc74c \ubaa8\ubc94 \uc0ac\ub840\ub97c \uace0\ub824\ud558\uc138\uc694.<\/p>\n<h3>\uc77d\uae30 \uc791\uc5c5 \uc870\uc815\ud558\uae30<\/h3>\n<p><b>\ubcd1\ub82c \ucc98\ub9ac\ub97c \uc704\ud55c \ucffc\ub9ac \ud30c\ud2f0\uc154\ub2dd \uc0ac\uc6a9<\/b><br \/>\n(<a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/capella\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c(DBaaS)<\/a>, <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/a> \ub610\ub294 <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/analytics\/\" target=\"_blank\" rel=\"noopener\">\uce74\ud3a0\ub77c \uce7c\ub7fc<\/a>)<\/p>\n<p>\uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4 \ub610\ub294 \uceec\ub7fc\ud615 \ub370\uc774\ud130\ubca0\uc774\uc2a4\uc5d0 \ub300\ud55c \ucffc\ub9ac \uc11c\ube44\uc2a4\ub97c \ud1b5\ud574 \uc77d\uc744 \ub54c\ub294 \ucffc\ub9ac \uacb0\uacfc\ub97c \ubd84\ud560\ud558\ub294 \ucee4\ub125\ud130\uc758 \uae30\ub2a5\uc744 \ud65c\uc6a9\ud558\uc138\uc694. \ucffc\ub9ac \uacb0\uacfc\ub97c \ubd84\ud560\ud558\uae30 \uc704\ud574 <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/spark-sql.html#_dataframe_partitioning\" target=\"_blank\" rel=\"noopener\">\ud30c\ud2f0\uc158 \uc218<\/a> (\ubc0f \ud558\ud55c\/\uc0c1\ud55c\uc774 \uc788\ub294 \uc22b\uc790 \ubd84\ud560 \ud544\ub4dc)\ub97c \uc0ac\uc6a9\ud558\uc5ec \ub370\uc774\ud130\ud504\ub808\uc784 \uc77d\uae30\ub97c \uc124\uc815\ud569\ub2c8\ub2e4. \uc88b\uc740 \uacbd\ud5d8 \ubc95\uce59\uc740 <code>\ud30c\ud2f0\uc158 \uc218<\/code> \uc5d0 <b>\ucd5c\uc18c\ud55c \ucffc\ub9ac \uc11c\ube44\uc2a4 CPU \ucf54\uc5b4\uc758 \ucd1d \uc218\uc785\ub2c8\ub2e4.<\/b> \ub97c \uc0ac\uc6a9\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc774\ub807\uac8c \ud558\uba74 Spark\uac00 \ubaa8\ub4e0 \ucffc\ub9ac \ub178\ub4dc\ub97c \ud65c\uc6a9\ud558\uc5ec \uc5ec\ub7ec \ucffc\ub9ac\ub97c \ubcd1\ub82c\ub85c \uc2e4\ud589\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc608\ub97c \ub4e4\uc5b4, Couchbase \ud074\ub7ec\uc2a4\ud130\uc758 \ucffc\ub9ac \uc11c\ube44\uc2a4\uc5d0 \ucd1d 8\uac1c\uc758 \ucf54\uc5b4\uac00 \uc788\ub294 \uacbd\uc6b0, \ub2e4\uc74c\uacfc \uac19\uc774 \uc124\uc815\ud569\ub2c8\ub2e4. <code>partitionCount &gt;= 8<\/code> \ub97c \uc124\uc815\ud558\uc5ec \ucd5c\uc18c 8\uac1c\uc758 \ubcd1\ub82c SQL++ \ucffc\ub9ac\uac00 \uc2e4\ud589\ub418\ub3c4\ub85d \ud569\ub2c8\ub2e4. \uc774\ub807\uac8c \ud558\uba74 \ubaa8\ub4e0 \ucffc\ub9ac \ub178\ub4dc\ub97c \ub3d9\uc2dc\uc5d0 \ud65c\uc6a9\ud568\uc73c\ub85c\uc368 \uc77d\uae30 \ucc98\ub9ac\ub7c9\uc744 \ud06c\uac8c \ub192\uc77c \uc218 \uc788\uc2b5\ub2c8\ub2e4. \uc774 \ub9ce\uc740 \ubcd1\ub82c \ucffc\ub9ac\ub97c \uc2e4\ud589\ud558\ub824\uba74 Spark \ud074\ub7ec\uc2a4\ud130\uc5d0 \ucda9\ubd84\ud55c \ucf54\uc5b4\uac00 \uc788\uc5b4\uc57c \ud55c\ub2e4\ub294 \uc810\uc5d0 \uc720\uc758\ud558\uc138\uc694.<\/p>\n<p><b>\ucffc\ub9ac \ud6a8\uc728\uc131\uc744 \uc704\ud55c \ucee4\ubc84\ub9c1 \uc778\ub371\uc2a4 \ud65c\uc6a9<br \/>\n<\/b>(<a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/capella\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c(DBaaS)<\/a>, <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/a>)<b><br \/>\n<\/b><\/p>\n<p>SQL++ \ucffc\ub9ac\ub97c \uc0ac\uc6a9\ud558\ub294 \uacbd\uc6b0 \ub2e4\uc74c\uc744 \ud1b5\ud574 \ucffc\ub9ac\ud574 \ubcf4\uc138\uc694. <a href=\"https:\/\/docs.couchbase.com\/server\/current\/n1ql\/n1ql-language-reference\/covering-indexes.html\" target=\"_blank\" rel=\"noopener\">\uc778\ub371\uc2a4 \ucee4\ubc84\ub9c1<\/a> \ub97c \uc0ac\uc6a9\ud558\uc138\uc694. \ucee4\ubc84\ub9c1 \uc778\ub371\uc2a4\ub294 \ub2e4\uc74c\uc744 \ud3ec\ud568\ud558\ub294 \uc778\ub371\uc2a4\uc785\ub2c8\ub2e4. <i>\ubaa8\ub450<\/i> \ud544\ub4dc\uc5d0 \ucffc\ub9ac\uac00 \ud544\uc694\ud558\ubbc0\ub85c \ub370\uc774\ud130 \uc11c\ube44\uc2a4\uc5d0\uc11c \uac00\uc838\uc624\uc9c0 \uc54a\uace0 \uc778\ub371\uc2a4\uc5d0\uc11c \uc804\uc801\uc73c\ub85c \ucffc\ub9ac\ub97c \uc81c\uacf5\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \ucee4\ubc84\ub41c \ucffc\ub9ac\ub294 \uc804\uccb4 \ubb38\uc11c\ub97c \uac00\uc838\uc624\uae30 \uc704\ud55c \ucd94\uac00 \ub124\ud2b8\uc6cc\ud06c \ud649\uc744 \ud53c\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. <b>\ub354 \ub098\uc740 \uc131\ub2a5 \uc81c\uacf5<\/b>. \ud544\ud130\ub9c1\ud558\ub294 \ud544\ub4dc\ub97c \ud3ec\ud568\ud558\ub3c4\ub85d Couchbase \ubcf4\uc870 \uc778\ub371\uc2a4\ub97c \uc124\uacc4\ud569\ub2c8\ub2e4. <i>\uadf8\ub9ac\uace0<\/i> \uac00\ub2a5\ud55c \uacbd\uc6b0 \ubc18\ud658\ud558\ub294 \ud544\ub4dc\ub97c \uc9c0\uc815\ud558\uc138\uc694. \uc774\ub294 \ud544\uc694\ud55c \ub370\uc774\ud130\ub97c \uc815\ud655\ud788 \ud3ec\ud568\ud558\ub294 \ud2b9\uc815 \uc778\ub371\uc2a4\ub97c Spark \uc791\uc5c5\uc5d0 \ub300\ud574 \uc0dd\uc131\ud558\ub294 \uac83\uc744 \uc758\ubbf8\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<p><b>\ubcd1\ubaa9 \ud604\uc0c1\uc744 \ubc29\uc9c0\ud558\uae30 \uc704\ud55c \uc778\ub371\uc2a4 \ubcf5\uc81c\ubcf8 \ubcf4\uc7a5<br \/>\n<\/b>(<a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/capella\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c(DBaaS)<\/a>, <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/a>)<\/p>\n<p>\ucee4\ubc84\ub9c1 \uc778\ub371\uc2a4\ub97c \uc0ac\uc6a9\ud558\ub294 \uac83\uacfc \ud568\uaed8 \uc778\ub371\uc2a4\uac00 \uc5ec\ub7ec \uc778\ub371\uc2a4 \ub178\ub4dc\uc5d0 \ubcf5\uc81c\ub418\uc5b4 \uc788\ub294\uc9c0 \ud655\uc778\ud558\uc138\uc694. <a href=\"https:\/\/docs.couchbase.com\/server\/current\/learn\/services-and-indexes\/indexes\/index-replication.html#index-replication\" target=\"_blank\" rel=\"noopener\">\uc778\ub371\uc2a4 \ubcf5\uc81c<\/a> \ub294 \uace0\uac00\uc6a9\uc131\uc744 \uc81c\uacf5\ud560 \ubfd0\ub9cc \uc544\ub2c8\ub77c \ucffc\ub9ac\uac00 <b>\ucc98\ub9ac\ub7c9 \uc99d\uac00\ub97c \uc704\ud574 \uc5ec\ub7ec \ub178\ub4dc\uc758 \uc778\ub371\uc2a4 \ubcf5\uc0ac\ubcf8 \uac04\uc5d0 \ub85c\ub4dc \ubc38\ub7f0\uc2f1\uc744 \uc218\ud589\ud569\ub2c8\ub2e4.<\/b>. \uc2e4\uc81c\ub85c (\uc608\ub97c \ub4e4\uc5b4) \uc778\ub371\uc2a4 \ub178\ub4dc\uac00 3\uac1c\uc778 \uacbd\uc6b0, \uc911\uc694\ud55c \uc778\ub371\uc2a4\ub97c \uc5ec\ub7ec \ub178\ub4dc\uc5d0 \ubcf5\uc81c\ud558\uba74 Spark \ucee4\ub125\ud130\uc758 \ubcd1\ub82c \ucffc\ub9ac\uac00 \ub2e8\uc77c \ub178\ub4dc\ub97c \ubaa8\ub450 \ub450\ub4dc\ub9ac\ub294 \uac83\uc774 \uc544\ub2c8\ub77c \ub2e4\ub978 \uc778\ub371\uc2a4 \ub178\ub4dc\uc5d0 \ub3c4\ub2ec\ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4.<\/p>\n<h3>\uc4f0\uae30 \uc791\uc5c5 \uc870\uc815\ud558\uae30<\/h3>\n<p><b>\ub300\ub7c9 \uc4f0\uae30\uc5d0\ub294 \ub370\uc774\ud130 \uc11c\ube44\uc2a4\ub97c \uc120\ud638\ud569\ub2c8\ub2e4.<\/b><br \/>\n(<a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/capella\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c(DBaaS)<\/a>, <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/a>)<\/p>\n<p>\ud0a4-\uac12 \ub370\uc774\ud130 \uc18c\uc2a4(<a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/spark-sql.html#_dataframe_persistence\" target=\"_blank\" rel=\"noopener\">\ub370\uc774\ud130 \uc11c\ube44\uc2a4<\/a>) \uc4f0\uae30 \uc791\uc5c5\uc5d0\ub294 \ucffc\ub9ac \uc11c\ube44\uc2a4 \ub300\uc2e0 \ub370\uc774\ud130 \uc11c\ube44\uc2a4\ub97c \uc0ac\uc6a9\ud558\uc138\uc694. \ub370\uc774\ud130 \uc11c\ube44\uc2a4\ub97c \ud1b5\ud55c \uc4f0\uae30(\uc9c1\uc811 KV \uc5c5\uc11c\ud2b8)\ub294 \uc77c\ubc18\uc801\uc73c\ub85c <b>\uba87 \ubc30 \ub354 \ube60\ub984<\/b> \ub97c \uc0ac\uc6a9\ud558\ub294 \uac83\ubcf4\ub2e4 \ud6e8\uc52c \ube60\ub985\ub2c8\ub2e4. \uc2e4\uc81c\ub85c \ub0b4\ubd80 \ubca4\uce58\ub9c8\ud06c\uc5d0 \ub530\ub974\uba74 KV\ub97c \ud1b5\ud55c \uc4f0\uae30 \uc791\uc5c5\uc740 \ub2e4\uc74c\uacfc \uac19\uc2b5\ub2c8\ub2e4. <i>3\ubc30 \ube60\ub978 \uc18d\ub3c4<\/i> \ub97c \uc0ac\uc6a9\ud558\ub294 \uac83\ubcf4\ub2e4 \ud6e8\uc52c \ub0ab\uc2b5\ub2c8\ub2e4. \uc774\ub294 \ub370\uc774\ud130 \uc11c\ube44\uc2a4\uac00 \ub2f4\ub2f9 \ub178\ub4dc\uc5d0 \uc9c1\uc811 \ubcd1\ub82c\ub85c \ubb38\uc11c\ub97c \uc218\uc9d1\ud560 \uc218 \uc788\uae30 \ub54c\ubb38\uc5d0 \uc791\uc5c5\ub2f9 \uc9c0\uc5f0 \uc2dc\uac04\uc774 \uc9e7\uc544\uc9c0\uae30 \ub54c\ubb38\uc785\ub2c8\ub2e4. KV \uc4f0\uae30\ub294 \uae30\ubcf8 \uc778\ub371\uc2a4 \uc774\uc678\uc758 \uc778\ub371\uc2a4 \uc5c5\ub370\uc774\ud2b8\ub97c \uc790\ub3d9\uc73c\ub85c \ud2b8\ub9ac\uac70\ud558\uc9c0 \uc54a\uc73c\ubbc0\ub85c \ud544\uc694\ud55c \uacbd\uc6b0 \uc0c8 \ubb38\uc11c\uc5d0 \ub300\ud574 \uc778\ub371\uc2a4\ub97c \ubcc4\ub3c4\ub85c \uc5c5\ub370\uc774\ud2b8\ud574\uc57c \ud55c\ub2e4\ub294 \uc810\uc5d0 \uc720\uc758\ud558\uc138\uc694.<\/p>\n<p><b>\ucffc\ub9ac \uc11c\ube44\uc2a4 \uc4f0\uae30\ub97c \uc704\ud55c \uc4f0\uae30 \ud30c\ud2f0\uc158 \ub298\ub9ac\uae30<\/b><br \/>\n(<a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/capella\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uce74\ud3a0\ub77c(DBaaS)<\/a>, <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/products\/server\/\" target=\"_blank\" rel=\"noopener\">\uc790\uccb4 \uad00\ub9ac \uc6b4\uc601 \ub370\uc774\ud130\ubca0\uc774\uc2a4<\/a>)<\/p>\n<p>\uad8c\uc7a5\ud558\uc9c0\ub294 \uc54a\uc9c0\ub9cc, \ub2e4\uc74c\uacfc \uac19\uc774 \uc0ac\uc6a9\ud558\uae30\ub85c \uacb0\uc815\ud55c \uacbd\uc6b0 <code>couchbase.query<\/code> \ub97c \uc0ac\uc6a9\ud558\ub294 \uacbd\uc6b0(\uc608: \uc4f0\uae30 \uc911\uc5d0 \uc11c\ubc84 \uce21 \ubcc0\ud658\uc744 \uc218\ud589\ud558\ub294 \uacbd\uc6b0) \ub9ce\uc740 \uc218\uc758 \uc4f0\uae30 \ud30c\ud2f0\uc158\uc744 \uc0ac\uc6a9\ud558\uc5ec \uc131\ub2a5\uc744 \ucd5c\uc801\ud654\ud558\uc138\uc694. \uc4f0\uae30 \uc804\uc5d0 \ub370\uc774\ud130\ud504\ub808\uc784\uc744 \ub2e4\uc2dc \ud30c\ud2f0\uc158\ud558\uc5ec Spark\uac00 \ub9ce\uc740 \uc4f0\uae30 \uc791\uc5c5\uc744 \ub3d9\uc2dc\uc5d0 \uc2e4\ud589\ud558\ub3c4\ub85d \ud560 \uc218 \uc788\uc2b5\ub2c8\ub2e4. \ub300\ub7b5\uc801\uc778 \uac00\uc774\ub4dc\ub77c\uc778\uc740 \ub2e4\uc74c\uacfc \uac19\uc740 \uc21c\uc11c\ub85c \uc0ac\uc6a9\ud558\ub294 \uac83\uc785\ub2c8\ub2e4. <b>\uc218\ubc31 \uac1c\uc758 \ud30c\ud2f0\uc158<\/b> \ub97c \uc0ac\uc6a9\ud558\uc5ec SQL++\ub97c \ud1b5\ud55c \ub300\uaddc\ubaa8 \uc4f0\uae30\ub97c \uc218\ud589\ud569\ub2c8\ub2e4. \uc608\ub97c \ub4e4\uc5b4, \uc57d <i>\ucffc\ub9ac \ub178\ub4dc CPU\ub2f9 128\uac1c\uc758 \ud30c\ud2f0\uc158<\/i> \ub294 \uc77c\ubd80 \uc0ac\uc6a9\uc790\uc5d0\uac8c \ud6a8\uacfc\uc801\uc778 \uc2dc\uc791\uc810\uc785\ub2c8\ub2e4. \uc989, \ucffc\ub9ac \ucf54\uc5b4\uac00 8\uac1c\uc778 \uacbd\uc6b0 \ucd5c\ub300 1024\uac1c\uc758 \ud30c\ud2f0\uc158\uc744 \uc0ac\uc6a9\ud574 \ubcf4\uc138\uc694. \uc774 \uc544\uc774\ub514\uc5b4\ub294 \ucc98\ub9ac\ub7c9\uc744 \ucd5c\ub300\ud654\ud558\uae30 \uc704\ud574 \ucda9\ubd84\ud55c \ubcd1\ub82c UPSERT \ubb38\uc73c\ub85c \ucffc\ub9ac \uc11c\ube44\uc2a4\ub97c \uac00\ub4dd \ucc44\uc6b0\ub294 \uac83\uc785\ub2c8\ub2e4. \ub108\ubb34 \ub192\uc740 \ub3d9\uc2dc\uc131\uc740 \ucffc\ub9ac \uc11c\ube44\uc2a4\uc5d0 \uacfc\ubd80\ud558\uac00 \uac78\ub9b4 \uc218 \uc788\uc73c\ubbc0\ub85c \uc8fc\uc758\ud558\uc5ec \ud074\ub7ec\uc2a4\ud130\uc5d0 \uc801\ud569\ud55c \uade0\ud615\uc744 \ucc3e\uc544\uc57c \ud569\ub2c8\ub2e4. Couchbase\uc758 \ucffc\ub9ac \ucc98\ub9ac\ub7c9\uc744 \ubaa8\ub2c8\ud130\ub9c1\ud558\uace0 \uadf8\uc5d0 \ub530\ub77c \uc870\uc815\ud558\uc138\uc694.<\/p>\n<p>\ud30c\ud2f0\uc158 \uc218\ub97c \ud074\ub7ec\uc2a4\ud130 \ub9ac\uc18c\uc2a4\uc5d0 \ub9de\uac8c \uc870\uc815\ud558\uace0, \ud604\uba85\ud558\uac8c \uc778\ub371\uc2f1\ud558\uace0, \uc791\uc5c5\uc5d0 \uc801\ud569\ud55c \uc11c\ube44\uc2a4\ub97c \uc120\ud0dd\ud558\ub294 \ub4f1 \uc774\ub7ec\ud55c \ud29c\ub2dd \ud301\uc744 \ub530\ub974\uba74 Couchbase-Spark \ud1b5\ud569\uc744 \uc704\ud55c \ucd5c\uc801\uc758 \uc131\ub2a5\uc744 \uc5bb\uc744 \uc218 \uc788\uc2b5\ub2c8\ub2e4. Spark\uc758 \uc791\uc5c5 \uba54\ud2b8\ub9ad\uacfc Couchbase\uc758 \uc131\ub2a5 \ud1b5\uacc4(Couchbase UI \ubc0f \ub85c\uadf8\uc5d0\uc11c \ud655\uc778 \uac00\ub2a5)\ub97c \ubaa8\ub450 \uc8fc\uc2dc\ud558\uc5ec \ubcd1\ubaa9 \ud604\uc0c1(\uc608: \ud558\ub098\uc758 \ucffc\ub9ac \ub178\ub4dc\uac00 \ubaa8\ub4e0 \uc791\uc5c5\uc744 \uc218\ud589\ud558\ub294 \uacbd\uc6b0 \ub610\ub294 \ub124\ud2b8\uc6cc\ud06c\uac00 \ud3ec\ud654 \uc0c1\ud0dc\uc778 \uacbd\uc6b0)\uc744 \ud30c\uc545\ud558\uace0 \ud544\uc694\uc5d0 \ub530\ub77c \uad6c\uc131\uc744 \uc870\uc815\ud558\uc138\uc694.<\/p>\n<h2>\ucee4\ubba4\ub2c8\ud2f0 \ubc0f \uc9c0\uc6d0<\/h2>\n<p>CouchBase PySpark \uc9c0\uc6d0\uc740 CouchBase\uc6a9 CouchBase Spark \ucee4\ub125\ud130 \ubc0f <a href=\"https:\/\/github.com\/couchbase\/couchbase-spark-connector\" target=\"_blank\" rel=\"noopener\">\ub294 \uc624\ud508 \uc18c\uc2a4\uc785\ub2c8\ub2e4.<\/a>\ub97c \ud1b5\ud574 \uae30\uc5ec\ud558\uace0, \ud53c\ub4dc\ubc31\uc744 \uc81c\uacf5\ud558\uace0, \ub300\ud654\uc5d0 \ucc38\uc5ec\ud560 \uac83\uc744 \uad8c\uc7a5\ud569\ub2c8\ub2e4. <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/pyspark.html\" target=\"_blank\" rel=\"noopener\">\ubb38\uc11c<\/a>\uc5d0 \uac00\uc785\ud558\uc5ec <a href=\"https:\/\/www.couchbase.com\/blog\/ko\/forums\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud3ec\ub7fc<\/a> \ub610\ub294 <a href=\"https:\/\/discord.com\/invite\/K7NPMPGrPk\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ub514\uc2a4\ucf54\ub4dc<\/a>.<\/p>\n<h2>\ucd94\uac00 \uc77d\uae30<\/h2>\n<p>\uc790\uc138\ud55c \uc815\ubcf4\uc640 \uc790\uc138\ud55c \ubb38\uc11c\ub294 \uacf5\uc2dd \ubb38\uc11c\ub97c \ucc38\uc870\ud558\uc138\uc694. <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/index.html\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uc2a4\ud30c\ud06c \ucee4\ub125\ud130 \ubb38\uc11c<\/a> \ubc0f \uad00\ub828 \uc139\uc158 <a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/pyspark.html\" target=\"_blank\" rel=\"noopener\">PySpark<\/a>:<\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/docs.couchbase.com\/spark-connector\/current\/pyspark.html\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 PySpark \ubb38\uc11c<\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/github.com\/couchbase\/couchbase-spark-connector\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \uc2a4\ud30c\ud06c \ucee4\ub125\ud130 \uae43\ud5c8\ube0c \ub9ac\ud3ec\uc9c0\ud1a0\ub9ac<\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/www.couchbase.com\/blog\/ko\/forums\/tag\/spark\/\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud3ec\ub7fc(\uc2a4\ud30c\ud06c \ucee4\ub125\ud130 \uc139\uc158)<\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/github.com\/couchbase\/couchbase-spark-connector\/tree\/master\/src\/test\/pyspark\/examples\/basic\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud30c\uc774\uc2a4\ud30c\ud06c \ucee4\ub125\ud130 \uc8fc\ud53c\ud130 \ub178\ud2b8\ubd81 \uc608\uc81c<\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/github.com\/couchbase\/couchbase-spark-connector\/blob\/master\/src\/test\/pyspark\/examples\/ml\/pyspark_ml_example_hotel_cancellations.ipynb\" target=\"_blank\" rel=\"noopener\">\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud30c\uc774\uc2a4\ud30c\ud06c ML \uc608\uc81c \uc8fc\ud53c\ud130 \ub178\ud2b8\ubd81: \ud638\ud154 \ucde8\uc18c<\/a><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>\ud589\ubcf5\ud55c \ucf54\ub529!<\/p>\n<p>\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ud300<\/p>\n<p><br style=\"font-weight: 400;\" \/><br style=\"font-weight: 400;\" \/><\/p>","protected":false},"excerpt":{"rendered":"<p>We are excited to announce the General Availability (GA) of the Python support for Couchbase Spark Connector, bringing first-class integration between Couchbase Server and Apache Spark to Python data engineers\u200b. This GA release means the connector is production-ready and fully [&hellip;]<\/p>","protected":false},"author":85357,"featured_media":17013,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[1815,10129,2242,2294,2225,1816,10133,9417,9139,9141,1812],"tags":[10105,10104],"ppma_author":[9987],"class_list":["post-17010","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-best-practices-and-tutorials","category-columnar","category-connectors","category-analytics","category-cloud","category-couchbase-server","category-engineering","category-performance","category-python","category-scala","category-n1ql-query","tag-data-engineering","tag-pyspark"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.7.1 (Yoast SEO v25.7) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Build Highly Scalable AI\/ML Applications With Couchbase and PySpark - The Couchbase Blog<\/title>\n<meta name=\"description\" content=\"Couchbase Spark Connector now supports PySpark! Build fast, scalable data pipelines and ML apps with Couchbase and Apache Spark in Python.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.couchbase.com\/blog\/ko\/pyspark-ga-couchbase-spark-connector\/\" \/>\n<meta property=\"og:locale\" content=\"ko_KR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark\" \/>\n<meta property=\"og:description\" content=\"Couchbase Spark Connector now supports PySpark! Build fast, scalable data pipelines and ML apps with Couchbase and Apache Spark in Python.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.couchbase.com\/blog\/ko\/pyspark-ga-couchbase-spark-connector\/\" \/>\n<meta property=\"og:site_name\" content=\"The Couchbase Blog\" \/>\n<meta property=\"article:published_time\" content=\"2025-04-07T17:48:50+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-07-08T16:15:31+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1340\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Vishal Dhiman, Sr. Product Manager\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Vishal Dhiman, Sr. Product Manager\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"12\ubd84\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/\"},\"author\":{\"name\":\"Vishal Dhiman, Sr. Product Manager\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/853c7ac2867fb9e801ff769321364961\"},\"headline\":\"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark\",\"datePublished\":\"2025-04-07T17:48:50+00:00\",\"dateModified\":\"2025-07-08T16:15:31+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/\"},\"wordCount\":2546,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png\",\"keywords\":[\"data engineering\",\"PySpark\"],\"articleSection\":[\"Best Practices and Tutorials\",\"Columnar\",\"Connectors\",\"Couchbase Analytics\",\"Couchbase Capella\",\"Couchbase Server\",\"Engineering\",\"High Performance\",\"Python\",\"Scala\",\"SQL++ \/ N1QL Query\"],\"inLanguage\":\"ko-KR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/\",\"url\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/\",\"name\":\"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark - The Couchbase Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png\",\"datePublished\":\"2025-04-07T17:48:50+00:00\",\"dateModified\":\"2025-07-08T16:15:31+00:00\",\"description\":\"Couchbase Spark Connector now supports PySpark! Build fast, scalable data pipelines and ML apps with Couchbase and Apache Spark in Python.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#breadcrumb\"},\"inLanguage\":\"ko-KR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png\",\"width\":2560,\"height\":1340,\"caption\":\"Couchbase PySpark connector released\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.couchbase.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#website\",\"url\":\"https:\/\/www.couchbase.com\/blog\/\",\"name\":\"The Couchbase Blog\",\"description\":\"Couchbase, the NoSQL Database\",\"publisher\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ko-KR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#organization\",\"name\":\"The Couchbase Blog\",\"url\":\"https:\/\/www.couchbase.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png\",\"width\":218,\"height\":34,\"caption\":\"The Couchbase Blog\"},\"image\":{\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/853c7ac2867fb9e801ff769321364961\",\"name\":\"Vishal Dhiman, Sr. Product Manager\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/image\/58e586f8e4645cc672ef6f140799b4b3\",\"url\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/09\/vishal-dhiman-couchbase.jpg\",\"contentUrl\":\"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/09\/vishal-dhiman-couchbase.jpg\",\"caption\":\"Vishal Dhiman, Sr. Product Manager\"},\"url\":\"https:\/\/www.couchbase.com\/blog\/ko\/author\/vishald\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark - The Couchbase Blog","description":"Couchbase Spark Connector now supports PySpark! Build fast, scalable data pipelines and ML apps with Couchbase and Apache Spark in Python.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.couchbase.com\/blog\/ko\/pyspark-ga-couchbase-spark-connector\/","og_locale":"ko_KR","og_type":"article","og_title":"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark","og_description":"Couchbase Spark Connector now supports PySpark! Build fast, scalable data pipelines and ML apps with Couchbase and Apache Spark in Python.","og_url":"https:\/\/www.couchbase.com\/blog\/ko\/pyspark-ga-couchbase-spark-connector\/","og_site_name":"The Couchbase Blog","article_published_time":"2025-04-07T17:48:50+00:00","article_modified_time":"2025-07-08T16:15:31+00:00","og_image":[{"width":2560,"height":1340,"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png","type":"image\/png"}],"author":"Vishal Dhiman, Sr. Product Manager","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Vishal Dhiman, Sr. Product Manager","Est. reading time":"12\ubd84"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#article","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/"},"author":{"name":"Vishal Dhiman, Sr. Product Manager","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/853c7ac2867fb9e801ff769321364961"},"headline":"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark","datePublished":"2025-04-07T17:48:50+00:00","dateModified":"2025-07-08T16:15:31+00:00","mainEntityOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/"},"wordCount":2546,"commentCount":0,"publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png","keywords":["data engineering","PySpark"],"articleSection":["Best Practices and Tutorials","Columnar","Connectors","Couchbase Analytics","Couchbase Capella","Couchbase Server","Engineering","High Performance","Python","Scala","SQL++ \/ N1QL Query"],"inLanguage":"ko-KR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/","url":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/","name":"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark - The Couchbase Blog","isPartOf":{"@id":"https:\/\/www.couchbase.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage"},"thumbnailUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png","datePublished":"2025-04-07T17:48:50+00:00","dateModified":"2025-07-08T16:15:31+00:00","description":"Couchbase Spark Connector now supports PySpark! Build fast, scalable data pipelines and ML apps with Couchbase and Apache Spark in Python.","breadcrumb":{"@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#breadcrumb"},"inLanguage":"ko-KR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/"]}]},{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#primaryimage","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2025\/04\/Pyspark-1.png","width":2560,"height":1340,"caption":"Couchbase PySpark connector released"},{"@type":"BreadcrumbList","@id":"https:\/\/www.couchbase.com\/blog\/pyspark-ga-couchbase-spark-connector\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.couchbase.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Build Highly Scalable AI\/ML Applications With Couchbase and PySpark"}]},{"@type":"WebSite","@id":"https:\/\/www.couchbase.com\/blog\/#website","url":"https:\/\/www.couchbase.com\/blog\/","name":"\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ube14\ub85c\uadf8","description":"NoSQL \ub370\uc774\ud130\ubca0\uc774\uc2a4, Couchbase","publisher":{"@id":"https:\/\/www.couchbase.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.couchbase.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ko-KR"},{"@type":"Organization","@id":"https:\/\/www.couchbase.com\/blog\/#organization","name":"\uce74\uc6b0\uce58\ubca0\uc774\uc2a4 \ube14\ub85c\uadf8","url":"https:\/\/www.couchbase.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/2023\/04\/admin-logo.png","width":218,"height":34,"caption":"The Couchbase Blog"},"image":{"@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/853c7ac2867fb9e801ff769321364961","name":"\ube44\uc0ec \ub514\ub9cc, Sr. \uc81c\ud488 \uad00\ub9ac\uc790","image":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/www.couchbase.com\/blog\/#\/schema\/person\/image\/58e586f8e4645cc672ef6f140799b4b3","url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/09\/vishal-dhiman-couchbase.jpg","contentUrl":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/09\/vishal-dhiman-couchbase.jpg","caption":"Vishal Dhiman, Sr. Product Manager"},"url":"https:\/\/www.couchbase.com\/blog\/ko\/author\/vishald\/"}]}},"authors":[{"term_id":9987,"user_id":85357,"is_guest":0,"slug":"vishald","display_name":"Vishal Dhiman, Sr. Product Manager","avatar_url":{"url":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/09\/vishal-dhiman-couchbase.jpg","url2x":"https:\/\/www.couchbase.com\/blog\/wp-content\/uploads\/sites\/1\/2024\/09\/vishal-dhiman-couchbase.jpg"},"first_name":"Vishal","last_name":"Dhiman, Sr. Product Manager","user_url":"","author_category":"","description":""}],"_links":{"self":[{"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/posts\/17010","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/users\/85357"}],"replies":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/comments?post=17010"}],"version-history":[{"count":0,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/posts\/17010\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/media\/17013"}],"wp:attachment":[{"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/media?parent=17010"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/categories?post=17010"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/tags?post=17010"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.couchbase.com\/blog\/ko\/wp-json\/wp\/v2\/ppma_author?post=17010"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}