
How to Clone Couchbase Clusters for CI/CD On-Demand Ephemeral Environments

Continuous Integration and Continuous Deployment are now common software development practices. In the world of databases, this translates into a need for on-demand, stateful, ephemeral environments.

Provisioning a stateless environment is not tied to any particular source of data. All that is needed is to run the code you want to test in your CI environment. This is the basis of most CI/CD tools and won’t be covered in this article. 

The slightly harder part comes from the dependencies the application needs in order to be tested properly, often referred to as external services, Couchbase being one of them. There are different ways to get those: through Docker containers, hosted in your test infrastructure, or via an external as-a-Service solution. It does not really matter which, as long as they are available while running your tests. A good practice is to use environment variables to refer to those instances, as in the illustration below.
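For instance, in a Nushell-based CI step, that could look like the following. The variable names are purely illustrative, not a convention of any particular tool:

```nu
# Illustrative only: expose the test instance through environment variables
# so the test code never hard-codes a connection string or credentials.
$env.COUCHBASE_CONNSTR = "couchbase://127.0.0.1"
$env.COUCHBASE_USERNAME = "Administrator"
$env.COUCHBASE_PASSWORD = "password"
```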

Assuming these services are running, like a Couchbase Free Tier instance or a Docker container, the next step is to make sure that they are configured correctly, and seeded with the data needed for the test.

A while ago, I posted about using Couchbase Shell in GitHub Actions. That post covers the basics of using Couchbase Shell with GitHub Actions, but the approach applies to most CI/CD solutions as well. Today, I want to go further and show you some useful scripts to clone a cluster, or elements of a cluster, for your on-demand environments.

Using Couchbase Shell to clone environments

When using Couchbase Shell, the first question that comes to mind when wanting to do something is: is there a function for that? As of now, we don't have a function to clone anything. Most of the available functions reflect our APIs' capabilities, and we have no cloning API today. But we have the ability to write scripts, which means we can make our own!

When managing databases, the first need that comes to mind is often to recreate the structure and schemas. As Couchbase is schemaless, this only consists of the buckets, scopes, collections, and indexes that exist in the source cluster. The first step is to export that structure so it can be reimported later. The function below lists every bucket, then the scopes and collections inside each one, and adds them to an array. It then lists all indexes and adds them to the output JSON.
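Here is a minimal sketch of such a function. It relies on the cbsh buckets, scopes, collections, and query indexes commands; the --bucket/--scope/--definitions flags and the column names (name, scope, collection, definition) are assumptions to double-check with help <command> in your cbsh version:

```nu
# Export buckets, scopes, collections, and index definitions to a JSON file.
def cluster-export [filename: string] {
    let bucket_list = (buckets | get name | each { |bucket|
        {
            name: $bucket,
            scopes: (scopes --bucket $bucket | get scope | each { |scope|
                {
                    name: $scope,
                    collections: (collections --bucket $bucket --scope $scope | get collection)
                }
            })
        }
    })
    # Keep the index definitions as SQL++ statements so they can be replayed later
    let index_list = (query indexes --definitions | get definition)
    { buckets: $bucket_list, indexes: $index_list } | save $filename
}
```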

This works because, under the hood, Couchbase Shell uses Nushell, a new type of shell that is portable (meaning it works the same way on Linux, Windows, or OS X, which is great for CI/CD scripts that have to support different operating systems) and that treats any structured data as a dataframe, making the manipulation of JSON extremely easy.

To try it out, run cbsh, then source the file containing the function; for me it's ci_scripts.nu. I already have a cluster configured in my cbsh config, called local:
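Something along these lines, assuming cb-env cluster is how you switch the active cluster in your cbsh version:

```nu
> source ci_scripts.nu
> cb-env cluster local
> cluster-export local-cluster-export.json
```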

Now if you open local-cluster-export.json, you will get the structure of your cluster:
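With the sketch above, the export would look something like this (shown here for the travel-sample bucket only, with index definitions trimmed for brevity):

```json
{
  "buckets": [
    {
      "name": "travel-sample",
      "scopes": [
        { "name": "_default", "collections": ["_default"] },
        { "name": "inventory", "collections": ["airline", "airport", "hotel", "landmark", "route"] }
      ]
    }
  ],
  "indexes": []
}
```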

For the purpose of this test, I have deleted the travel-sample bucket so it can be reimported later: buckets drop travel-sample.

The next logical step is to have a function that takes this file as input and recreates the complete structure in another cluster:
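A sketch of the matching import function; again, the exact cbsh signatures (the RAM quota argument of buckets create, the --bucket/--scope flags) are assumptions to verify with help:

```nu
# Recreate buckets, scopes, collections, and indexes from an export file.
def cluster-import [filename: string] {
    let export = (open $filename)
    $export.buckets | each { |bucket|
        # 256 MiB RAM quota as a placeholder; the original quota is not part of the export
        buckets create $bucket.name 256
        # Note: a fuller script would skip the _default scope and collection, which already exist
        $bucket.scopes | each { |scope|
            scopes create $scope.name --bucket $bucket.name
            $scope.collections | each { |collection|
                collections create $collection --bucket $bucket.name --scope $scope.name
            }
        }
    }
    # Replay the captured index definitions as SQL++ statements
    $export.indexes | each { |idx| query $idx }
}
```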

Now to run that function:
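For instance, pointing cbsh at a destination cluster (here a hypothetical remote identifier from the cbsh config) and replaying the export:

```nu
> cb-env cluster remote
> cluster-import local-cluster-export.json
```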

And there you have it: functions that allow you to export and import the data structure from one cluster to another. While this is a good starting point, there are still open questions about how to reimport the data itself, and about granularity. Also, you may not want to export and import a complete cluster.

Filtering buckets to import is fairly easy as Nushell allows you to filter dataframes:
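For example, the following keeps only the travel-sample bucket from the export, along with its indexes (matched here with a simple string filter on the definitions):

```nu
let export = (open local-cluster-export.json)
{
    buckets: ($export.buckets | where name == "travel-sample"),
    indexes: ($export.indexes | where $it =~ "travel-sample")
} | save travel-sample-export.json
```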

This will recreate a JSON object containing only a bucket named travel-sample and indexes for this bucket.

From there you should be all set to manage basic cluster structure. What about the data? There are different ways you can import data with cbsh, as it covers most key/value operations as well as any INSERT/UPSERT queries. And then we have the doc import command. Its usage is fairly straightforward: all you need is a list of rows with a field that identifies the document id. This can be anything that can be turned into a dataframe by Nushell (XML, CSV, TSV, Parquet, and more). And of course, it can be a JSON file produced by a Couchbase SQL++ query. This is an example that saves a query result to a file and imports that file back into a collection:
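A possible sketch, using the travel-sample inventory.hotel collection; the --id-column flag name and the remote cluster identifier are assumptions to check against help doc import and your own cbsh config:

```nu
# Export the query result to a JSON file, exposing the document key as an `id` column
query "SELECT META(h).id, h.* FROM `travel-sample`.inventory.hotel AS h" | save hotels.json

# Point cbsh at the destination collection and import, using `id` as the document key
cb-env cluster remote
cb-env bucket travel-sample
cb-env scope inventory
cb-env collection hotel
doc import hotels.json --id-column id
```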


That's one particular example, but the whole point of using a scripting language is to make these scripts your own. You will find a more complete example in this GitHub Gist. It supports environment variables for the source and destination, and lets you decide whether to clone all buckets of a cluster, or a specific bucket, scope, or collection.

Don't hesitate to drop us a comment here or on Discord; we are always looking for suggestions to improve the overall Couchbase experience.


Author

Posted by Laurent Doguin

Laurent is a nerdy metal head who lives in Paris. He mostly writes code in Java and structured text in AsciiDoc, and often talks about data, reactive programming and other buzzwordy stuff. He is also a former Developer Advocate for Clever Cloud and Nuxeo where he devoted his time and expertise to helping those communities grow bigger and stronger. He now runs Developer Relations at Couchbase.
