Moving SQL database content to Couchbase

Since the GA release of N1QL, we get a lot of questions about moving content from a SQL database to Couchbase. There are many different ways to do so. Today, I have chosen what is probably the simplest. I will transform each row of each table in a JsonDocument and store it in Couchbase. I will do my test with Postgres and their sample dataset inspired by MySQL Sakila sample. I will use Java, but the guidelines presented here are applicable to other languages.

Connecting to a running SQL database

Since I am using Java, I will implement Spring Boot and their JDBC package, which handles the db connection for me. All I have to do is set up the right dependencies and properties to configure the JdbcTemplate. This object makes running a SQL query a breeze.

Dependencies

To make sure you have everything configured neatly and automatically you need the following dependencies:


        dependencies {
            compile "org.springframework.boot:spring-boot-starter",
                    "org.springframework.boot:spring-boot-starter-data-jpa",
                    "org.postgresql:postgresql:9.4-1206-jdbc4"
        }

dependencies {

compile "org.springframework.boot:spring-boot-starter",

"org.springframework.boot:spring-boot-starter-data-jpa",

"org.postgresql:postgresql:9.4-1206-jdbc4"

}

I am testing with Postgres but you could add any other driver supported by Spring JDBC. The spring-boot-starter-data-jpa will allow me to inject the preconfigured JdbcTemplate.

Configuration

To make sure the Spring framework finds your database, add the following properties to your configuration file (for example, src/main/resources/application.properties).


        spring.jpa.database=POSTGRESQL
        spring.datasource.platform=postgres
        spring.jpa.show-sql=true
        spring.jpa.hibernate.ddl-auto=create-drop
        spring.database.driverClassName=org.postgresql.Driver
        spring.datasource.url=jdbc:postgresql://192.168.99.100:5432/dvdrental
        spring.datasource.username=postgres
        spring.datasource.password=password

spring.jpa.データベース=POSTGRESQL

spring.datasource.プラットフォーム=postgres

spring.jpa.ショー-sql=真の

spring.jpa.hibernate.ddl-auto=作成する-drop

spring.データベース.driverClassName=オルグ.postgresql.Driver

spring.datasource.url=jdbc:postgresql://192.168.99.100:5432/dvdrental

spring.datasource.ユーザー名=postgres

spring.datasource.パスワード=パスワード

Of course you would need to fine-tune this according to the database you are using. Here I am using Postgres running on 192.168.99.100 with default port 5432. The name of the database I want to use is dvdrental.

Code

If everything is configured correctly you should be able to inject the JdbcTemplate and start querying your SQL DB.


     @Autowired
     JdbcTemplate jdbcTemplate;

     @Override
     public void doStuff() throws Exception {
      String sql = "SELECT id FROM table";
         Long id = jdbcTemplate.queryForObject(sql, Long.class);
     }

@Autowired

JdbcTemplate jdbcTemplate;

@オーバーライド

公開ボイド doStuff() スロー例外 {

ストリング sql = "SELECT id FROM table";

Long アイドル = jdbcTemplate.queryForObject(sql, Long.クラス);

}

Connecting to Couchbase

My goal is to move content from a SQL database to Couchbase, so we also need a Couchbase connection.

Dependencies

Working with Couchbase on your Java project requires you to add the following dependency:


   dependencies {
        compile "com.couchbase.client:java-client:2.2.3"
    }

dependencies {

compile "com.couchbase.client:java-client:2.2.3"

}

This will give you access to the Couchbase Java SDK.

Configuration

A basic Couchbase configuration requires basically three properties: one server IP address, a bucket name, and a bucket password. Doing this in a Spring Boot fashion would look like this:


        @Configuration
        public class Database {

            @Value("${hostname}")
            private String hostname;

            @Value("${bucket}")
            private String bucket;

            @Value("${password}")
            private String password;

            public @Bean Cluster cluster() {
                return CouchbaseCluster.create(hostname);
            }

            public @Bean Bucket bucket() {
                return cluster().openBucket(bucket, password);
            }

        }

@Configuration

公開クラスデータベース {

@Value("${hostname}")

プライベートストリング hostname;

@Value("${bucket}")

プライベートストリングバケット;

@Value("${password}")

プライベートストリングパスワード;

公開 @Bean クラスタークラスタ() {

戻る CouchbaseCluster.作成する(hostname);

}

公開 @Bean バケットバケット() {

戻るクラスタ().オープンバケット(バケット, パスワード);

}

The properties hostname, bucket, and password can be added directly to your application properties file.


   # Hostnames, comma separated list of Couchbase node IP or hostname
    hostnames: localhost,127.0.0.1
    # Bucket name
    bucket: default
    # Bucket password
    password:

# Hostnames, comma separated list of Couchbase node IP or hostname

hostnames: localhost,127.0.0.1

# Bucket name

バケット: デフォルト

# Bucket password

パスワード:

Code

With Couchbase, the equivalent granularity level of a database would be a bucket, which is where you store documents. With the previous configuration you can simply inject a bucket and start playing around.


        @Autowired
        Bucket bucket;

        @Override
        public void doStuff() throws Exception {
            JsonDocument doc = bucket.get("key");
        }

@Autowired

バケットバケット;

@オーバーライド

公開ボイド doStuff() スロー例外 {

JsonDocument ドク = バケット.得る("key");

}

Tables

At this point you have a connection to a SQL database and Couchbase. Now we can start moving things around. The easiest way to move data is to consider each row of each table as a document.

Getting the SQL schema

Let’s start by getting the schema of the database automatically using the JdbcTemplate. The interesting object here is DatabaseMetaData, which can give us the complete structure of the database. The API is not the easiest to use, but at least it’s documented.

I will map the result of the DatabaseMetaData query to a list of Table and Column. I have created the following Java class to do so:


         public class Table {

            private String name;

            private List<Column> columns = new ArrayList<Column>();

            private String primaryKey;

            public Table(String tableName) {
             this.name = tableName;
            }

            public void setPrimaryKey(String primaryKey) {
             this.primaryKey = primaryKey;
            }

            public void addColumn(String name, int type) {
             columns.add(new Column(name, type));
            }

            public String getName() {
             return name;
            }

            public List<Column> getColumns() {
             return columns;
            }

            public String getPrimaryKey() {
             return primaryKey;
            }

            public JsonObject toJsonObject() {
             JsonObject obj = JsonObject.create();
             JsonArray jsonColumns = JsonArray.create();
             for (Column col : columns) {
                    jsonColumns.add(col.toJsonObject());
             }
             obj.put("tableName", name);
             obj.put("primaryKey", primaryKey);
             obj.put("columns", jsonColumns);
             return obj;
            }
     }

     public class Column {

            private String name;

            private int type;

            public Column(String name, int type) {
             this.name = name;
             this.type = type;
            }

            public String getName() {
             return name;
            }

            public int getType() {
             return type;
            }

            public JsonObject toJsonObject() {
             JsonObject obj = JsonObject.create();
             obj.put("name", name);
             obj.put("type", type);
             return obj;
            }

     }

公開クラス Table {

プライベートストリング名称;

プライベートリスト<Column> 列 = 新しい配列リスト<Column>();

プライベートストリング primaryKey;

公開 Table(ストリング tableName) {

これ.名称 = tableName;

}

公開ボイド setPrimaryKey(ストリング primaryKey) {

これ.primaryKey = primaryKey;

}

公開ボイド addColumn(ストリング名称, イントタイプ) {

列.追加(新しい Column(名称, タイプ));

}

公開ストリング getName() {

戻る名称;

}

公開リスト<Column> getColumns() {

戻る列;

}

公開ストリング getPrimaryKey() {

戻る primaryKey;

}

公開 JsonObject toJsonObject() {

JsonObject obj = JsonObject.作成する();

JsonArray jsonColumns = JsonArray.作成する();

にとって (Column col : 列) {

jsonColumns.追加(col.toJsonObject());

}

obj.置く("tableName", 名称);

obj.置く("primaryKey", primaryKey);

obj.置く("columns", jsonColumns);

戻る obj;

}

公開クラス Column {

プライベートストリング名称;

プライベートイントタイプ;

公開 Column(ストリング名称, イントタイプ) {

これ.名称 = 名称;

これ.タイプ = タイプ;

}

公開ストリング getName() {

戻る名称;

}

公開イント getType() {

戻るタイプ;

}

公開 JsonObject toJsonObject() {

JsonObject obj = JsonObject.作成する();

obj.置く(名前, 名称);

obj.置く(「タイプ, タイプ);

戻る obj;

}

It’s definitely not the most exciting code to write, but at the end you get a JSON representation of your SQL database tables.


        public void getDatabaseSchema() throws Exception {
         // get Database Medatadata objects to retrieve Tables schema
        DatabaseMetaData databaseMetadata = jdbcTemplate.getDataSource().getConnection().getMetaData();
            List<String> tableNames = new ArrayList<String>();
            // Get tables names
            ResultSet result = databaseMetadata.getTables(catalog, schemaPattern, tableNamePattern, types);
            while (result.next()) {
             String tablename = result.getString(3);
             String tableType = result.getString(4);
             // make sure we only import table(as oppose to Views, counter etc...)
             if (!tablename.isEmpty() && "TABLE".equals(tableType)) {
                    tableNames.add(tablename);
                    log.debug("Will import table " + tablename);
             }
            }
            // Map the tables schema to Table objects
            Map<String, Table> tables = new HashMap<String, Table>();
            JsonObject tablesSchema = JsonObject.create();
            for (String tableName : tableNames) {
             result = databaseMetadata.getColumns(catalog, schemaPattern, tableName, columnNamePattern);
             Table table = new Table(tableName);
             while (result.next()) {
                    String columnName = result.getString(4);
                    // Maps to JDBCType enum
                    int columnType = result.getInt(5);
                    table.addColumn(columnName, columnType);
             }
             result = databaseMetadata.getPrimaryKeys(catalog, schemaPattern, tableName);
             while (result.next()) {
                    String columnName = result.getString(4);
                    table.setPrimaryKey(columnName);
             }
             tables.put(tableName, table);
             tablesSchema.put(tableName, table.toJsonObject());
            }
            JsonDocument schemaDoc = JsonDocument.create(tablesSchemaId, tablesSchema);
            JsonDocument doc = bucket.upsert(schemaDoc);
         }

公開ボイド getDatabaseSchema() スロー例外 {

// get Database Medatadata objects to retrieve Tables schema

DatabaseMetaData databaseMetadata = jdbcTemplate.getDataSource().getConnection().getMetaData();

リスト<ストリング> tableNames = 新しい配列リスト<ストリング>();

// Get tables names

ResultSet 結果 = databaseMetadata.getTables(catalog, schemaPattern, tableNamePattern, types);

同時に (結果.次のページ()) {

ストリング tablename = 結果.ゲットストリング(3);

ストリング tableType = 結果.ゲットストリング(4);

// make sure we only import table(as oppose to Views, counter etc...)

もし (!tablename.isEmpty() && "TABLE".equals(tableType)) {

tableNames.追加(tablename);

ログ.デバッグ("Will import table " + tablename);

}

// Map the tables schema to Table objects

地図<ストリング, Table> tables = 新しいハッシュマップ<ストリング, Table>();

JsonObject tablesSchema = JsonObject.作成する();

にとって (ストリング tableName : tableNames) {

結果 = databaseMetadata.getColumns(catalog, schemaPattern, tableName, columnNamePattern);

Table テーブル = 新しい Table(tableName);

同時に (結果.次のページ()) {

ストリング columnName = 結果.ゲットストリング(4);

// Maps to JDBCType enum

イント columnType = 結果.getInt(5);

テーブル.addColumn(columnName, columnType);

}

結果 = databaseMetadata.getPrimaryKeys(catalog, schemaPattern, tableName);

同時に (結果.次のページ()) {

ストリング columnName = 結果.ゲットストリング(4);

テーブル.setPrimaryKey(columnName);

}

tables.置く(tableName, テーブル);

tablesSchema.置く(tableName, テーブル.toJsonObject());

}

JsonDocument schemaDoc = JsonDocument.作成する(tablesSchemaId, tablesSchema);

JsonDocument ドク = バケット.アップサート(schemaDoc);

}

Content

Here’s the fun part. This is where we start mapping a table row to a JsonDocument. The previous section puts us in a state where we can retrieve the name of all the tables. From one table name, we can create a SQL query that returns every row of the table.

Spring has a mechanism that allows you to define a RowMapper. For each row returned by the query, you can return the object you want. Since I am using Couchbase, I want a JsonDocument.

Following is an implementation example. This RowMapper needs a Table object previously defined; therefore, we have to implement the mapRow method. There are several things we need to do here.

The first task is to create a unique key. As rows are scoped by tables, some id can be exactly the same for rows in different tables. But documents are scoped by bucket, so we need to create a unique document key that reflects the row id and the table name. To keep track of where the document comes from, I will also add a _tableName field for the table name.

Then, the exciting step comes from the type mapping. JSON supports fewer types than a SQL database, so we have some conversion to do here. This is what the getJsonTypedValue method does. It makes sure most JDBC type can be converted to a native JSON type (String, number, boolean, array, object, null). At the end, we have a JsonDocument that can be saved in Couchbase.


        public class JSONRowMapper implements RowMapper<Document> {
        
        Table table;

        public JSONRowMapper(Table table) {
         this.table = table;
        }

        public JsonDocument mapRow(ResultSet rs, int rowNum) throws SQLException {
         String id = table.getName() + "::" + rs.getString(table.getPrimaryKey());
         JsonObject obj = JsonObject.create();
         obj.put("_tableName", table.getName());
         for (Column col : table.getColumns()) {
                Object value = getJsonTypedValue(col.type, rs.getObject(col.name), col.name);
                obj.put(col.name, value);
         }
         return JsonDocument.create(id, obj);
        }

        public Object getJsonTypedValue(int type, Object value, String name) throws SQLException {
         if (value == null) {
                return null;
         }
         JDBCType current = JDBCType.valueOf(type);
         switch (current) {
         case TIMESTAMP:
                Timestamp timestamp = (Timestamp) value;
                return timestamp.getTime();
         case TIMESTAMP_WITH_TIMEZONE:
                Timestamp ts = (Timestamp) value;
                JsonObject tsWithTz = JsonObject.create();
                tsWithTz.put("timestamp", ts.getTime());
                tsWithTz.put("timezone", ts.getTimezoneOffset());
                return tsWithTz;
         case DATE:
                Date sqlDate = (Date) value;
                return sqlDate.getTime();
         case DECIMAL:
         case NUMERIC:
                BigDecimal bigDecimal = (BigDecimal) value;
                return bigDecimal.doubleValue();
         case ARRAY:
                Array array = (Array) value;
                Object[] objects = (Object[]) array.getArray();
                return JsonArray.from(objects);
         case BINARY:
         case BLOB:
         case LONGVARBINARY:
                return Base64.getEncoder().encodeToString((byte[]) value);
         case OTHER:
         case JAVA_OBJECT:
                // database specific, default to String value
                return value.toString();
         default:
                return value;
         }
        }
 }

公開クラス JSONRowMapper 用具 RowMapper<ドキュメント> {

Table テーブル;

公開 JSONRowMapper(Table テーブル) {

これ.テーブル = テーブル;

}

公開 JsonDocument mapRow(ResultSet rs, イント rowNum) スロー SQLException {

ストリングアイドル = テーブル.getName() + "::" + rs.ゲットストリング(テーブル.getPrimaryKey());

JsonObject obj = JsonObject.作成する();

obj.置く("_tableName", テーブル.getName());

にとって (Column col : テーブル.getColumns()) {

対象価値 = getJsonTypedValue(col.タイプ, rs.getObject(col.名称), col.名称);

obj.置く(col.名称, 価値);

}

戻る JsonDocument.作成する(アイドル, obj);

}

公開対象 getJsonTypedValue(イントタイプ, 対象価値, ストリング名称) スロー SQLException {

もし (価値 == ヌル) {

戻るヌル;

}

JDBCType current = JDBCType.バリューオブ(タイプ);

switch (current) {

case TIMESTAMP:

Timestamp タイムスタンプ = (Timestamp) 価値;

戻るタイムスタンプ.getTime();

case TIMESTAMP_WITH_TIMEZONE:

Timestamp ts = (Timestamp) 価値;

JsonObject tsWithTz = JsonObject.作成する();

tsWithTz.置く(「タイムスタンプ, ts.getTime());

tsWithTz.置く("timezone", ts.getTimezoneOffset());

戻る tsWithTz;

case 日付:

日付 sqlDate = (日付) 価値;

戻る sqlDate.getTime();

case DECIMAL:

case NUMERIC:

BigDecimal bigDecimal = (BigDecimal) 価値;

戻る bigDecimal.doubleValue();

case ARRAY:

配列 array = (配列) 価値;

対象[] objects = (対象[]) array.getArray();

戻る JsonArray.より(objects);

case BINARY:

case BLOB:

case LONGVARBINARY:

戻る Base64.getEncoder().encodeToString((バイト[]) 価値);

case OTHER:

case JAVA_OBJECT:

// database specific, default to String value

戻る価値.文字列();

デフォルト:

戻る価値;

}

With that RowMapper it makes things really easy. We can loop through the table’s name, run the query, and save the results in Couchbase. Doing this in a synchronous fashion would look like this:


        for (String tableName : tableNames) {
         String sql = "select * from " + tableName + ";";
         List<JsonDocument&gt; rs = jdbcTemplate.query(sql, new JSONRowMapper(tables.get(tableName)));
         if (!rs.isEmpty()) {
             for (JsonDocument doc : rs) {
               bucket.upsert(doc); 
                } 
         }
        }
        bucket.upsert(schemaDoc);

にとって (ストリング tableName : tableNames) {

ストリング sql = "select * from " + tableName + ";";

リスト<JsonDocument> rs = jdbcTemplate.クエリー(sql, 新しい JSONRowMapper(tables.得る(tableName)));

もし (!rs.isEmpty()) {

にとって (JsonDocument ドク : rs) {

バケット.アップサート(ドク);

}

バケット.アップサート(schemaDoc);

But I prefer the async version:


   Observable.from(tableNames).flatMap(s -> {
        String sql = String.format("Select * from %s;", s);
        return Observable.from(jdbcTemplate.query(sql, new JSONRowMapper(tables.get(s))));
    })
    // start by a jsonDocument containing the tables to be imported.
    .startWith(schemaDoc).flatmap(doc -> asyncBucket.upsert(doc));

Observable.より(tableNames).flatMap(s -> {

ストリング sql = ストリング.フォーマット("Select * from %s;", s);

戻る Observable.より(jdbcTemplate.クエリー(sql, 新しい JSONRowMapper(tables.得る(s))));

})

// start by a jsonDocument containing the tables to be imported.

.スタートウィズ(schemaDoc).flatmap(ドク -> asyncBucket.アップサート(ドク));

Here I am not using the full potential of Rx; take a look at this function that writes a doc to Couchbase and handles timeout and error management.

For convenience, I have packaged all steps implemented and previously shown in a single project. All you have to do is make sure your properties file is configured right and run the importer:


   $ ./bin/couchbase-java-importer myConfiguration.properties

$ ./ビン/カウチベース-ジャワ-importer myConfiguration.プロパティ

を見てみよう。 README file for more information.

結論

Today we have learn how to move SQL content to Couchbase, but there is still some work to do. Next time I will tell you how to move the SQL business logic to the application layer.

ローラン・ドギャン

この記事をシェアする

Platform

Self-Managed

Services

Capabilities

Why Couchbase?

Migrate to Capella

By Use Case

By Industry

By Application Need

Popular Docs

By Developer Role

Quickstart

Resource Center

About

Partnerships

Our Services

Partners: Register a Deal

Ready to register a deal with Couchbase?

Marriott

Moving SQL database content to Couchbase

Connecting to a running SQL database

Dependencies

Configuration

Code

Connecting to Couchbase

Dependencies

Configuration

Code

Tables

Getting the SQL schema

Content

結論

Couchbaseブログの更新をメールで受け取る

著者

投稿者ローラン・ドギャン

コメントを残すコメントをキャンセル

Couchbase Capellaを始める準備はできましたか？

建設開始

カペラを無料で利用

連絡先

Platform

Self-Managed

Services

Capabilities

Why Couchbase?

Migrate to Capella

By Use Case

By Industry

By Application Need

Popular Docs

By Developer Role

Quickstart

Resource Center

About

Partnerships

Our Services

Partners: Register a Deal

Ready to register a deal with Couchbase?

Marriott

Moving SQL database content to Couchbase

Connecting to a running SQL database

Dependencies

Configuration

Code

Connecting to Couchbase

Dependencies

Configuration

Code

Tables

Getting the SQL schema

Content

結論

Couchbaseブログの更新をメールで受け取る

著者

投稿者 ローラン・ドギャン

コメントを残す コメントをキャンセル

Couchbase Capellaを始める準備はできましたか？

建設開始

カペラを無料で利用

連絡先

投稿者ローラン・ドギャン

コメントを残すコメントをキャンセル