Migrating From MySQL to YugabyteDB Using YugabyteDB Voyager
In this article, readers will learn how to use YugabyteDB Voyager to migrate a web application from MySQL database to a distributed YugabyteDB database cluster.
Join the DZone community and get the full member experience.
Join For FreeIn this article, I’m going to demonstrate how you can migrate a comprehensive web application from MySQL to YugabyteDB using the open-source data migration engine YugabyteDB Voyager.
Nowadays, many people migrate their applications from traditional, single-server relational databases to distributed database clusters. This helps improve availability, scalability, and performance.
Migrating to YugabyteDB allows engineers to use a familiar SQL interface while benefiting from the data-resiliency and performance characteristics of distributed databases.
YugaSocial Application
I’ve developed an application called YugaSocial, built to run on MySQL.
The YugaSocial application is a Facebook clone with the ability to make posts, follow users, comment on posts, and more!
Let’s start by deploying and connecting to a Google Cloud SQL database running MySQL. Later, we’ll migrate our data to a multi-node YugabyteDB Managed cluster.
Getting Started With MySQL
We could run MySQL on our machines using a local installation or in Docker, but I’m going to demonstrate how to migrate a database hosted on the Google Cloud Platform (GCP) to YugabyteDB Managed.
Setting Up Google Cloud SQL
I’ve deployed a MySQL instance on Google Cloud SQL named yugasocial
and set my public IP address to the authorized networks list so I can connect directly from my machine.
While beneficial for demonstration purposes, I’d recommend connecting securely from inside a VPC, with SSL certificates to properly secure your data transfers.
Connecting YugaSocial to MySQL in Node.js
Connecting to our MySQL instance in the cloud is easy with the MySQL driver for Node.js.
This is an application code snippet that connects to the MySQL instance:
// connect.js
...
import mysql from "mysql";
if (process.env.DB_TYPE === "mysql") {
const pool = mysql.createPool({
host: process.env.DB_HOST,
port: process.env.DB_PORT,
user: process.env.DB_USER,
password: process.env.DB_PASSWORD,
database: process.env.DB_NAME,
connectionLimit: 100
});
}
I’ve created a connection pool with up to 100 established connections. By setting environment variables with our Google Cloud SQL instance configuration, and running the application, we can confirm that our database has been configured properly:
> DB_TYPE=mysql DB_USER=admin DB_HOST=[HOST] DB_HOST=[PASSWORD] node index.js
Connection to MySQL verified.
Server running on port 8800.
After verifying our MySQL database running in the cloud, we can start migrating to YugabyteDB Managed.
Setting Up YugabyteDB Managed
It takes less than five minutes to get started with YugabyteDB Managed. First, create an account then follow the steps to create a YugabyteDB cluster.
I’ve chosen to deploy a three-node cluster to GCP, in the us-west-1 region. This configuration will provide fault tolerance across availability zones.
Add your IP address to the cluster allow list so you can connect from your machine to the remote database and download the database credentials before creating your cluster.
Once our cluster has been deployed, we’re ready to begin migrating with YugabyteDB Voyager.
Migrating to YugabyteDB
Having verified our MySQL deployment, it’s time to migrate from Cloud SQL to YugabyteDB using the YugabyteDB Voyager CLI.
YugabyteDB Voyager is a powerful, open-source, data-migration engine, which manages the entire lifecycle of data migration.
After installing YugabyteDB Voyager, we’ll begin by creating users in our source and target databases and granting them roles.
I’ve chosen to use the mysqlsh
command-line utility to connect to my cloud instance, but Google provides multiple connection options.
1. Create the ybvoyager
user in Cloud SQL and grant permissions:
> mysqlsh root@CLOUD_SQL_HOST --password='CLOUD_SQL_PASSWORD'
> \sql
SQL=> \use social
SQL=> CREATE USER 'ybvoyager'@'%' IDENTIFIED WITH mysql_native_password BY 'Password#123';
SQL=> GRANT PROCESS ON *.* TO 'ybvoyager'@'%';
SQL=> GRANT SELECT ON social.* TO 'ybvoyager'@'%';
SQL=> GRANT SHOW VIEW ON source_db_name.* TO 'ybvoyager'@'%';
SQL=> GRANT TRIGGER ON source_db_name.* TO 'ybvoyager'@'%';
SQL=> GRANT SHOW_ROUTINE ON *.* TO 'ybvoyager'@'%';
2. Repeat this process using the YugabyteDB Managed Cloud Shell:
// Optionally, you can create a database for import. Otherwise, the target database will default to 'yugabyte'.
yugabyte=> CREATE DATABASE social;
yugabyte=> CREATE USER ybvoyager PASSWORD 'password';
yugabyte=> GRANT yb_superuser TO ybvoyager;
Now, our source and target databases are equipped to use Voyager. In order to export from Cloud SQL, we first need to create an export directory and an associated environment variable:
> mkdir ~/export-dir
> export EXPORT_DIR=$HOME/export-dir
This directory will be used as an intermediary between our source and target databases. It will house schema and data files, as well as logs, metadata, and schema analysis reports.
Let’s begin migrating our database.
1. Export the schema from Google Cloud SQL:
> yb-voyager export schema --export-dir ~/export-dir \
--source-db-type mysql \
--source-db-host CLOUD_SQL_HOST \
--source-db-user ybvoyager \
--source-db-password 'Password#123' \
--source-db-name social
export of schema for source type as 'mysql'
mysql version: 8.0.26-google
exporting TABLE done
exporting PARTITION done
exporting VIEW done
exporting TRIGGER done
exporting FUNCTION done
exporting PROCEDURE done
Exported schema files created under directory: /export-dir/schema
2. Analyze the exported schema:
> yb-voyager analyze-schema --export-dir ~/export-dir --output-format txt
-- find schema analysis report at: /export-dir/reports/report.txt
By analyzing our schema before exporting data, we have the option to make any necessary changes to our DDL statements. The schema analysis report will flag any statements that require manual intervention.
In the case of YugaSocial, Voyager migrated the MySQL schema to PostgreSQL DDL without needing any manual changes.
3. Finally, export the data from Google Cloud SQL:
> yb-voyager export data --export-dir ~/export-dir \
--source-db-type mysql \
--source-db-host CLOUD_SQL_HOST \
--source-db-user ybvoyager \
--source-db-password 'Password#123' \
--source-db-name social
export of data for source type as 'mysql'
Num tables to export: 6
table list for data export: [comments likes posts relationships stories users]
calculating approx num of rows to export for each table...
Initiating data export.
Data export started.
Exported tables:- {comments, likes, posts, relationships, stories, users}
TABLE ROW COUNT
comments 1000
likes 502
posts 1000
relationships 1002
stories 1000
users 1004
Export of data complete ✅
After successfully exporting our schema and data, we’re ready to move our database to YugabyteDB Managed.
1. Import the schema to YugabyteDB Managed:
> yb-voyager import schema --export-dir ~/export-dir \
--target-db-host YUGABYTEDB_MANAGED_HOST \
--target-db-user ybvoyager \
--target-db-password 'password' \
--target-db-name yugabyte \
--target-db-schema social \
--target-ssl-mode require \
--start-clean
schemas to be present in target database "yugabyte": [social]
creating schema 'social' in target database...
table.sql: CREATE TABLE comments ( id bigserial, description varchar(200) NOT NULL, crea ...
table.sql: ALTER SEQUENCE comments_id_seq RESTART WITH 1;
table.sql: ALTER TABLE comments ADD UNIQUE (id);
table.sql: CREATE TABLE likes ( id bigserial, userid bigint NOT NULL, postid bigint NOT ...
table.sql: ALTER SEQUENCE likes_id_seq RESTART WITH 1;
table.sql: ALTER TABLE likes ADD UNIQUE (id);
table.sql: CREATE TABLE posts ( id bigserial, description varchar(200), img varchar(200) ...
...
As you can see from the terminal output, I’ve chosen to import into the public
schema. If you’d like to use a different schema, you can do this using the --target-db-schema
option.
2. Import the data to YugabyteDB Managed:
> yb-voyager import data --export-dir ~/export-dir \
--target-db-host YUGABYTEDB_MANAGED_HOST \
--target-db-user ybvoyager \
--target-db-password 'password' \
--target-db-name yugabyte \
--target-db-schema social \
--target-ssl-mode require \
--start-clean
import of data in "yugabyte" database started
Using 2 parallel jobs by default. Use --parallel-jobs to specify a custom value
skipping already imported tables: []
Preparing to import the tables: [comments likes posts relationships stories users]
All the tables are imported
setting resume value for sequences
YugabyteDB Voyager handles this data import with parallelism, making quick work of it.
3. To wrap things up, import indexes and triggers:
> yb-voyager import schema --export-dir ~/export-dir \
--target-db-host YUGABYTEDB_MANAGED_HOST \
--target-db-user ybvoyager \
--target-db-password ‘password’ \
--target-db-name yugabyte \
--target-db-schema social \
--target-ssl-mode require \
--start-clean \
--post-import-data
INDEXES_table.sql: CREATE INDEX comments_postid ON comments (postid);
INDEXES_table.sql: CREATE INDEX comments_userid ON comments (userid);
INDEXES_table.sql: CREATE INDEX likes_postid ON likes (postid);
...
We no longer need the ybvoyager
user in YugabyteDB Managed. To change ownership of the imported objects to another user in the YugabyteDB Managed Cloud Shell, run:
> REASSIGN OWNED BY ybvoyager TO admin;
> DROP OWNED BY ybvoyager;
> DROP USER ybvoyager;
It’s time to verify that our database was successfully migrated to YugabyteDB Managed, by reconfiguring our YugaSocial application.
Connecting YugaSocial to YugabyteDB Managed in Node.js
As mentioned, YugaSocial was developed to run on MySQL. However, I also added support for PostgreSQL. Since YugabyteDB is PostgreSQL-compatible, we can use the node-postgres
driver for Node.js to connect to our YugabyteDB Managed cluster.
In fact, Yugabyte has developed its own smart drivers, which add load-balancing capabilities to native drivers. This can drastically improve performance by avoiding excessive load on any single cluster node.
After installing Yugabyte’s fork of node-postgres, we’re ready to connect to our database:
// connect.js
...
const { Pool } = require("@yugabytedb/pg");
if (process.env.DB_TYPE === “yugabyte”) {
const pool = new Pool({
user: process.env.DB_USER,
host: process.env.DB_HOST,
password: process.env.DB_PASSWORD,
port: 5433,
database: process.env.DB_NAME,
min: 5,
max: 100,
ssl: {
rejectUnauthorized: false
}
});
}
This configuration is very similar to the MySQL driver. By restarting our application with the proper environment variables for our connection details, we’re able to confirm that our data was migrated successfully:
> DB_TYPE=yugabyte DB_USER=admin DB_HOST=[HOST] DB_HOST=[PASSWORD] node index.js
Our application functions just the same as before. This time I replied to Yana, to let her know that YugaSocial had officially been migrated to YugabyteDB Managed!
Conclusion
As you can see, YugabyteDB Voyager simplifies migration from MySQL to YugabyteDB. I encourage you to give it a try in your next coding adventure, whether you’re migrating from MySQL, or other relational databases, like PostgreSQL or Oracle.
Look out for more articles on distributed SQL and Node.js from me in the near future. Until then, don’t hesitate to reach out and keep on coding!
Opinions expressed by DZone contributors are their own.
Comments