CNPG Recipe 15 - PostgreSQL major online upgrades with logical replication
Table of Contents
This recipe shows how to perform an online major PostgreSQL upgrade using the
new declarative approach to logical replication introduced in CloudNativePG
1.25. By leveraging the Publication
and Subscription
CRDs, users can set up
logical replication between PostgreSQL clusters with ease. I will walk you
through configuring a PostgreSQL 15 publisher, importing schemas into a
PostgreSQL 17 subscriber, and verifying data synchronisation, with the broader
goal of highlighting the benefits of a repeatable and testable upgrade process.
NOTE: this article has been updated on December 23rd, 2024 with the most recent
version of cloudnative-pg
.
Upgrading PostgreSQL to a major version has historically been one of the most challenging tasks for users. To minimise downtime during the upgrade process, two primary approaches are commonly used:
Offline In-Place Upgrade Using
pg_upgrade
to ensure that the data files from an older version are converted to the newer version.Online Remote Upgrade Leveraging native logical replication for minimal downtime. For a detailed guide, check out “CNPG #5 - How to Migrate Your PostgreSQL Database in Kubernetes with ~0 Downtime from Anywhere”, as it covers concepts essential for fully understanding this article.
Although CloudNativePG does not currently support pg_upgrade
(
planned for version 1.26),
version 1.25 introduces a significant new feature:
declarative support for logical publications and subscriptions.
This article demonstrates how to perform an online major PostgreSQL upgrade on
your laptop using kind
(Kubernetes in Docker). I’ll use the Publication
and
Subscription
CRDs introduced in the version 1.25.
Use Case: Online Major PostgreSQL Upgrade #
Imagine you have a CloudNativePG-managed PostgreSQL 15 database running in your Kubernetes cluster. Your goal is to upgrade it to PostgreSQL 17 while meeting the following requirements:
- Repeatability: The process should be repeatable to ensure reliability.
- Testability: Run a test migration to provide ample time for developers and testers to validate the upgraded database. This also allows you to measure the time required for data transfer, and practice the cutover procedure.
- Production-Readiness: Ensure a fresh migration is performed from scratch before switching to the new database in production. This approach minimises downtime for the application, reducing it to near-zero levels.
Installing CloudNativePG #
Before proceeding, ensure you’ve completed the steps in
“CloudNativePG Recipe 1 - Setting Up Your Local Playground in Minutes”.
For this example, you’ll need version 1.25 to use the declarative Publication
and Subscription
CRDs.
To install version 1.25.0, run the following command:
kubectl apply --server-side -f \
https://raw.githubusercontent.com/cloudnative-pg/cloudnative-pg/release-1.25/releases/cnpg-1.25.0.yaml
Make sure you also have the latest version of the cnpg
plugin for kubectl
,
as described in CloudNativePG Recipe #1.
Setting Up the Origin Database #
Let’s start by defining a PostgreSQL 15 cluster with the following configuration:
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
name: pg-15
spec:
instances: 3
storage:
size: 1Gi
Populating the Database with Sample Data #
We’ll use pgbench
to generate and insert sample data into the database. Run
the following command:
kubectl cnpg pgbench \
--db-name app \
--job-name pgbench-init \
pg-15 \
-- --initialize
After running the command, inspect the job logs to confirm the operation:
kubectl logs jobs/pgbench-init
You should see output similar to the following:
dropping old tables...
NOTICE: table "pgbench_accounts" does not exist, skipping
NOTICE: table "pgbench_branches" does not exist, skipping
NOTICE: table "pgbench_history" does not exist, skipping
NOTICE: table "pgbench_tellers" does not exist, skipping
creating tables...
generating data (client-side)...
100000 of 100000 tuples (100%) of pgbench_accounts done (elapsed 0.01 s, remaining 0.00 s)
vacuuming...
creating primary keys...
done in 0.16 s (drop tables 0.00 s, create tables 0.01 s, client-side generate 0.07 s, vacuum 0.04 s, primary keys 0.04 s).
At this point, the app
database should contain the pgbench
tables and be
approximately 22 MB in size. You can verify this by running:
kubectl cnpg psql pg-15 -- app -c '\dt+' -c '\l+'
Setting Up the Destination Database #
Next, you’ll define the PostgreSQL 17 cluster and configure it to import the
schema from its source, pg-15
, which serves as the “publisher” in logical
replication terms:
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
name: pg-17
spec:
instances: 3
storage:
size: 1Gi
# Use the microservice import for the schema
bootstrap:
initdb:
import:
type: microservice
schemaOnly: true
databases:
- app
source:
externalCluster: pg-15
# Define the "publisher"
externalClusters:
- name: pg-15
connectionParameters:
host: pg-15-rw.default.svc
user: app
dbname: app
password:
name: pg-15-app
key: password
In the example above, the externalClusters
section specifies how to connect
to the publisher using the app
user.
Once the manifest is applied, the destination server will have the schema imported but without any data. You can verify this by running:
kubectl cnpg psql pg-17 -- app -c '\dt+'
You should get a similar output:
List of relations
Schema | Name | Type | Owner | Persistence | Access method | Size | Description
--------+------------------+-------+-------+-------------+---------------+---------+-------------
public | pgbench_accounts | table | app | permanent | heap | 0 bytes |
public | pgbench_branches | table | app | permanent | heap | 0 bytes |
public | pgbench_history | table | app | permanent | heap | 0 bytes |
public | pgbench_tellers | table | app | permanent | heap | 0 bytes |
(4 rows)
Setting Up the Publisher #
The first step in configuring logical replication is enabling a role to be used
in a logical publication in the app
database. For this example, we’ll use the
app
user for simplicity. Below, we assign the replication
privilege to the
app
user:
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
name: pg-15
spec:
instances: 3
storage:
size: 1Gi
managed:
roles:
- name: app
login: true
replication: true
We then create the Publication
resource that will be then used to replicate
all changes to all tables in the app
database.
apiVersion: postgresql.cnpg.io/v1
kind: Publication
metadata:
name: pg-15-publisher
spec:
cluster:
name: pg-15
dbname: app
name: publisher
target:
allTables: true
publicationReclaimPolicy: delete
Apply the manifest and check the app
user:
kubectl cnpg psql pg-15 -- app -c '\du app'
List of roles
Role name | Attributes
-----------+-------------
app | Replication
Setting Up the Subscriber #
With the pg-17
database schema imported, the next step is to configure the
subscription to the “publisher” on pg-15
. This establishes logical
replication between the two clusters:
apiVersion: postgresql.cnpg.io/v1
kind: Subscription
metadata:
name: pg-17-subscription
spec:
cluster:
name: pg-17
dbname: app
name: subscriber
externalClusterName: pg-15
publicationName: publisher
subscriptionReclaimPolicy: delete
Apply the manifest to initiate the subscription, then check the logs of both clusters. You should see entries indicating that logical replication has started, such as:
logical replication apply worker for subscription "subscriber" has started
or:
starting logical decoding for slot "subscriber"
Finally, verify that the pgbench_accounts
table on the destination database
contains 10,000 records by running the following command:
kubectl cnpg psql pg-17 -- app -qAt -c 'SELECT count(*) FROM pgbench_accounts'
Once you have migrated, you can delete the subscription and publication objects.
Conclusions #
This article introduced the basics of setting up logical replication using the
new declarative approach in CloudNativePG 1.25. While the focus was on core
concepts, there are additional aspects to consider, such as synchronising
sequences—which are not part of the pgbench
database. Detailed guidance on
this topic is available in
CNPG Recipe #5
through the use of plugins— this task is still imperative or delegated to the
application.
For cutover, transitioning your applications to the new pg-17-rw
service will
finalise the process. If your publisher resides outside the control of
CloudNativePG, you can still benefit from the declarative subscriptions.
The primary goal of this article was to showcase the streamlined and repeatable process for managing PostgreSQL native logical replication through the new declarative method for defining publications and subscriptions. This approach significantly enhances the manageability and efficiency of logical replication in Kubernetes environments.
Stay tuned for the upcoming recipes! For the latest updates, consider subscribing to my LinkedIn and Twitter channels.
If you found this article informative, feel free to share it within your network on social media using the provided links below. Your support is immensely appreciated!
Cover Picture: “Elephant Family with baby“.