Commit c2bfa2e2 authored by Stan Hu's avatar Stan Hu

Update Geo documentation to use PostgreSQL replication slots

The default settings made it easy for the database to get out of sync
when the secondary disconnected. Using replication slots ensures that
this will not happen, although the primary could run out of space.

[ci skip]

Closes #1844
parent 530166fc
...@@ -31,6 +31,11 @@ connect to `secondary` database servers (which are read-only too). ...@@ -31,6 +31,11 @@ connect to `secondary` database servers (which are read-only too).
In many databases documentation you will see `primary` being references as `master` In many databases documentation you will see `primary` being references as `master`
and `secondary` as either `slave` or `standby` server (read-only). and `secondary` as either `slave` or `standby` server (read-only).
New for GitLab 9.4: We recommend using [PostgreSQL replication
slots](https://medium.com/@tk512/replication-slots-in-postgresql-b4b03d277c75)
to ensure the primary retains all the data necessary for the secondaries to
recover. See below for more details.
### Prerequisites ### Prerequisites
The following guide assumes that: The following guide assumes that:
...@@ -73,6 +78,8 @@ The following guide assumes that: ...@@ -73,6 +78,8 @@ The following guide assumes that:
postgresql['listen_address'] = "1.2.3.4" postgresql['listen_address'] = "1.2.3.4"
postgresql['trust_auth_cidr_addresses'] = ['127.0.0.1/32','1.2.3.4/32'] postgresql['trust_auth_cidr_addresses'] = ['127.0.0.1/32','1.2.3.4/32']
postgresql['md5_auth_cidr_addresses'] = ['5.6.7.8/32'] postgresql['md5_auth_cidr_addresses'] = ['5.6.7.8/32']
# New for 9.4: Set this to be the number of Geo secondary nodes you want
postgresql['max_replication_slots'] = 1
# postgresql['max_wal_senders'] = 10 # postgresql['max_wal_senders'] = 10
# postgresql['wal_keep_segments'] = 10 # postgresql['wal_keep_segments'] = 10
``` ```
...@@ -118,6 +125,7 @@ The following guide assumes that: ...@@ -118,6 +125,7 @@ The following guide assumes that:
postgresql['listen_address'] = "10.1.5.3" postgresql['listen_address'] = "10.1.5.3"
postgresql['trust_auth_cidr_addresses'] = ['127.0.0.1/32','10.1.5.3/32'] postgresql['trust_auth_cidr_addresses'] = ['127.0.0.1/32','10.1.5.3/32']
postgresql['md5_auth_cidr_addresses'] = ['10.1.10.5/32'] postgresql['md5_auth_cidr_addresses'] = ['10.1.10.5/32']
postgresql['max_replication_slots'] = 1
# postgresql['max_wal_senders'] = 10 # postgresql['max_wal_senders'] = 10
# postgresql['wal_keep_segments'] = 10 # postgresql['wal_keep_segments'] = 10
``` ```
...@@ -138,9 +146,38 @@ The following guide assumes that: ...@@ -138,9 +146,38 @@ The following guide assumes that:
1. Check to make sure your firewall rules are set so that the secondary nodes 1. Check to make sure your firewall rules are set so that the secondary nodes
can access port 5432 on the primary node. can access port 5432 on the primary node.
1. Save the file and [reconfigure GitLab][] for the changes to take effect. 1. Save the file and [reconfigure GitLab][] for the changes to take effect.
1. New for 9.4: Restart your primary PostgreSQL server to ensure the replication slot changes
take effect (`sudo gitlab-ctl restart postgresql` for Omnibus-provided PostgreSQL).
1. Now that the PostgreSQL server is set up to accept remote connections, run 1. Now that the PostgreSQL server is set up to accept remote connections, run
`netstat -plnt` to make sure that PostgreSQL is listening to the server's `netstat -plnt` to make sure that PostgreSQL is listening to the server's
public IP. public IP.
1. New for 9.4: Choose a database-friendly name to use for your secondary to use as the
replication slot name. For example, if your domain is
`geo-secondary.mydomain.com`, you may use `geo_secondary_my_domain_com` as
the slot name.
1. New for 9.4: Create a replication slot in PostgreSQL for the secondary. To do this, you must
login to your PostgreSQL instance with database superuser account an issue a SQL
command:
```sql
SELECT * FROM pg_create_physical_replication_slot('geo_secondary_my_domain_com');
```
For example, on an Omnibus-provided PostgreSQL database, it would be done
in the following way:
```
$ sudo -u gitlab-psql /opt/gitlab/embedded/bin/psql -h /var/opt/gitlab/postgresql -d gitlabhq_production
psql (9.6.1)
Type "help" for help.
gitlabhq_production=# SELECT * FROM pg_create_physical_replication_slot('geo_secondary_my_domain');
slot_name | xlog_position
-------------------------+---------------
geo_secondary_my_domain |
(1 row)
```
1. Continue to [set up the secondary server](#step-2-configure-the-secondary-server). 1. Continue to [set up the secondary server](#step-2-configure-the-secondary-server).
### Step 2. Configure the secondary server ### Step 2. Configure the secondary server
...@@ -199,13 +236,22 @@ data before running `pg_basebackup`. ...@@ -199,13 +236,22 @@ data before running `pg_basebackup`.
1. Execute the command below to start a backup/restore and begin the replication: 1. Execute the command below to start a backup/restore and begin the replication:
``` ```
gitlab-ctl replicate-geo-database --host=1.2.3.4 gitlab-ctl replicate-geo-database --host=1.2.3.4 --slot-name=geo-secondary_my_domain_com
``` ```
Change the `--host=` to the primary node IP or FQDN. You can check other possible Change the `--host=` to the primary node IP or FQDN. You can check other possible
parameters with `--help`. When prompted, enter the password you set up for parameters with `--help`. When prompted, enter the password you set up for
the `gitlab_replicator` user in the first step. the `gitlab_replicator` user in the first step.
New for 9.4: Change the `--slot-name` to the name of the replication slot
created on the primary database. Note that this MUST match the name used
earlier. Otherwise, PostgreSQL on the secondary will report an error that
reads something like:
```
FATAL: could not start WAL streaming: ERROR: replication slot "geo_secondary_my_domain_com" does not exist
```
The replication process is now over. The replication process is now over.
### Next steps ### Next steps
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment