暂无图片
暂无图片
暂无图片
暂无图片
暂无图片

磐维数据库gs_probackup的PITR恢复演练

杨金福 2025-03-21
60

背景信息

当数据库崩溃或希望回退到数据库之前的某一状态时,磐维数据库的即时恢复功能(Point-In-Time Recovery,简称PITR)可以支持恢复到备份归档数据之后的任意时间

说明:

PITR仅支持恢复到物理备份数据之后的某一时间点。

仅主节点可以进行PITR恢复,备机需要进行全量build达成与主机数据同步。

恢复前建议对已归档的XLOG进行备份。因为若恢复后不满意当前的恢复状态,想重新进行恢复,但此时已归档的XLOG可能会被新归档的XLOG所覆盖,从而可能导致恢复失败。

前提条件

基于经过物理备份的全量数据文件。

基于已归档的WAL日志文件。

PITR恢复流程

将物理备份的文件替换目标数据库目录。

删除数据库目录下pg_xlog/中的所有文件。

将归档的WAL日志文件复制到pg_xlog文件中(此步骤可以省略,通过配置recovery.conf恢复命令文件中的restore_command项替代)。

在数据库目录下创建恢复命令文件recovery.conf,指定数据库恢复的程度。

启动数据库。

连接数据库,查看是否恢复到希望预期的状态。

若已经恢复到预期状态,通过pg_xlog_replay_resume()指令使主节点对外提供服务。

演练步骤

1.环境准备

注:本次演练基于PanWeiDB_V2.0单节点

1.1 检查数据库已开归档和归档路径

[omm@pwnode1 ~]$ gsql -r

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

postgres=# show archive_mode;

archive_mode

--------------

on

(1 row)

postgres=# show archive_dest;

archive_dest

--------------

/archive

(1 row)

postgres=#

1.2 检查数据库gs_probackup备份配置

[omm@pwnode1 ~]$ gs_probackup show -B /backup/probackup --instance=panweidb

=======================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================

[omm@pwnode1 ~]$

1.3 备份数据库

[omm@pwnode1 ~]$ gs_probackup backup --backup-path=/backup/probackup --instance=panweidb --backup-mode=full

INFO: Backup start, pw_probackup version: 2.4.2, instance: panweidb, backup ID: STGW7H, backup mode: FULL, wal mode: STREAM, remote: false, compress-algorithm: none, compress-level: 1

LOG: Backup destination is initialized

LOG: This PanWeiDB instance was initialized with data block checksums. Data block corruption will be detected

LOG: Database backup start

LOG: started streaming WAL at 0/6000000 (timeline 1)

[2025-03-21 17:40:30]: check identify system success

[2025-03-21 17:40:30]: send START_REPLICATION 0/6000000 success

[2025-03-21 17:40:30]: keepalive message is received

INFO: PGDATA size: 703MB

INFO: Start transferring data files

LOG: Creating page header map “/backup/probackup/backups/panweidb/STGW7H/page_header_map”

[2025-03-21 17:40:30]: keepalive message is received

[2025-03-21 17:40:35]: keepalive message is received

INFO: Data files are transferred, time elapsed: 6s

INFO: wait for pg_stop_backup()

INFO: pg_stop backup() successfully executed

LOG: stop_lsn: 0/6000270

LOG: Looking for LSN 0/6000270 in segment: 000000010000000000000006

LOG: Found WAL segment: /backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006

LOG: Thread [0]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

LOG: Found LSN: 0/6000270

[2025-03-21 17:40:41]:(null): not renaming “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000007”, segment is not complete.

LOG: finished streaming WAL at 0/70000B0 (timeline 1)

LOG: Getting the Recovery Time from WAL

LOG: Thread [0]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

INFO: Syncing backup files to disk

INFO: Backup files are synced, time elapsed: 0

INFO: Validating backup STGW7H

INFO: Backup STGW7H data files are valid

INFO: Backup STGW7H resident size: 735MB

INFO: Backup STGW7H completed

[omm@pwnode1 ~]$

[omm@pwnode1 ~]$ gs_probackup show -B /backup/probackup --instance=panweidb

=======================================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================================

panweidb 9.2 STGW7H 2025-03-21 17:40:36+08 FULL STREAM 1/0 12s 719MB 16MB 0.98 0/6000028 0/6000270 FILE OK

[omm@pwnode1 ~]$

1.4 创建测试数据

插入数据并记录时间点

hrdb=> create table pitr_test(id int,t timestamp);

CREATE TABLE

hrdb=> insert into pitr_test select 1,now;

INSERT 0 1

hrdb=> select * from pitr_test;

id | t

----±--------------------------

1 | 2025-03-21 17:50:47.20499

(1 row)

hrdb=> select now;

now

-------------------------------

2025-03-21 17:51:50.906044+08

(1 row)

hrdb=>

切换wal日志

hrdb=> hrdb=> select pg_xlogfile_name(pg_switch_xlog());

pg_xlogfile_name

--------------------------

000000010000000000000007

(1 row)

插入数据并创建还原点

hrdb=> insert into pitr_test select 2,now;

INSERT 0 1

hrdb=> select * from pitr_test;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

(2 rows)

hrdb=> select pg_create_restore_point(‘point1’);

pg_create_restore_point

-------------------------

0/80008C8

(1 row)

hrdb=>

切换wal日志

hrdb=> select pg_xlogfile_name(pg_switch_xlog());

pg_xlogfile_name

--------------------------

000000010000000000000008

(1 row)

hrdb=>

插入数据并记录事务ID

hrdb=> insert into pitr_test select 3,now;

INSERT 0 1

hrdb=> select * from pitr_test;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

3 | 2025-03-21 18:02:23.628427

(3 rows)

hrdb=> select txid_current(),txid_current_snapshot();

txid_current | txid_current_snapshot

--------------±----------------------

29321 | 29321:29321:

(1 row)

hrdb=>

切换wal日志

hrdb=> select pg_xlogfile_name(pg_switch_xlog());

pg_xlogfile_name

--------------------------

000000010000000000000009

(1 row)

hrdb=>

插入数据并记录lsn

hrdb=> insert into pitr_test select 4,now;

INSERT 0 1

hrdb=> select * from pitr_test;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

3 | 2025-03-21 18:02:23.628427

4 | 2025-03-21 18:05:19.635836

(4 rows)

hrdb=> select pg_current_xlog_location();

pg_current_xlog_location

--------------------------

0/A0003D8

(1 row)

hrdb=>

切换wal日志

hrdb=> select pg_xlogfile_name(pg_switch_xlog());

pg_xlogfile_name

--------------------------

00000001000000000000000A

(1 row)

hrdb=>

2.PITR恢复

2.1 备份wal归档日志

[root@pwnode1 /]# ls -ltr /archive

total 114692

-rw------- 1 omm dbgrp 16777216 Mar 21 17:40 000000010000000000000005

-rw------- 1 omm dbgrp 16777216 Mar 21 17:40 000000010000000000000006

-rw------- 1 omm dbgrp 324 Mar 21 17:40 000000010000000000000006.00000028.backup

-rw------- 1 omm dbgrp 16777216 Mar 21 17:53 000000010000000000000007

-rw------- 1 omm dbgrp 16777216 Mar 21 17:59 000000010000000000000008

-rw------- 1 omm dbgrp 16777216 Mar 21 18:04 000000010000000000000009

-rw------- 1 omm dbgrp 16777216 Mar 21 18:06 00000001000000000000000A

-rw------- 1 omm dbgrp 16777216 Mar 21 20:26 00000001000000000000000B

[root@pwnode1 /]# cp -a /archive /archive_bak

[root@pwnode1 /]#

说明:

恢复前建议对已归档的XLOG进行备份。因为若恢复后不满意当前的恢复状态,想重新进行恢复,但此时已归档的XLOG可能会被新归档的XLOG所覆盖,从而可能导致恢复失败。

2.2基于时间点恢复

2.2.1 停止数据库并删除data目录

[omm@pwnode1 ~]$ gs_om -t stop

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

[omm@pwnode1 ~]$ rm -rf /database/panweidb/data

[omm@pwnode1 ~]$

2.2.2 查看备份并恢复数据库

[omm@pwnode1 ~]$ gs_probackup show --backup-path=/backup/probackup --instance=panweidb

=======================================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================================

panweidb 9.2 STGW7H 2025-03-21 17:40:36+08 FULL STREAM 1/0 12s 719MB 16MB 0.98 0/6000028 0/6000270 FILE OK

[omm@pwnode1 ~]$ gs_probackup restore -B /backup/probackup --instance=panweidb -i STGW7H -D /database/panweidb/data

LOG: Restore begin.

LOG: there is no file tablespace_map

LOG: check tablespace directories of backup STGW7H

LOG: check external directories of backup STGW7H

INFO: Validating backup STGW7H

INFO: Backup STGW7H data files are valid

LOG: Thread [1]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

INFO: Backup STGW7H WAL segments are valid

INFO: Backup STGW7H is valid.

INFO: Restoring the database from backup at 2025-03-21 17:40:29+08

LOG: there is no file tablespace_map

LOG: Restore directories and symlinks… in /database/panweidb/data

INFO: Start restoring backup files. DATA size: 719MB

LOG: Start thread 1

INFO: Backup files are restored. Transfered bytes: 735MB, time elapsed: 0

INFO: Restore incremental ratio (less is better): 102% (735MB/719MB)

INFO: Syncing restored files to disk

INFO: Restored backup files are synced, time elapsed: 0

The pw_guc run with the following arguments: [gs_guc -D /database/panweidb/data -c pw_wal_directory=‘pg_xlog’ set ].

expected instance path: [/database/panweidb/data/postgresql.conf]

gs_guc set: pw_wal_directory=‘pg_xlog’: [/database/panweidb/data/postgresql.conf]

Total instances: 1. Failed instances: 0.

Success to perform gs_guc!

INFO: Restore of backup STGW7H completed.

[omm@pwnode1 ~]$

2.2.3 创建归档恢复配置文件,输入以下内容

[omm@pwnode1 ~]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 ~]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_time=‘2025-03-21 17:51:50.906044+08’

recovery_target_inclusive =true

[omm@pwnode1 ~]$

2.2.4 启动数据库并检查数据恢复情况

[omm@pwnode1 ~]$ gs_om -t start

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 21:36:25.178 [unknown] [unknown] localhost 139783888867904 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 21:36:25.182 [unknown] [unknown] localhost 139783888867904 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 21:36:25.182 [unknown] [unknown] localhost 139783888867904 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 21:36:25.478 [unknown] [unknown] localhost 139783888867904 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 21:36:25.478 [unknown] [unknown] localhost 139783888867904 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±--------------------------

1 | 2025-03-21 17:50:47.20499

(1 row)

hrdb=>

2.3基于还原点恢复

2.3.1 停止数据库并删除data目录

[omm@pwnode1 ~]$ gs_om -t stop

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

[omm@pwnode1 ~]$ rm -rf /database/panweidb/data

[omm@pwnode1 ~]$

2.3.2 查看备份并恢复数据库

[omm@pwnode1 ~]$ gs_probackup show --backup-path=/backup/probackup --instance=panweidb

=======================================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================================

panweidb 9.2 STGW7H 2025-03-21 17:40:36+08 FULL STREAM 1/0 12s 719MB 16MB 0.98 0/6000028 0/6000270 FILE OK

[omm@pwnode1 ~]$ gs_probackup restore -B /backup/probackup --instance=panweidb -i STGW7H -D /database/panweidb/data

LOG: Restore begin.

LOG: there is no file tablespace_map

LOG: check tablespace directories of backup STGW7H

LOG: check external directories of backup STGW7H

INFO: Validating backup STGW7H

INFO: Backup STGW7H data files are valid

LOG: Thread [1]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

INFO: Backup STGW7H WAL segments are valid

INFO: Backup STGW7H is valid.

INFO: Restoring the database from backup at 2025-03-21 17:40:29+08

LOG: there is no file tablespace_map

LOG: Restore directories and symlinks… in /database/panweidb/data

INFO: Start restoring backup files. DATA size: 719MB

LOG: Start thread 1

INFO: Backup files are restored. Transfered bytes: 735MB, time elapsed: 0

INFO: Restore incremental ratio (less is better): 102% (735MB/719MB)

INFO: Syncing restored files to disk

INFO: Restored backup files are synced, time elapsed: 0

The pw_guc run with the following arguments: [gs_guc -D /database/panweidb/data -c pw_wal_directory=‘pg_xlog’ set ].

expected instance path: [/database/panweidb/data/postgresql.conf]

gs_guc set: pw_wal_directory=‘pg_xlog’: [/database/panweidb/data/postgresql.conf]

Total instances: 1. Failed instances: 0.

Success to perform gs_guc!

INFO: Restore of backup STGW7H completed.

[omm@pwnode1 ~]$

2.3.3 创建归档恢复配置文件,输入以下内容

[omm@pwnode1 ~]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 ~]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_name=‘point1’

recovery_target_inclusive =true

[omm@pwnode1 ~]$

2.3.4 启动数据库并检查数据恢复情况

[omm@pwnode1 ~]$ gs_om -t start

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 21:48:22.760 [unknown] [unknown] localhost 139791161537088 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 21:48:22.765 [unknown] [unknown] localhost 139791161537088 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 21:48:22.765 [unknown] [unknown] localhost 139791161537088 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 21:48:23.000 [unknown] [unknown] localhost 139791161537088 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 21:48:23.000 [unknown] [unknown] localhost 139791161537088 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

(2 rows)

hrdb=>

2.4基于事务xid恢复

2.4.1 停止数据库并删除data目录

[omm@pwnode1 ~]$ gs_om -t stop

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

[omm@pwnode1 ~]$ rm -rf /database/panweidb/data

[omm@pwnode1 ~]$

2.4.2 查看备份并恢复数据库

[omm@pwnode1 ~]$ gs_probackup show --backup-path=/backup/probackup --instance=panweidb

=======================================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================================

panweidb 9.2 STGW7H 2025-03-21 17:40:36+08 FULL STREAM 1/0 12s 719MB 16MB 0.98 0/6000028 0/6000270 FILE OK

[omm@pwnode1 ~]$ gs_probackup restore -B /backup/probackup --instance=panweidb -i STGW7H -D /database/panweidb/data

LOG: Restore begin.

LOG: there is no file tablespace_map

LOG: check tablespace directories of backup STGW7H

LOG: check external directories of backup STGW7H

INFO: Validating backup STGW7H

INFO: Backup STGW7H data files are valid

LOG: Thread [1]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

INFO: Backup STGW7H WAL segments are valid

INFO: Backup STGW7H is valid.

INFO: Restoring the database from backup at 2025-03-21 17:40:29+08

LOG: there is no file tablespace_map

LOG: Restore directories and symlinks… in /database/panweidb/data

INFO: Start restoring backup files. DATA size: 719MB

LOG: Start thread 1

INFO: Backup files are restored. Transfered bytes: 735MB, time elapsed: 0

INFO: Restore incremental ratio (less is better): 102% (735MB/719MB)

INFO: Syncing restored files to disk

INFO: Restored backup files are synced, time elapsed: 0

The pw_guc run with the following arguments: [gs_guc -D /database/panweidb/data -c pw_wal_directory=‘pg_xlog’ set ].

expected instance path: [/database/panweidb/data/postgresql.conf]

gs_guc set: pw_wal_directory=‘pg_xlog’: [/database/panweidb/data/postgresql.conf]

Total instances: 1. Failed instances: 0.

Success to perform gs_guc!

INFO: Restore of backup STGW7H completed.

[omm@pwnode1 ~]$

2.4.3 创建归档恢复配置文件,输入以下内容

[omm@pwnode1 ~]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 ~]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_xid=‘29321’

recovery_target_inclusive =true

[omm@pwnode1 ~]$

2.4.4 启动数据库并检查数据恢复情况

[omm@pwnode1 ~]$ gs_om -t start

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 21:56:04.172 [unknown] [unknown] localhost 140198626169408 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 21:56:04.176 [unknown] [unknown] localhost 140198626169408 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 21:56:04.176 [unknown] [unknown] localhost 140198626169408 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 21:56:04.412 [unknown] [unknown] localhost 140198626169408 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 21:56:04.412 [unknown] [unknown] localhost 140198626169408 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

3 | 2025-03-21 18:02:23.628427

(3 rows)

hrdb=>

2.5基于lsn恢复

2.5.1 停止数据库并删除data目录

[omm@pwnode1 ~]$ gs_om -t stop

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

[omm@pwnode1 ~]$ rm -rf /database/panweidb/data

[omm@pwnode1 ~]$

2.5.2 查看备份并恢复数据库

[omm@pwnode1 ~]$ gs_probackup show --backup-path=/backup/probackup --instance=panweidb

=======================================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================================

panweidb 9.2 STGW7H 2025-03-21 17:40:36+08 FULL STREAM 1/0 12s 719MB 16MB 0.98 0/6000028 0/6000270 FILE OK

[omm@pwnode1 ~]$ gs_probackup restore -B /backup/probackup --instance=panweidb -i STGW7H -D /database/panweidb/data

LOG: Restore begin.

LOG: there is no file tablespace_map

LOG: check tablespace directories of backup STGW7H

LOG: check external directories of backup STGW7H

INFO: Validating backup STGW7H

INFO: Backup STGW7H data files are valid

LOG: Thread [1]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

INFO: Backup STGW7H WAL segments are valid

INFO: Backup STGW7H is valid.

INFO: Restoring the database from backup at 2025-03-21 17:40:29+08

LOG: there is no file tablespace_map

LOG: Restore directories and symlinks… in /database/panweidb/data

INFO: Start restoring backup files. DATA size: 719MB

LOG: Start thread 1

INFO: Backup files are restored. Transfered bytes: 735MB, time elapsed: 1s

INFO: Restore incremental ratio (less is better): 102% (735MB/719MB)

INFO: Syncing restored files to disk

INFO: Restored backup files are synced, time elapsed: 0

The pw_guc run with the following arguments: [gs_guc -D /database/panweidb/data -c pw_wal_directory=‘pg_xlog’ set ].

expected instance path: [/database/panweidb/data/postgresql.conf]

gs_guc set: pw_wal_directory=‘pg_xlog’: [/database/panweidb/data/postgresql.conf]

Total instances: 1. Failed instances: 0.

Success to perform gs_guc!

INFO: Restore of backup STGW7H completed.

[omm@pwnode1 ~]$

2.5.3 创建归档恢复配置文件,输入以下内容

[omm@pwnode1 ~]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 ~]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_lsn=‘0/A0003D8’

recovery_target_inclusive =true

[omm@pwnode1 ~]$

2.5.4 启动数据库并检查数据恢复情况

[omm@pwnode1 ~]$ gs_om -t start

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 22:03:09.076 [unknown] [unknown] localhost 140225485603392 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 22:03:09.079 [unknown] [unknown] localhost 140225485603392 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 22:03:09.080 [unknown] [unknown] localhost 140225485603392 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 22:03:09.316 [unknown] [unknown] localhost 140225485603392 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 22:03:09.316 [unknown] [unknown] localhost 140225485603392 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

3 | 2025-03-21 18:02:23.628427

4 | 2025-03-21 18:05:19.635836

(4 rows)

hrdb=>

2.6基于时间点的,以小步长重放xlog的恢复

在典型的设置中,找到一个合理的终止恢复的点的最好的方法是使用pause_at_recovery_target。如果它被设置为ture,达到恢复点时,PostgreSQL将不会自动转变为生产实例。相反,他将等待数据库管理员的指令。如果您不知道要重放xlog到什么位置,这是非常有用的。您可以重放,登录,看看到数据库的哪个位置,更改为下一个目标时间,并不断以小步长来重放。

假设"2025-03-21 18:03:00.628427"为我们所需重放的位置,下面将演练重放至“2025-03-21 17:51:50.906044+08”,“2025-03-21 18:02:20.628427+08”,““2025-03-21 18:03:00.628427+08”,多次重放xlog的恢复

2.6.1 停止数据库并删除data目录

[omm@pwnode1 ~]$ gs_om -t stop

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

[omm@pwnode1 ~]$ rm -rf /database/panweidb/data

[omm@pwnode1 ~]$

2.6.2 查看备份并恢复数据库

[omm@pwnode1 data]$ gs_om -t stop

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

[omm@pwnode1 data]$ rm -rf /database/panweidb/data

[omm@pwnode1 data]$

2.6.3 查看备份并恢复数据库

[omm@pwnode1 ~]$ gs_probackup show --backup-path=/backup/probackup --instance=panweidb

=======================================================================================================================================

Instance Version ID Recovery Time Mode WAL Mode TLI Time Data WAL Zratio Start LSN Stop LSN Type Status

=======================================================================================================================================

panweidb 9.2 STGW7H 2025-03-21 17:40:36+08 FULL STREAM 1/0 12s 719MB 16MB 0.98 0/6000028 0/6000270 FILE OK

[omm@pwnode1 ~]$ gs_probackup restore -B /backup/probackup --instance=panweidb -i STGW7H -D /database/panweidb/data

LOG: Restore begin.

LOG: there is no file tablespace_map

LOG: check tablespace directories of backup STGW7H

LOG: check external directories of backup STGW7H

INFO: Validating backup STGW7H

INFO: Backup STGW7H data files are valid

LOG: Thread [1]: Opening WAL segment “/backup/probackup/backups/panweidb/STGW7H/database/pg_xlog/000000010000000000000006”

INFO: Backup STGW7H WAL segments are valid

INFO: Backup STGW7H is valid.

INFO: Restoring the database from backup at 2025-03-21 17:40:29+08

LOG: there is no file tablespace_map

LOG: Restore directories and symlinks… in /database/panweidb/data

INFO: Start restoring backup files. DATA size: 719MB

LOG: Start thread 1

INFO: Backup files are restored. Transfered bytes: 735MB, time elapsed: 1s

INFO: Restore incremental ratio (less is better): 102% (735MB/719MB)

INFO: Syncing restored files to disk

INFO: Restored backup files are synced, time elapsed: 0

The pw_guc run with the following arguments: [gs_guc -D /database/panweidb/data -c pw_wal_directory=‘pg_xlog’ set ].

expected instance path: [/database/panweidb/data/postgresql.conf]

gs_guc set: pw_wal_directory=‘pg_xlog’: [/database/panweidb/data/postgresql.conf]

Total instances: 1. Failed instances: 0.

Success to perform gs_guc!

INFO: Restore of backup STGW7H completed.

[omm@pwnode1 ~]$

2.6.4 检查hot_standby已开启

[omm@pwnode1 ~]$ grep -i hot_standby $PGDATA/postgresql.conf

wal_level = hot_standby # minimal, archive, hot_standby or logical

hot_standby = on # “on” allows queries during recovery

#hot_standby_feedback = off # send info from standby to prevent

[omm@pwnode1 ~]$

2.6.5 创建归档恢复配置文件,输入以下内容

[omm@pwnode1 ~]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 ~]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_time=‘2025-03-21 17:51:50.906044+08’

recovery_target_inclusive =true

pause_at_recovery_target =true

[omm@pwnode1 ~]$

2.6.6 启动数据库并检查数据恢复情况

[omm@pwnode1 ~]$ gs_om -t start

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 22:48:50.707 [unknown] [unknown] localhost 140144426639936 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 22:48:50.711 [unknown] [unknown] localhost 140144426639936 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 22:48:50.711 [unknown] [unknown] localhost 140144426639936 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 22:48:51.010 [unknown] [unknown] localhost 140144426639936 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 22:48:51.010 [unknown] [unknown] localhost 140144426639936 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±--------------------------

1 | 2025-03-21 17:50:47.20499

(1 row)

hrdb=> select pg_is_xlog_replay_paused();

pg_is_xlog_replay_paused

--------------------------

t

(1 row)

hrdb=>

2.6.7 更改恢复时间点,继续重放xlog

[omm@pwnode1 data]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 data]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_time=‘2025-03-21 18:02:20.628427’

recovery_target_inclusive =true

pause_at_recovery_target =true

[omm@pwnode1 data]$

2.6.8 重启数据库继续恢复

[omm@pwnode1 ~]$ gs_om -t stop && gs_om -t start

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 23:22:27.683 [unknown] [unknown] localhost 140388788097600 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 23:22:27.687 [unknown] [unknown] localhost 140388788097600 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 23:22:27.687 [unknown] [unknown] localhost 140388788097600 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 23:22:27.916 [unknown] [unknown] localhost 140388788097600 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 23:22:27.916 [unknown] [unknown] localhost 140388788097600 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$

2.6.9 检查数据恢复情况及恢复进程状态

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

(2 rows)

hrdb=> select pg_is_xlog_replay_paused();

pg_is_xlog_replay_paused

--------------------------

t

(1 row)

hrdb=>

2.6.10 再次更改恢复时间点,继续重放xlog

[omm@pwnode1 ~]$ vi /database/panweidb/data/recovery.conf

[omm@pwnode1 ~]$ more /database/panweidb/data/recovery.conf

restore_command=‘cp /archive/%f %p’

recovery_target_time=‘2025-03-21 18:03:00.628427+08’

recovery_target_inclusive =true

pause_at_recovery_target =true

[omm@pwnode1 ~]$

2.6.11 重启数据库继续恢复

[omm@pwnode1 ~]$ gs_om -t stop && gs_om -t start

Stopping cluster.

=========================================

Successfully stopped cluster.

=========================================

End stop cluster.

Starting cluster.

=========================================

[SUCCESS] pwnode1

2025-03-21 23:30:07.593 [unknown] [unknown] localhost 140514130022976 0[0:0#0] 0 [BACKEND] WARNING: Error happen when load license, error code 2, error message cannot write data to dir /etc/panweidb/license .

2025-03-21 23:30:07.597 [unknown] [unknown] localhost 140514130022976 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 23:30:07.597 [unknown] [unknown] localhost 140514130022976 0[0:0#0] 0 [BACKEND] WARNING: could not create any HA TCP/IP sockets

2025-03-21 23:30:07.834 [unknown] [unknown] localhost 140514130022976 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm information: Cgroup does not exist(50002)

2025-03-21 23:30:07.834 [unknown] [unknown] localhost 140514130022976 0[0:0#0] 0 [BACKEND] WARNING: Cgroup get_cgroup Gaussdb:omm/Class/DefaultClass information: Cgroup does not exist(50002)

=========================================

Successfully started.

[omm@pwnode1 ~]$

2.6.12 检查数据恢复情况

[omm@pwnode1 ~]$ gsql -U hr -d hrdb -r

Password for user hr:

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

hrdb=> select * from pitr_test ;

id | t

----±---------------------------

1 | 2025-03-21 17:50:47.20499

2 | 2025-03-21 17:56:56.838657

3 | 2025-03-21 18:02:23.628427

(3 rows)

hrdb=> select pg_is_xlog_replay_paused();

pg_is_xlog_replay_paused

--------------------------

t

(1 row)

hrdb=> \q

2.6.13 终止恢复,使实例可读可写,对外提供服务

[omm@pwnode1 ~]$ gsql -r

gsql ((PanWeiDB_V2.0-S2.0.3_B01) compiled at 2024-05-25 22:36:36 commit e857234 )

Non-SSL connection (SSL connection is recommended when requiring high-security)

Type “help” for help.

postgres=# select pg_xlog_replay_resume();

pg_xlog_replay_resume

-----------------------

(1 row)

postgres=# select pg_is_xlog_replay_paused();

ERROR: recovery is not in progress

HINT: Recovery control functions can only be executed during recovery.

CONTEXT: referenced column: pg_is_xlog_replay_paused

postgres=# create table test1(a int);

CREATE TABLE

postgres=#

3.参考文档

https://docs.opengauss.org/zh/docs/latest/docs/ToolandCommandReference/gs_probackup.html

https://docs.opengauss.org/zh/docs/latest/docs/DatabaseOMGuide/%E7%89%A9%E7%90%86%E5%A4%87%E4%BB%BD%E4%B8%8E%E6%81%A2%E5%A4%8D.html#pitr%E6%81%A2%E5%A4%8D

最后修改时间:2025-03-24 09:46:06
「喜欢这篇文章,您的关注和赞赏是给作者最好的鼓励」
关注作者
【版权声明】本文为墨天轮用户原创内容,转载时必须标注文章的来源(墨天轮),文章链接,文章作者等基本信息,否则作者和墨天轮有权追究责任。如果您发现墨天轮中有涉嫌抄袭或者侵权的内容,欢迎发送邮件至:contact@modb.pro进行举报,并提供相关证据,一经查实,墨天轮将立刻删除相关内容。

评论

暂无图片
获得了1385次点赞
暂无图片
内容获得404次评论
暂无图片
获得了141次收藏