Design and Implementation of a Performant Restore System in FDB

jzhou · August 14, 2020, 9:33pm

Yes. This is the old backup system (6.2 and prior), which writes to the system keyspace.

In 6.3, there is a new role, “BackupWorker”, in the cluster, which pulls mutations from the transaction logs and uploads them to blob storage (i.e., the cluster pushes mutations out). Note that BackupWorkers are FDB-internal processes. In contrast, backup agents are external, and only talk to Proxies to read data out.
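To make the data flow concrete, here is a toy sketch of that loop: a worker pulls mutations from a transaction log in version order and uploads them to blob storage in batches. It is not FDB code. Every name in it (`Mutation`, `ToyTransactionLog`, `ToyBlobStore`, `run_backup_worker`, the `mutations-<first>-<last>` object naming) is made up for illustration, and it assumes each version carries a single mutation.

```python
from dataclasses import dataclass, field


@dataclass
class Mutation:
    version: int  # commit version; the toy assumes one mutation per version
    key: bytes
    value: bytes


@dataclass
class ToyTransactionLog:
    """Hypothetical stand-in for a transaction log: a version-ordered list."""
    mutations: list = field(default_factory=list)

    def peek(self, after_version, limit):
        """Return up to `limit` mutations with version > after_version."""
        newer = [m for m in self.mutations if m.version > after_version]
        return newer[:limit]


@dataclass
class ToyBlobStore:
    """Hypothetical stand-in for blob storage: object name -> contents."""
    objects: dict = field(default_factory=dict)

    def upload(self, name, batch):
        self.objects[name] = list(batch)


def run_backup_worker(tlog, blob, batch_size=2):
    """Pull mutations from the log and push them to blob storage.

    Returns the last version that was saved (0 if the log was empty).
    """
    saved_version = 0
    while True:
        batch = tlog.peek(saved_version, batch_size)
        if not batch:
            return saved_version
        # Name each object after the version range it covers.
        name = f"mutations-{batch[0].version}-{batch[-1].version}"
        blob.upload(name, batch)
        saved_version = batch[-1].version


tlog = ToyTransactionLog([
    Mutation(10, b"a", b"1"),
    Mutation(20, b"b", b"2"),
    Mutation(30, b"c", b"3"),
])
blob = ToyBlobStore()
last_saved = run_backup_worker(tlog, blob)
```

The point of the sketch is only the direction of the flow: the worker sits inside the cluster next to the logs and pushes data out, instead of an external client reading it out.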

SteavedHams · August 14, 2020, 10:06pm

Backup agents are just clients; they talk to storage servers for reads like any other client.

osamarin · August 15, 2020, 7:24pm

In 6.3, there is a new role, “BackupWorker”, in the cluster, which pulls mutations from the transaction logs and uploads them to blob storage (i.e., the cluster pushes mutations out). Note that BackupWorkers are FDB-internal processes. In contrast, backup agents are external, and only talk to Proxies to read data out.

It seems a pull-based solution would have the backup agent play the BackupWorker role instead of fdbserver. But that would require exposing the transaction logs to external processes such as backup agents. Maybe these processes are neither external nor internal, but intermediate. There is currently no API or protocol for intermediate clients, but it might be good to have one.

Another possibility is to add a DrWorker role that would be responsible for sending transaction logs to the DR cluster.

osamarin · December 8, 2020, 12:50pm

However, it does have a major concern:
FDB now has a multi-region configuration (also called the fearless configuration), which no longer has a separate FDB cluster in the remote DC. The multi-region configuration will become the recommended configuration for high-availability services.
The DR-based backup and restore solution won’t work out of the box for the multi-region configuration.

It seems we can do the same thing with the multi-region configuration: 1. stop a remote datacenter; 2. make a cold copy of it; 3. restart the remote datacenter.
