Our FDB cluster is configured with two regions and three datacenters. Region 1 (West Coast) contains DC1 and DC2, and Region 2 (East Coast) contains DC3. DC1 is the primary DC, DC2 is the satellite DC, and DC3 is the standby DC.
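For concreteness, here is a rough sketch of how this topology might be written in FDB's region configuration JSON, generated from Python. The datacenter IDs (`dc1`, `dc2`, `dc3`), the priority values, and the `one_satellite_double` satellite redundancy mode are illustrative assumptions, not our actual settings.

```python
import json


def build_regions(primary_id, satellite_id, standby_id):
    """Build a two-region layout: primary + satellite in one region, standby in the other."""
    return {
        "regions": [
            {
                # Region 1 (West Coast): primary DC plus its satellite.
                "datacenters": [
                    {"id": primary_id, "priority": 1},
                    {"id": satellite_id, "priority": 0, "satellite": 1},
                ],
                # Assumed mode; other satellite redundancy modes exist.
                "satellite_redundancy_mode": "one_satellite_double",
            },
            {
                # Region 2 (East Coast): standby DC with a lower priority.
                "datacenters": [
                    {"id": standby_id, "priority": 0},
                ],
            },
        ]
    }


# Hypothetical datacenter IDs, chosen only for this example.
config = build_regions("dc1", "dc2", "dc3")
regions_json = json.dumps(config, indent=2)
print(regions_json)
```

A file with this content would typically be applied through `fileconfigure` in `fdbcli`, with `usable_regions` controlling whether the second region receives a copy of the data.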
In our multi-DC FDB cluster, bulk loading populates the initial data before the database starts serving traffic. We are currently trying to improve our bulk loading utility. Following the strategy described in the “Migrating a database to use a region configuration” section of the FDB architecture document, we only need the West Coast region set up during bulk loading; afterwards, we can rely on FDB's data synchronization protocol to make a full copy to DC3 in the East Coast region.
Question 1: Can we use only the primary DC (DC1) for bulk loading, so that we avoid synchronizing log-store data to the satellite DC (DC2)? For this initial data load, we do not need to worry about having two log stores. How can we configure a primary-DC-only setup?
Question 2: To load the data into the primary DC, can we start with a configuration that keeps only 2 copies of the data (replication factor = 2)? Then, once the data loading is finished, we would increase the replication factor to 3, forcing the FDB cluster to re-replicate and rebalance the data to meet the new replication requirement. Would this approach work?
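To make Question 2 concrete, the sequence we have in mind could be sketched roughly as below. `configure double` and `configure triple` are the standard `fdbcli` redundancy modes; the loader command `./bulk_load` is a hypothetical placeholder for our utility. The sketch only builds the ordered command list and does not run anything; whether this ordering is safe or efficient on a multi-region cluster is exactly what we are asking.

```python
def bulk_load_plan(load_argv):
    """Return the ordered commands (as argv lists) for the double-then-triple idea."""
    return [
        # Step 1: keep only 2 copies of the data while loading.
        ["fdbcli", "--exec", "configure double"],
        # Step 2: run the bulk loader (hypothetical command supplied by the caller).
        list(load_argv),
        # Step 3: raise replication to 3 copies and let FDB re-replicate the data.
        ["fdbcli", "--exec", "configure triple"],
    ]


# "./bulk_load" is a placeholder name, not a real tool.
plan = bulk_load_plan(["./bulk_load"])
for step in plan:
    print(" ".join(step))
```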