Sorry if I’m just missing something from the docs, but I’ve noticed that running sudo service foundationdb stop to temporarily shut off one of the fdbserver processes in my cluster can cause roughly 5 seconds of downtime, depending on which roles that fdbserver was performing within the cluster.
I know that the exclude command helps when you want to permanently remove a machine from the cluster, but my understanding is that it moves all the data off the machine. If you just want to have the machine offline for a minute or two to upgrade the OS, it seems expensive to move all the data rather than just running for a minute or two without the server and catching up when it comes back online.
Am I understanding things correctly? Is exclude the recommended tool for this sort of situation, or is there another way to avoid blips of unavailability when taking a server offline temporarily?