How to remove a node from fdb cluster

Minaxi · November 13, 2019, 7:43pm

how to remove a node from fdb cluster ? for example if a machine from the cluster gets terminated from my aws cloud . and it happens to be one of the co-ordinator .how to deal with this situation
its a 3 node fdb cluster

alexmiller · November 13, 2019, 7:47pm

See removing a machine from a cluster for the instructions on how to safely remove a machine. As mentioned in there, exclude doesn’t relocate coordinators, so you’ll need to explicitly change coordinators.

As it sounds like you’ve already lost a machine in your cluster, I’ll assume that with a three node FDB cluster, you were using double replication and had three coordinators. Then you should be able to just add another machine and change to having a new set of three available coordinators, and data distribution should take care of restoring two full copies of all of your data.

Minaxi · November 13, 2019, 8:07pm

thanks for your immediate reply, so here is the series of steps i assume
3 node cluster – all 3 are coordinators
1 node lost
change coordinators in existing nodes accordingly
spin new node with all packages
how will the new node join the existing 2 nodes ?

alexmiller · November 13, 2019, 8:11pm

I think the easiest and safest ordering would be to:

Create and configure a new machine that will run the new coordinator, but don’t start it yet
Change coordinators in existing nodes to include the new node
Start the new coordinator node using the new cluster file written from one of the other two nodes, so it will both become a coordinator, and join the new cluster as a worker.

Minaxi · November 13, 2019, 8:13pm

before the second point we need to change coordinators on 2 nodes to remove the dead node first correct ?

alexmiller · November 13, 2019, 8:20pm

You could, and instead do:

Change coordinators to the two still-live machines
Create and configure a new machine that will run the new coordinator
Start the new process with a cluster file for the current two coordinator cluster
Change the coordinators to be all three machines

Which would work equally well. I just elided an extra step by suggesting that you could configure the coordinators only once to include the ip:port where a coordinator will be once you start it.

If you were to write automation to do this, I’d probably expect it to look like

Create and configure a new machine, and start it to join the current cluster (that has a dead coordinator).
Change the coordinators to only be the now three alive machines.

… which I now realize works equally well manually, and is even one more step shorter.

Minaxi · November 13, 2019, 8:25pm

ok sure thanks, will try these

Minaxi · November 13, 2019, 9:06pm

sorry one q on the point 1 of second method

*ceate and configure a new machine, and start it to join the current cluster (that has a dead coordinator). . when i configure the new machine and start it, how will it join the current clluster ?? how will it know ?

alexmiller · November 13, 2019, 9:08pm

Give it the same cluster file that the other two machines have. It’s going to be a cluster file where one of the coordinators is dead, but that’s fine, as 2/3 are alive and you still have quorum.

Minaxi · November 13, 2019, 10:09pm

the 2nd method works perfectly fine, thanks a lot

Topic		Replies	Views
Changing all coordinators in one step? Running FoundationDB	2	69	May 20, 2025
How to restore cluster after accidentally dropping coordinators Using FoundationDB	9	2277	February 11, 2021
Coordinator-only process Using FoundationDB	2	643	October 20, 2018
Coordinators unavailable when 1 node out of 3 is down in 'single' redudancy mode? Running FoundationDB	1	269	September 20, 2023
Permanently remove excluded IP addresses Using FoundationDB	5	628	August 20, 2019

How to remove a node from fdb cluster

Related topics