'Locking coordination' state after process removal

alexmiller · July 10, 2019, 11:09pm

The output of fdbcli> status json includes each process’s class settings and role recruitments.

See https://pastebin.com/kj5XCNPM (from the thread “Are spikes of 500ms+ MaxRowReadLatency normal?”) for an example.

            "54a7a3995096944c1ecb563e81ff61d9" : {
                "class_source" : "command_line",
                "class_type" : "storage",
                [snip]
                "roles" : [
                    {
                        "role" : "storage",
                        [snip]
                    }
                ],
            },
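If you want to pull just those fields out of a saved status dump, something like the following rough sketch might help. It assumes the per-process entries live under cluster.processes in the status json output, and the sample document here is a hypothetical, trimmed-down stand-in for real output (only the fields shown in the excerpt above are used).

```python
import json


def summarize_processes(status):
    """Map each process ID to its configured class and recruited roles."""
    # Assumption: per-process entries are found under cluster.processes.
    processes = status.get("cluster", {}).get("processes", {})
    summary = {}
    for process_id, info in processes.items():
        summary[process_id] = {
            "class_type": info.get("class_type"),
            "class_source": info.get("class_source"),
            "roles": [role.get("role") for role in info.get("roles", [])],
        }
    return summary


# Hypothetical sample, reduced to the fields from the excerpt above.
sample = json.loads("""
{
  "cluster": {
    "processes": {
      "54a7a3995096944c1ecb563e81ff61d9": {
        "class_source": "command_line",
        "class_type": "storage",
        "roles": [{"role": "storage"}]
      }
    }
  }
}
""")

for process_id, entry in summarize_processes(sample).items():
    print(process_id, entry["class_type"], entry["class_source"], entry["roles"])
```

To run it against a real cluster you would replace the sample with the parsed output of status json, for example after saving it to a file.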

If you remove as many machines as your replication factor (or more) from a cluster without first excluding them and waiting for the exclude to finish, then you’re going to break your cluster, because there’s data (including system metadata) that will be permanently missing.
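As a rough illustration of that rule, here is a small sketch. The function names and the example addresses are made up for this post; the redundancy-mode table assumes single, double and triple keep 1, 2 and 3 copies respectively, and the command builder assumes fdbcli’s --exec option and exclude command. It only builds the command and does not run anything.

```python
# Assumed number of copies kept by each redundancy mode.
REPLICATION_FACTOR = {"single": 1, "double": 2, "triple": 3}


def may_lose_data(removed_without_exclude, redundancy_mode):
    """True if removing that many un-excluded machines could drop every copy of some data."""
    return removed_without_exclude >= REPLICATION_FACTOR[redundancy_mode]


def exclude_command(addresses):
    """Build (but do not run) an fdbcli invocation that excludes the given processes."""
    if not addresses:
        raise ValueError("need at least one address to exclude")
    return ["fdbcli", "--exec", "exclude " + " ".join(addresses)]


# Hypothetical addresses, for illustration only.
to_remove = ["10.0.0.5:4500", "10.0.0.6:4500"]

if may_lose_data(len(to_remove), "double"):
    print("Exclude first and wait for it to finish before removing these machines:")
    print(exclude_command(to_remove))
```

The point is only the ordering: exclude, wait for the exclude to complete, and only then take the machines away.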

The locking_coordinated_state recovery step also waits for the previous generation of TLogs to come back, so that we can read out the system metadata. As you’ve removed at least as many machines as your replication factor, that’s never going to finish.

(I’ve also been confused by this naming, so maybe we should go rename this step sometime…)

I’m confused though, because fdbcli> configure single ssd shouldn’t bring you back to a working cluster. Running fdbcli> configure new single ssd, and thus throwing away the previous database, might. Did you happen to elide the new by accident when posting, or should I go think harder?
