Constant Data Movement

panghy · January 13, 2019, 5:58pm

This is running on 5.2 but on one 4-node cluster, there’s constant 100% disk saturating data movement which doesn’t seem to be going anywhere (always with a residual 4G-12G of data movement left at healthy). Looking at “SendRelocateToDDQx100” messages, there seems to be shards with teams of 3 to 4 processes while the cluster is only doubly redundant. Adding 4 new nodes and excluding the older 4 never completes either (and the constant disk activity continues). Seems like the system isn’t able to figure out that a data move has been completed and just tells storage servers to constantly move data over.

ajbeamon · January 14, 2019, 10:55pm

Do you know if the master died when you tried the exclude? If not, what happens if you kill it?

panghy · January 14, 2019, 10:58pm

(also sent you an email)

We did reboot the cluster a number of times and it goes to a weird 40T data movement state (cluster has 8T of KVs) which quickly drops down but it just stays constantly moving data afterwards. It’s almost like it’s trying to move data to all nodes.

Topic		Replies	Views
FoundationDb stucked excluding a node Running FoundationDB	0	188	February 28, 2024
Constantly repartitioning (under no load) and moving large volumes of data Using FoundationDB	15	1775	January 15, 2020
How would I recover from this failed cluster move? Running FoundationDB	11	809	October 16, 2024
R/w performance impact and replicas consistency while moving data Using FoundationDB	0	526	July 13, 2018
Memory cluster seems to be stuck in moving data state Running FoundationDB	0	436	August 13, 2020

Constant Data Movement

Related topics