Rebalancing not happening on read hot shards in 7.4.43?

mpatou_openai · March 31, 2025, 8:39pm

Over the course of the week-end one of our cluster is starting to see a lot more load and some of the storage servers are using ~75% of CPU on average when some others are mostly idle.

I already mentioned this a bit in this: How accurate is readOpsPerSec when readSample is enabled?

But this post is focused on the apparent lack of rebalancing

Using the hotrange command I can see that we have 2 ss with very different read profiles:

fdb> hotrange 10.0.49.70:4505 bytes "" "\xff"  1

[
    {
        "begin" : "",
        "bytes" : 647490485,
        "end" : "\u00FF\u0002/<snip>",
        "readBytesPerSec" : 7500,
        "readOpsPerSec" : 6124100
    }
]
fdb> hotrange 10.0.235.187:4503 bytes "" "\xff"  1

[
    {
        "begin" : "",
        "bytes" : 946779588,
        "end" : "\u00FF\u0002/<snip>",
        "readBytesPerSec" : 453333.33333333331,
        "readOpsPerSec" : 4803600
    }
]

I would have expected fdb to actually rebalance the read hot shards to the one that are mostly doing nothing but it’s not happening.

One thing I suspect is happening is because all datadistribution is enabled that that the reblance_disk is taking precedence on the rebalance_read.
But so far I don’t have data to back this hypothesis.

Any advices or clues ?

mpatou_openai · April 1, 2025, 12:29am

Replying to myself after reading a bit of the code.

you need to check that --knob_read_sampling_enabled is set to true on both the storage and the stateless nodes especially on the node data as the datadistributor role
If you don’t see "DDRebalancePaused" in the logs of the datadistributor when you disable rebalance_read (ie. datadistribution disable rebalance_disk) then it’s a sign that rebalance is not happening on reads.

anleg · April 2, 2025, 7:14am

Hey Matthieu

Indeed read_sampling_enabled knob is required to have read aware distribution enabled: doc link
Furthermore, keep in mind that this feature will only redistribute shards to spread the load on storages. It doesn’t split shard ! It means that you can still have hot shard that consume all the cpu of a storage.

mpatou_openai · April 2, 2025, 9:20pm

Hi Anleg,

So it would help only if you have multiple shards that are hot on the same storage server right ?

And is there any plan to split shards so that when you have a one hot read shard it is split in smaller ones ?

Topic		Replies	Views
Debugging Data Distribution Using FoundationDB	3	909	November 14, 2018
Repartionning after storage server (ss) restart Using FoundationDB	0	25	November 2, 2024
Seeing lots of rebalancing after fleet wide restarts Running FoundationDB performance	1	490	May 31, 2021
What is the status of storing extra copies of hot key ranges in memory? FoundationDB Core	6	681	June 26, 2024
How to speed up balancing? Using FoundationDB performance	11	1512	August 21, 2019

Rebalancing not happening on read hot shards in 7.4.43?

Related topics