The relationship between uniform distribution of data and the number of storage units

vivan · April 16, 2024, 2:17am

In our testing, we found that the data cannot be evenly distributed onto the disks, leading to some disks being excessively utilized, resulting in abnormal cluster states. I want to know if adding more storage can make the data writes distribute more evenly across the disks, or if there are other ways to avoid this issue.

vivan · April 16, 2024, 2:35am

Our deployment consists of a single Availability Zone (AZ) with 10 nodes. Each node has 10 processes, including 6 storage, 3 stateless, and 1 log process. Below is a screenshot of the monitoring.

SteavedHams · April 18, 2024, 4:38am

What storage engine are you using and how are you calculating disk utilization? Does each storage process have its own disk?

vivan · April 19, 2024, 1:59am

We are using the SSD-2 storage engine and calculating disk utilization using fdb-exporter. Each group of three storage processes shares one disk.

SteavedHams · April 19, 2024, 2:51am

I’m not familiar with that tool so I don’t know what fields it is actually using for this.

The metric which FDB should balance across StorageServers total logical KV bytes it holds replicas for. This is reported in two places.

Status JSON as stored_bytes for a role=storage role in a process
Trace log in the Type=StorageMetrics trace events as BytesStored for each StorageServer

Check if this metric is balanced across your Storage Servers.

If you look at disk utilization or file sizes, there are several reasons they will not match. A storage server holds X amount of logical data but in > X amount of disk usage because of overhead, internal fragmentation, and internal reusable free space.

Topic		Replies	Views
Data distribution / Disk usage uneven: bifurcated at 2 tiers Using FoundationDB	6	528	August 6, 2020
Storage servers 95% full - how to recover Using FoundationDB	8	1557	May 1, 2024
Understanding load balancing between storage processes under bulk writes Using FoundationDB	5	1820	January 11, 2019
Repartition from cluster expansion results in uneven data distribution Using FoundationDB	0	448	November 5, 2020
How to troubleshoot throughput performance degrade? Using FoundationDB performance	35	4342	June 20, 2019

The relationship between uniform distribution of data and the number of storage units

Related topics