Hi,
We have been playing around with FoundationDB for a few months now and and things are going great, however we have been having some trouble getting things to perform when initially loading large volumes of data into a cluster. When running a large load I pretty much always see the following message in fdbcli:
Performance limited by process: Storage server performance (storage queue).
In the details view for that particular process it doesn’t look like it is particularly struggling:
10.22.64.25:4501 ( 57% cpu; 14% machine; 0.045 Gbps; 60% disk IO; 3.4 GB / 7.3 GB RAM )
Looking at the status json for that process I do see that the durability lag is large:
"durability_lag": {
"seconds": 88.504000000000005,
"versions": 88503993
}
and that the storage queue is full.
With regards to my workload we basically load some complex documents and populate simple indexes. In a single transaction we insert a number of documents so there can be a large number of keys across different subspaces in a single write. One behaviour I also see on the cluster is that initially when I start populating it I get a write rate of about 180,000 Hz (transactions committed about 350Hz) however this slowly degrades over time, for example after about 15 hours the workload see is 80,000Hz (transactions committed 170Hz) with about 500GB of KV data loaded in that time.
I have tried a number of different cluster setups but my current is 6 nodes each with 8 vCPUs, 30GB memory and 2 375GB NVME disks. I run 4 processes on each node 2 stateless class and 2 storage class, on 2 of the nodes I run a log class instead of a storage.
Does anyone know what may be going on here or what I should be investigating further? I’ve read around other posts and tried things out but haven’t been able to get past this. And in general does anyone have any thoughts on if/how I can improve the write performance for these initial data loads into the cluster?
Thank you! (Apologies for the large post, we are relatively new to running FDB so not sure what information is most important here and what to be looking at.)