What is the good ratio of the storage processes and the log processes, and the associated metrics for monitoring?

jltz · June 3, 2019, 2:57pm

In the FDB architecture document, https://apple.github.io/foundationdb/configuration.html#guidelines-process-class-config, it is stated that “…The recommended minimum number of class=transaction (log server) processes is 8 (active) + 2 (standby)…”.

I am now running a cluster with 60 Kubernetes Storage pods (in a single DC) with each pod having 3 storage processes. I would like to know how I should scale the number of the log processes correspondingly. Certainly different write workload would require different number of the log processes. So the related question is, what are the performance metrics that I can found in the status.json that allow me to determine whether the log processes get saturated or not.

alexmiller · June 3, 2019, 8:22pm

… man, we really need to go clean up and rewrite most of the docs surrounding recommended configuration and performance numbers.

It really depends on your storage characteristics. For running on physical SSDs, which isn’t the network attached storage that Kubernetes deployments sometimes have, we’ve typically seen something around 1:8 for ssd and 1:2 for memory being near optimal. Running a write benchmark and seeing if it improves if you add more logs is the easy and accurate way to find your optimal ratio.

jltz · June 5, 2019, 8:35am

A related question: What about the number of the proxies? The FDB architecture document only states the minimum number for class=stateless processes is 4 proxies. Should that the number of the proxies be increased with the number of the storage pods as well, or the number can be fixed?

alexmiller · June 5, 2019, 5:51pm

My personal benchmarking has generally found keeping number of proxies and number of logs roughly the same is optimal. I found diminishing returns for each additional proxy added above the number of logs. Depending on your particular workload, this could change, but it’s probably a decent starting place.

Topic		Replies	Views
Scaling log server and log to storage ratio Using FoundationDB	5	88	May 15, 2025
WARNING: A single process is both a transaction log and a storage server Using FoundationDB	16	1767	August 13, 2019
WARNING A single process is both a transaction log and a storage server Using FoundationDB	2	1906	January 31, 2023
We have some machines and cpu and a little ssd Using FoundationDB	5	775	July 23, 2019
Production optimizations Using FoundationDB	20	6413	August 15, 2018

What is the good ratio of the storage processes and the log processes, and the associated metrics for monitoring?

Related topics