Using FoundationDB with Samba


I’m working on a research project where I’m looking into using FoundationDB as a database backend for Samba, the open-source implementation of the Microsoft fileserver protocol SMB.

I’m struggling to achieve scalability when going from one fdb node to more, up to 32. When running an SMB benchmark I’m only seeing a factor-of-two increase in performance when going from one fdb node to 32. I’m maxing out at 24k fdb txn/s with 32 nodes, regardless of changing the number of log or storage servers.

Some background: Windows maintains additional per-file-handle state like caching info and mandatory share modes which can’t be mapped to POSIX, and hence Samba has to store them in a database.
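To make this concrete, here is a minimal sketch of what such a per-handle record could look like, packed into a fixed-width value. The field names, flags and layout are hypothetical illustrations, not Samba’s actual record format:

```python
import struct

# Hypothetical share-mode flags (not Samba's real constants).
SHARE_READ, SHARE_WRITE, SHARE_DELETE = 0x1, 0x2, 0x4

def pack_handle_state(access_mask: int, share_mode: int, oplock_level: int) -> bytes:
    """Pack per-handle state into a fixed 12-byte record:
    three unsigned 32-bit big-endian fields."""
    return struct.pack(">III", access_mask, share_mode, oplock_level)

def unpack_handle_state(blob: bytes) -> tuple[int, int, int]:
    """Inverse of pack_handle_state()."""
    return struct.unpack(">III", blob)
```

Records like these are exactly what a clustered backend has to keep coherent across all nodes serving the same file.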

Traditionally, on a single-node server we use our homegrown TDB, which is basically an mmapped hashtable with chaining. But Samba also supports clustering on top of a shared cluster filesystem like Ceph, GlusterFS or GPFS.

There, in order to maintain the Windows-specific state in a cluster-coherent fashion, we use our homegrown clustered database “ctdb”.

This works quite well, with exceptional performance and reasonable scalability. However, ctdb cheats: database records are not replicated, so the database will lose data on node failure. Historically these crash semantics have been “good enough” for the fileserver, but in order to follow suit with Microsoft, who specified and implemented the so-called “Transparent Failover” feature for the SMB protocol, Samba needs a failsafe distributed database with at least linearizable consistency.

Which brings us to FoundationDB. :slight_smile:

I’m currently running benchmarks of a Samba prototype that connects to fdb via Samba’s Python C bindings, hooking into a Python script that consumes the fdb Python module.

This generally works: in the past I was able to achieve up to 2000 SMB protocol open/close ops per second with a proof-of-concept three-node fdb cluster in a local datacenter that was not really well configured or tuned.

To allow testing with larger Samba and FoundationDB clusters I’m now using Terraform with an Azure backend, where we can set the cluster size just by changing some Terraform variables.

Unfortunately, I’m not seeing the scalability I was expecting. I’m able to drive up to 24,000 txn/s with the following config:

  • 32 VMs in Azure
  • each VM has two cores and two disks
  • each VM configured with two fdb processes
  • datadir goes to the dedicated second disk
  • the disk does roughly 8k IOPS (measured with fio)
  • assuming performance would benefit from dedicated log processes, I’ve configured the first fdb process on all nodes with class=log and left the second one unspecified
  • db access pattern is completely non-concurrent, 100% writes, small keys (24 bytes), small records (1KB max)
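For reference, the two-process layout with one log-class process could look roughly like this in /etc/foundationdb/foundationdb.conf (the ports and datadir paths here are assumptions for illustration):

```ini
[fdbserver.4500]
class = log
datadir = /mnt/fdb/4500

[fdbserver.4501]
# class left unset; the cluster assigns a role (e.g. storage) automatically
datadir = /mnt/fdb/4501
```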

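The write pattern above is easy to reproduce in isolation. Here is a small sketch of the op shape — the key prefix and layout are made up, only the sizes match the benchmark:

```python
import os
import struct

KEY_LEN = 24          # small keys, as in the benchmark
MAX_VALUE_LEN = 1024  # records capped at 1 KB

def make_write_op(handle_no: int) -> tuple[bytes, bytes]:
    """Build one write op: a fixed 24-byte key and a value of up to 1 KB.

    Key layout (hypothetical): 8-byte ASCII prefix + 8 zero bytes
    + 8-byte big-endian counter.
    """
    key = b"smbstate" + b"\x00" * 8 + struct.pack(">Q", handle_no)
    value = os.urandom(MAX_VALUE_LEN)  # here always the full 1 KB
    return key, value
```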
I’ve been scraping the forums for advice on how to properly configure and tune fdb, which at least made me switch from fat VMs with many cores to two-core VMs, giving a better disk-to-CPU ratio.

It seems FoundationDB gets overwhelmed once more than a few thousand clients are connected. I had to tweak a few system resource limits (max open files, POSIX AIO contexts) to get past 1000 clients.

I’m able to run my benchmark tool with 4000 clients, which translates to 4000 fdb connections, but beyond that operation latency skyrockets and I’m getting application failures. Is fdb supposed to handle a large number of clients, say 5k, 50k or more?

Looooong story short: am I hitting scalability limits or is my setup just not properly configured?

Thanks in advance for any pointers!


Can you try configuring more proxies? I guess you are using the default values. For 7.1, you can specify commit_proxies and grv_proxies via fdbcli. In our largest deployment, we have 26 commit_proxies and 4 grv_proxies, and the average connection count per process can be around 800 without any issues.
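For anyone following along, the fdbcli invocation is along these lines (the counts below are from our deployment; the right values depend on your workload):

```
fdb> configure commit_proxies=26 grv_proxies=4
```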

Thanks for the suggestion! I tried playing around with both tunables, increasing them to a few different values following the default 4:1 (IIRC) commit:grv ratio. What I saw was an optimum at 2:1, and decreasing performance with anything higher than that.

I wonder whether the real limiting factor is just the slow IO-capped disks in the Azure VMs. I’m still planning to do some more testing on VMs with decent NVMe disks where IOPS are not artificially limited.