DPDK networking interface

ryanworl · June 20, 2018, 11:28pm

Given the well-factored and testable nature of the FoundationDB networking interface, has anyone explored an implementation of the interface using the DPDK?

Removing the context switches from packet processing may benefit latency and throughout due to a lack of copying and locks. The single threaded model already works well if you assign one thread to each queue on the NIC. No locks or context switches needed.

ScyllaDB, for example, uses this architecture of one thread per NIC queue and sharding the database based on the NIC queues within each machine in the cluster.

alexmiller · June 20, 2018, 11:42pm

I’ve looked into it briefly, and it would be nice. Our commit flow involves a decent number of network hops, so lowering the networking latency with a better networking stack would make a good impact on our minimum possible commit latency.

However, the awkward part is that both the FDB and DPDK want to control the main run loop. FDB needs to control it to be able to do deterministic scheduling, but DPDK normally wishes to control it instead. So it’d be a non-trivial, but cool change.

ryanworl · June 20, 2018, 11:58pm

That is a complication I hadn’t considered.

If there is a way to reconcile that in the future, the Seastar framework from ScyllaDB already has an Apache licensed implementation of all the DPDK code they needed for their database.

For reference: https://github.com/scylladb/seastar

dave · June 23, 2018, 2:20pm

I haven’t studied DPDK, but there’s no obvious reason you couldn’t write an INetwork implementation that would hand over control of the run loop to DPDK or Seastar or whatever, although a “reactor” interface would be a little easier. The amount of code to be written is probably on the order of what’s in Net2.actor.cpp now. Be wary, though: this code won’t be under test in simulation!

I think the biggest potential performance improvement might actually be read throughput on large clusters, where batching of multiple reads to the same storage server can’t work very well and so reads tend to be limited by the performance of the network stack.

Topic		Replies	Views
Are spikes of 500ms+ MaxRowReadLatency normal? Using FoundationDB	7	1206	July 11, 2019
Performance of read-only transactions Using FoundationDB performance	6	1570	June 1, 2021
Containerized networking Running FoundationDB	0	53	March 4, 2025
Multiple network threads Using FoundationDB performance	2	1317	August 23, 2021
1Gbps or 10Gbps network cards? Using FoundationDB	9	895	March 5, 2019

DPDK networking interface

Related topics