Many failed transactions during replication

Daniel-B-Smith · January 7, 2021, 8:51pm

When I have a cluster doing nothing but replication, I sometimes see status output suggesting that most of the replication transactions are failing. An example status:

Workload:
  Read rate              - 4568 Hz
  Write rate             - 17 Hz
  Transactions started   - 84 Hz
  Transactions committed - 2 Hz
  Conflict rate          - 0 Hz

Am I reading too much into the transactions started/committed stats? Or is something actually going wrong here?

SteavedHams · January 13, 2021, 9:30am

I believe data distribution only runs a transaction when its work queue changes, which is approximately when a shard is queued, started, or finished, so the committed transaction rate would not be high. Two shards changing state per second is reasonable.
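
For anyone who wants to compare these rates programmatically rather than by eye, here is a minimal sketch. It assumes the machine-readable status (for example from `status json` in `fdbcli`) exposes the rates under `cluster.workload.transactions.<name>.hz`; that layout is an assumption here, so check it against your own output. The helper names `transaction_rates` and `summarize` are hypothetical, and the sample values are just the figures from the status quoted above.

```python
import json


def transaction_rates(status):
    """Return (started, committed, conflicted) rates in Hz.

    Assumes the layout cluster.workload.transactions.<name>.hz,
    which should be verified against real status output.
    """
    tx = status["cluster"]["workload"]["transactions"]
    return tuple(tx[name]["hz"] for name in ("started", "committed", "conflicted"))


def summarize(status):
    """Summarize the rates and the fraction of started transactions that commit."""
    started, committed, conflicted = transaction_rates(status)
    return {
        "started_hz": started,
        "committed_hz": committed,
        "conflicted_hz": conflicted,
        # None when nothing was started, to avoid dividing by zero.
        "commit_fraction": committed / started if started else None,
    }


# Figures from the status quoted in the question, arranged in the assumed layout.
sample = json.loads("""
{
  "cluster": {
    "workload": {
      "transactions": {
        "started":    {"hz": 84},
        "committed":  {"hz": 2},
        "conflicted": {"hz": 0}
      }
    }
  }
}
""")

if __name__ == "__main__":
    print(summarize(sample))
```

A low commit fraction with a conflict rate of 0 Hz fits the explanation above: the gap between started and committed is not by itself a count of failed transactions.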
