Getting the number of key/value pairs

dave · April 22, 2018, 6:13pm

I’ll add that

(1) it would be possible for a storage engine to offer O(log N) counting and key selector offsets (this would require extensions to the IKeyValueStore interface. it would be more straightforward if we move to a MVCC interface to storage)

(2) if an application or layer needs these operations (or related ones like picking a key uniform randomly or determining the “rank” of a value in some index) to be fast for some specific data it can make them reasonably efficient via (somewhat tricky) data modeling. There used to be a released python layer RankedSet that demonstrated this; I can’t remember how good the implementation was. Basically the approach is to store, besides, the data, a number of increasingly sparse samples of the data (say, 1/100 of keys, then 1/10000 of keys, etc) and with each of these samples an accurate, atomically incremented counter of the number of keys between it and the next sampled key at the same level. When adding or removing data you update all these (with careful use of snapshot reads and manual key ranges to reduce conflicts), and when you want to count or offset from a particular point. You can “count” whatever metric or metrics you want rather than FoundationDB k/v pairs specifically.

Topic		Replies	Views
Missing API for getting just the count of a key range? FoundationDB Core	13	3564	September 10, 2018
Sum of key-value sizes seems incorrect Using FoundationDB performance	3	913	August 2, 2021
Tools to estimate size per directory Using FoundationDB	2	491	June 30, 2020
Limiting the cardinality of a key range Using FoundationDB	1	1246	August 27, 2018
[Java] API to get only the keys? Using FoundationDB	2	530	December 23, 2018

Getting the number of key/value pairs

Related topics