Should I optimize for single reads?

janderland · February 19, 2019, 8:01pm

So I have a blob of data. This blob of data has some metadata associated with it. Both are static and won’t be changing after insertion. Originally, I was planning on creating a key like this:

(category 1, category 2, timestamp, ID, version, timestamp) = blob

Does this key seem too large to anyone? It’s designed this way because I’ll potentially be doing range reads on each prefix of this key (except possibly the last timestamp). Another pro is that a single entry gives me the blob and all its metadata, instead of having to read multiple entries to get all this info.
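
To illustrate why this key layout supports range reads on every prefix, here is a minimal sketch that uses a sorted list of plain Python tuples as a stand-in for an ordered key space. It does not use the FoundationDB client; the sample keys, field meanings, and the `prefix_range` helper are hypothetical and only show the ordering idea.

```python
from bisect import bisect_left

# Hypothetical sample keys shaped like the one above:
# (category 1, category 2, timestamp, ID, version, timestamp)
keys = sorted([
    ("audio", "raw", 1550000000, "a1", 1, 1550000100),
    ("audio", "raw", 1550000500, "a2", 1, 1550000600),
    ("audio", "mixed", 1550000200, "a3", 2, 1550000300),
    ("video", "raw", 1550000000, "v1", 1, 1550000050),
])


def prefix_range(sorted_keys, prefix):
    """Return every key whose leading elements equal `prefix`.

    Because the keys are sorted, all matches sit in one contiguous run,
    which is what makes a prefix lookup a single range read.
    """
    # A shorter tuple sorts before any longer tuple it is a prefix of,
    # so bisect_left lands on the first possible match.
    start = bisect_left(sorted_keys, prefix)
    n = len(prefix)
    matches = []
    for key in sorted_keys[start:]:
        if key[:n] != prefix:
            break  # left the contiguous run of matching keys
        matches.append(key)
    return matches
```

Calling `prefix_range(keys, ("audio",))` returns all three audio entries, while `prefix_range(keys, ("audio", "raw"))` narrows that to two. Any leading portion of the key works the same way, but a field cannot be queried on its own unless everything before it is fixed.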

Am I using the abstractions properly here? Should I be using directories for any of these prefixes instead of simple keys?

SteavedHams · February 19, 2019, 9:08pm

This sounds perfectly fine for what you are doing. Note that a value cannot be larger than 100,000 bytes, so the blob has to fit within that limit. The main reason to split your unchanging blobs into multiple key-value pairs is if you want to be able to read some of the split entries without reading the entire blob value. The current cost of splitting is that the repeated key prefixes are stored in full on disk, because the ssd storage engine does not compress keys. The upcoming Redwood storage engine will have key prefix compression, so such splitting will become cheaper.
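
If a blob does need to be split, one possible approach is to append a chunk index to the key and store each slice as its own value. The sketch below is plain Python and only produces and reassembles the key-value pairs. The `split_blob` and `join_blob` helpers and the 10,000-byte chunk size are assumptions for illustration, not part of any FoundationDB API.

```python
MAX_VALUE_BYTES = 100_000  # value size limit mentioned above
CHUNK_BYTES = 10_000       # assumed chunk size, chosen for illustration


def split_blob(key_prefix, blob, chunk_bytes=CHUNK_BYTES):
    """Split `blob` into (key, value) pairs keyed by key_prefix + (index,)."""
    if not 0 < chunk_bytes <= MAX_VALUE_BYTES:
        raise ValueError("chunk_bytes must be between 1 and MAX_VALUE_BYTES")
    return [
        (key_prefix + (offset // chunk_bytes,), blob[offset:offset + chunk_bytes])
        for offset in range(0, len(blob), chunk_bytes)
    ]


def join_blob(pairs):
    """Reassemble a blob from its chunks, ordered by chunk index in the key."""
    return b"".join(value for _key, value in sorted(pairs))
```

Since the chunk index is the last key element, all chunks of one blob stay adjacent in key order, so a range read on the shared prefix returns them in sequence, and a single chunk can still be fetched by its full key.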
