Suppose we have n tenants and one FoundationDB database. Each tenant has a set of infrequently updated keys that need to be read in every transaction. To avoid read hot spots on these keys, clients use the value of the metadataVersionKey to decide whether their local cache is still valid, and writers promise to update the metadataVersionKey any time they change one of these keys.
The problem with this is that writing one tenant’s metadata invalidates every tenant’s cache, and causes every in-flight transaction that has read the metadataVersionKey to fail with a conflict.
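To make the failure mode concrete, here is a minimal in-memory sketch (a plain dict standing in for the database, with a counter standing in for the real `\xff/metadataVersion` key; the key names are made up): a write to one tenant’s metadata bumps the shared version and silently invalidates every other tenant’s cache.

```python
# Plain-dict stand-in for the database; "metadata_version" stands in for the
# real \xff/metadataVersion key, which writers bump with a versionstamped atomic op.
db = {"metadata_version": 0, "tenant/A/config": "a1", "tenant/B/config": "b1"}

class Client:
    """Caches one tenant's config, keyed on the shared metadata version."""
    def __init__(self, tenant):
        self.tenant = tenant
        self.cached_version = None
        self.cached_config = None

    def read_config(self):
        version = db["metadata_version"]        # read in every transaction (cheap)
        if version != self.cached_version:      # any change anywhere -> cache miss
            self.cached_config = db[f"tenant/{self.tenant}/config"]
            self.cached_version = version
        return self.cached_config

def write_config(tenant, value):
    db[f"tenant/{tenant}/config"] = value
    db["metadata_version"] += 1                 # invalidates EVERY tenant's cache

a, b = Client("A"), Client("B")
a.read_config(); b.read_config()
write_config("A", "a2")                         # only tenant A's metadata changed...
assert b.cached_version != db["metadata_version"]   # ...but B's cache is stale too
```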
Some ideas for dealing with this:
1. Read the metadataVersionKey at snapshot isolation (so it adds no read conflict), verify its value, and add a read conflict range on just the keys you care about. This way at least you don’t abort your transaction if another tenant’s metadata changes.
2. Have one key per tenant that you can use to check that tenant’s cache’s validity. If another tenant’s metadata changes, you only need to re-read one key.
3. Batch writes that require changes to the metadataVersionKey, so that you can change the key less frequently.
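The per-tenant key idea can be sketched with a plain dict standing in for the database (key names are hypothetical): the shared metadataVersionKey only signals that *some* tenant changed, and the cheap per-tenant version key then decides whether this tenant’s cache is actually stale.

```python
# Plain-dict stand-in for the database; key names are hypothetical.
db = {
    "metadata_version": 0,          # shared key (\xff/metadataVersion in FDB)
    "tenant/A/version": 0,          # per-tenant cache-validity key
    "tenant/A/config": "a1",
}

cached = {"global": 0, "tenant": 0, "config": "a1"}   # tenant A's client cache

def read_config():
    """Return tenant A's config, re-reading as little as possible."""
    if db["metadata_version"] == cached["global"]:
        return cached["config"]                  # nothing changed anywhere
    cached["global"] = db["metadata_version"]
    if db["tenant/A/version"] == cached["tenant"]:
        return cached["config"]                  # another tenant changed: one extra key read
    cached["tenant"] = db["tenant/A/version"]    # our tenant changed: reload the values
    cached["config"] = db["tenant/A/config"]
    return cached["config"]

def write_config(tenant, value):
    db[f"tenant/{tenant}/config"] = value
    db[f"tenant/{tenant}/version"] = db.get(f"tenant/{tenant}/version", 0) + 1
    db["metadata_version"] += 1                  # still bumped, per the protocol

db["tenant/B/config"] = "b1"                     # another tenant exists
write_config("B", "b2")                          # B's write bumps the shared version...
assert read_config() == "a1"                     # ...but A re-reads only tenant/A/version
```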
Do these make sense? Are there other techniques here?
FWIW, the way the Record Layer handles this is that we always read the meta-data version stamp key at SNAPSHOT isolation level:
We use this key to cache some configuration information that each client needs to load when it opens a record store and that should be maintained transactionally with the record store’s data. If the meta-data version has changed, then we need to go read the (no longer cached) key. If the value hasn’t changed, then we add a read conflict range only on the cached keys, so the transaction fails if the cached values change, but changes to the meta-data version stamp key itself don’t result in transaction failures.
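A sketch of the shape of that read path (not the Record Layer’s actual code): a toy transaction class stands in for a real cluster so this runs standalone; in the actual Python bindings the snapshot read would be `tr.snapshot[key]` and the manual conflicts would come from `tr.add_read_conflict_key` / `tr.add_read_conflict_range`.

```python
class Tx:
    """Toy transaction: tracks read-conflict keys the way an FDB client would."""
    def __init__(self, db):
        self.db = db
        self.read_conflicts = set()

    def snapshot_get(self, key):           # snapshot read: records no conflict
        return self.db.get(key)

    def get(self, key):                    # normal read: records a conflict
        self.read_conflicts.add(key)
        return self.db.get(key)

    def add_read_conflict_key(self, key):  # manual conflict, as the bindings allow
        self.read_conflicts.add(key)

VERSION_KEY = "metadata_version"           # stands in for \xff/metadataVersion

def load_cached(tr, cache):
    """Snapshot-read the version key; put read conflicts only on the data keys."""
    version = tr.snapshot_get(VERSION_KEY)
    if version != cache["version"]:
        cache["version"] = version
        for k in cache["keys"]:
            cache["values"][k] = tr.get(k)       # real read, real conflict
    else:
        for k in cache["keys"]:
            tr.add_read_conflict_key(k)          # cache hit: conflict on data keys only
    return cache["values"]

db = {VERSION_KEY: 7, "cfg/a": "x", "cfg/b": "y"}
cache = {"version": 7, "keys": ["cfg/a", "cfg/b"], "values": {"cfg/a": "x", "cfg/b": "y"}}
tr = Tx(db)
load_cached(tr, cache)
# The version key never enters the conflict set, so a bump of the
# metadata version by an unrelated tenant cannot fail this transaction.
assert VERSION_KEY not in tr.read_conflicts
assert tr.read_conflicts == {"cfg/a", "cfg/b"}
```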
Because we’re only using this to cache one key per record store (which I believe should count as a “tenant” according to the post) with a relatively small value, I don’t think we’d gain too much from a separate “per-tenant cache key”, but I could see other use cases that have larger cached values wanting to have a separate “cache invalidation key” from their “data key(s)”.
I think a user could also implement (3) using our system, but as this value changes each time a record store is upgraded, the procedure would be to upgrade multiple record stores at once, which could be done. It doesn’t scale particularly well to many record stores, but if there are many record stores, then the need to cache (assuming we’re concerned about hot shards rather than request latency) is relatively low.
Good topic. We’re not yet live with this, but we are heading towards Option 1 + Option 2 in our deployment. Every time some tenant-specific metadata is updated, we will bump both the metadataVersion and a tenant-specific key, which keeps the invalidation cost low for all the other tenants and adds no overhead on the happy path.
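The write path for that combined scheme can be sketched like so (a plain dict again stands in for a transaction; key names are hypothetical). In real FDB both version bumps would be versionstamped-value atomic ops (e.g. `tr.set_versionstamped_value` in the Python bindings) issued in the same transaction as the data write, so all three commit atomically.

```python
# Plain-dict stand-in for one transaction's writes; key names are hypothetical.
db = {"metadata_version": 0, "tenant/A/version": 0, "tenant/A/config": "a1"}

def update_tenant_metadata(tenant, key, value):
    """Write the data, bump this tenant's version key, bump the shared key."""
    db[f"tenant/{tenant}/{key}"] = value
    db[f"tenant/{tenant}/version"] += 1   # this tenant's clients reload their cache
    db["metadata_version"] += 1           # everyone else just re-checks one small key

update_tenant_metadata("A", "config", "a2")
assert db["metadata_version"] == 1 and db["tenant/A/version"] == 1
```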