Storage Queue Size

Hi, I am a software engineer at Snowflake Computing working on FoundationDB. My understanding is that the storage queue size is calculated from pessimistic estimates. When a mutation is added to a storage server’s mutation log, the size of the storage queue (bytesInput - bytesDurable, as reported to the ratekeeper) is incremented by mvccStorageBytes(m):

// Charges each mutation for two PTree insertions: the per-insertion node
// overhead plus the mutation's payload, each counted twice.
static int mvccStorageBytes( MutationRef const& m )
{
    return VersionedMap<KeyRef, ValueOrClearToRef>::overheadPerItem * 2 +
           (MutationRef::OVERHEAD_BYTES + m.param1.size() + m.param2.size()) * 2;
}

This accounts for eight 128-byte PTree nodes being allocated for every mutation. However, this appears to be a worst-case estimate: each PTree insertion does not necessarily allocate four 128-byte nodes (even though VersionedMap::overheadPerItem == 128*4), and applying a mutation sometimes requires only one insertion into the PTree (e.g. a ClearRange mutation), while the mvccStorageBytes calculation charges for two insertions.
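To make the arithmetic concrete, here is a small standalone sketch of that charge. The 12-byte value used for MutationRef::OVERHEAD_BYTES below is a placeholder chosen for illustration, not necessarily the real constant; only the structure of the formula follows the code above.

#include <cstdio>

static const int overheadPerItem = 128 * 4;  // worst-case cost of one PTree insertion: four 128-byte nodes
static const int OVERHEAD_BYTES = 12;        // placeholder for MutationRef::OVERHEAD_BYTES

// Mirrors the mvccStorageBytes formula: two insertions are charged at the full
// four-node cost, and the mutation's key/value payload is counted twice.
static int mvccStorageBytesSketch( int keySize, int valueSize )
{
    return overheadPerItem * 2 + (OVERHEAD_BYTES + keySize + valueSize) * 2;
}

int main()
{
    // A SetValue with a 16-byte key and a 100-byte value is charged
    // 8 * 128 = 1024 bytes of node overhead plus 256 bytes of payload.
    printf( "%d\n", mvccStorageBytesSketch( 16, 100 ) );  // prints 1280
    return 0;
}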

I ran some tests tracking the number of PTree node allocations in the storage queue under a variety of workloads, and there were significantly fewer allocations than the accounting behind bytesInput assumes. Is this a correct interpretation of how bytesInput is calculated, and if so, would it be safe to switch to reporting a more exact storage queue size?

Thank you.

I haven’t looked into the specifics of the mvccStorageBytes accounting, but I can say that bytesInput accounts for the memory used by each mutation as well as some extra overhead per version. You should be able to make the version overhead relatively insignificant by having each commit contain many mutations; just make sure you account for it in your tests.
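As a toy illustration of that amortization (the numbers below are made up): if each version adds a fixed overhead of V bytes and a commit carries N mutations of charged size B, the per-mutation contribution to bytesInput is roughly B + V / N, so a large N makes the mutation payload dominate.

#include <cstdio>

int main()
{
    const double V = 1000.0;  // hypothetical per-version overhead, in bytes
    const double B = 1280.0;  // charged size of one mutation (as in the earlier sketch)
    const int mutationsPerCommit[] = { 1, 10, 100, 1000 };
    for ( int N : mutationsPerCommit )
        printf( "N=%4d  per-mutation charge ~= %.1f bytes\n", N, B + V / N );
    return 0;
}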

I think that if you can provide a tighter upper bound on the actual memory being used that is still cheap to compute, that sounds like a good change. I do recommend keeping it an upper bound, though, because otherwise you may find that the storage server behaves poorly under some types of workloads.

If you do make a change like that while keeping the same queue size limits, it would be somewhat similar to increasing the queue size in the current implementation. I don’t have much experience running larger storage queues, but I’m not aware of any reason why that would cause problems. If you wanted, you could experiment with it and make sure everything still performs well when the queues are full.
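As a concrete illustration of what a tighter but still upper-bound estimate might look like, here is a standalone sketch that follows the earlier observation that a ClearRange needs only one PTree insertion while other mutations are still charged for two. The constants and the MutationType enum are stand-ins for illustration, not FDB’s actual types, and whether one insertion is always enough for a clear would need to be verified against the storage server code.

#include <cstdio>

static const int nodeOverheadPerInsertion = 128 * 4;  // pessimistic four-node cost, as above
static const int mutationOverheadBytes = 12;          // placeholder for MutationRef::OVERHEAD_BYTES

enum class MutationType { SetValue, ClearRange };

// Charge one insertion for clears and two for everything else; each insertion
// keeps the worst-case node overhead, so under the assumption above the result
// remains an upper bound on the memory actually allocated.
static int tighterStorageBytes( MutationType type, int keySize, int valueSize )
{
    int insertions = (type == MutationType::ClearRange) ? 1 : 2;
    return (nodeOverheadPerInsertion + mutationOverheadBytes + keySize + valueSize) * insertions;
}

int main()
{
    // Same 16-byte / 100-byte example as before: a SetValue is still charged
    // 1280 bytes, while the ClearRange charge drops to 640 bytes.
    printf( "set:   %d\n", tighterStorageBytes( MutationType::SetValue, 16, 100 ) );
    printf( "clear: %d\n", tighterStorageBytes( MutationType::ClearRange, 16, 100 ) );
    return 0;
}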

Thank you. The tests we ran continued to account for the version overhead and only changed how the number of versioned map node allocations was tracked. We will make sure that we continue to track an upper bound on the storage queue size and never underreport it.