Copying data from one "table" to another

ThomasJ · November 10, 2018, 7:48pm

This may be incredibly basic but I’m having issues to figure it out on my own.

I have this structure
component/sub/type/key = timestamp
so for example
rack/test-1/measurement1/space = 1541879017

Now I want to copy data from one subspace to another, e.g. rack/test-1/measurement1 -> rack/test-1/measurement2/space = 1541879017

I have millions of keys in the test-1 and I want to get them to test-2. So I wrote this go code that reads through the range and copies data into the next one. Because I have 5s to finish up the transaction, I copy only smaller % of the ranges. However, I don’t know how to define the end range properly, so I use 0xFF. Works fine, but it never finishes. It essentially runs forever.

So my questions, is how to properly define the end range?

func main() {

	fdb.MustAPIVersion(510)
	db, err := fdb.Open("./fdb.cluster", []byte("DB"))
	if err != nil {
		panic(err)
	}

	dir, err := directory.CreateOrOpen(db, []string{"rack"}, nil)

	if err != nil {
		panic(err)
	}

	sub := dir.Sub("test-1")

	var lastKey string
	var i int
	for {
		_, err = db.Transact(func(tr fdb.Transaction) (ret interface{}, err error) {

			ri := tr.GetRange(fdb.KeyRange{
				Begin: sub.Pack(tuple.Tuple{"measurement1", lastKey}),
				End:   fdb.Key{0xFF},
			}, fdb.RangeOptions{Limit: 10000}).Iterator()

			for ri.Advance() {
				kv := ri.MustGet()
				r, err := sub.Unpack(kv.Key)
				if err != nil {
					return nil, err
				}
				lastKey = r[1].(string)

				tr.Set(sub.Pack(tuple.Tuple{"measurement2", lastKey}), []byte{})
			}

			return
		})
		if err != nil {
			panic(err)
		}

	}
}

alloc · November 12, 2018, 5:43pm

The way the range is set up, it will scan from the beginning of your table all the way to the end of the database, whereas I think you only want your scan to go to the end of the table. In some of the other languages, there is a method on subspaces to get the range of keys corresponding to that subspace. That doesn’t appear to be the case in our Go bindings, for whatever reason, but something like:

begin := sub.Pack(tuple.Tuple{"measurement1", lastKey})
end := append(sub.Pack(tuple.Tuple{"measurement1"}), 0xff)

Should work to define your begin and end ranges so that it doesn’t go all the way to the end any more. Also, I think there’s a bug where keys right at the end of ranges get written twice (which isn’t the end of the world because the operation is idempotent). I think you could fix that by adding a zero byte to the end of your beginning range. Something like:

begin := append(sub.Pack(tuple.Tuple{"measurement1", lastKey}), 0x00)
end := append(sub.Pack(tuple.Tuple{"measurement1"}), 0xff)

ThomasJ · November 12, 2018, 6:30pm

That is very helpful. I don’t think I would figured this one on my own.

One other question.

Is there a way to scan in reverse, or from the end to the beginning?

alloc · November 12, 2018, 6:46pm

Yeah, you can use the RangeOptions class and specify that the range should be read in reverse. Something like:

ri := tr.GetRange(fdb.KeyRange{Begin: begin, End: end},
                            fdb.RangeOptions{Limit: limit, Reverse: true}))

Then much of the logic for applying the operation across multiple transactions remains the same, except you keep Begin set to sub.Pack(tuple.Tuple{"measurement1"})) and end is the thing you set to sub.Pack(tuple.Tuple{"measurement1", lastKey})). Note that the end of the range is always exclusive, so you don’t need to do the thing where you add something to the key to make sure it doesn’t get read/added twice.

ThomasJ · November 12, 2018, 6:47pm

Fantastic. Thank you

ajbeamon · November 26, 2018, 5:33pm

Subspace implements ExactRange, so it can be passed in as the first argument of a GetRange call. You can also call subspace.FDBRangeKeys() to get the begin and end keys of the range.

ravilution · July 29, 2019, 8:46pm

Can you share your final code? For some reason when I append 0xff to the key, I don’t get the expected result. I am not a GO LANG expert. I am making a mistake somewhere.

ThomasJ · July 29, 2019, 8:50pm

I don’t have an exact code for you, but wrote this quickly, where it iterates over the table from begging to the end.

Hope it helps

package main

import (
	"github.com/apple/foundationdb/bindings/go/src/fdb"
	"github.com/apple/foundationdb/bindings/go/src/fdb/directory"
	"github.com/apple/foundationdb/bindings/go/src/fdb/tuple"
)

func main() {

	fdb.MustAPIVersion(510)

	// Open the default database from the system cluster
	db, err := fdb.Open("./fdb.cluster", []byte("DB"))
	if err != nil {
		panic(err)
	}

	dir, err := directory.CreateOrOpen(db, []string{"test"}, nil)

	if err != nil {
		panic(err)
	}

	sub := dir.Sub("test2")

	var lastKey string
	_, err = db.Transact(func(tr fdb.Transaction) (ret interface{}, err error) {

		ri := tr.GetRange(fdb.KeyRange{
			Begin: append(sub.Pack(tuple.Tuple{"test3", lastKey}), 0x00),
			End:   append(sub.Pack(tuple.Tuple{"test3"}), 0xff),
		}, fdb.RangeOptions{Limit: 5000, Reverse: false}).Iterator()
		
		for ri.Advance() {
			kv := ri.MustGet()
			r, err := sub.Unpack(kv.Key)
			if err != nil {
				return nil, err
			}

			lastKey = r[1].(string)
		}

		return
	})
	
	if err != nil {
		panic(err)
	}
}

ravilution · July 29, 2019, 8:54pm

Instead of Iterator and Advance, I am using GetSliceWithError. Not sure if that is causing the issue. I will troubleshoot and let you know.

ravilution · September 19, 2019, 7:28pm

Hi Thomas, life happened. I was unable to continue my project. I am back now. Can you try and tell me if your code works if Begin: append(sub.Pack(tuple.Tuple{"test3"}), 0x00)

ravilution · September 19, 2019, 9:56pm

Hi @ThomasJ, I solved it. Great to back. I had to knock off the last bit before adding 0xFF. Below is the code.

begin = searchSS.Pack(tuple.Tuple{"user", search})
baseEndKey := searchSS.Pack(tuple.Tuple{"user", search})
end := append(baseEndKey[:len(baseEndKey)-1], 0xff)

Topic		Replies	Views
Foundation DB Go Lang Pagination Using FoundationDB	16	1960	December 4, 2023
Range returned by `Subspace#range` are inclusive on both ends Using FoundationDB bindings	3	323	November 17, 2023
Ranges without explicit end (Go) Using FoundationDB bindings	12	3273	October 25, 2018
Range keys for subspace don't include first or last key Using FoundationDB	3	873	September 16, 2020
How to get exact range of keys using fdb_transaction_get_range in C Programming Using FoundationDB bindings	4	1550	May 20, 2019

Copying data from one "table" to another

Related topics