Record Layer Index splits

dispalt · January 30, 2024, 5:43pm

Hi there, I’ve been working on a simple Spark adapter on top of the Record Layer.

Spark has a notion of pushing filters down to the underlying datastore. That would be a good fit for an index scan or a primary key scan, as opposed to a full table scan with client-side (Spark client) filtering. However, part of planning such a query for Spark is dividing the work into partitions. For a primary key scan that is easy: basically use the getPrimaryKeyBoundaries function. For an index scan, though, I don’t think the partitioning would really matter, right? Indexes aren’t materialized, correct? They are just a pointer to the primary key?
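To make the primary key case concrete, here is a rough sketch of how I picture turning a sorted list of boundary keys into one key range per Spark partition. It is only an illustration: `PartitionPlanner`, `KeyRange`, and `toRanges` are hypothetical names of my own, the keys are plain `Long` values rather than Record Layer tuples, and I am assuming the boundaries arrive sorted and do not include the endpoints of the scan.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper; not part of the Record Layer or Spark APIs.
final class PartitionPlanner {

    /** A half-open key range [low, high); null means unbounded on that side. */
    static final class KeyRange {
        final Long low;
        final Long high;

        KeyRange(Long low, Long high) {
            this.low = low;
            this.high = high;
        }

        @Override
        public String toString() {
            return "[" + (low == null ? "-inf" : low) + ", "
                    + (high == null ? "+inf" : high) + ")";
        }
    }

    /**
     * Turns sorted boundary keys into contiguous, non-overlapping ranges.
     * N boundaries yield N + 1 ranges; an empty list yields one unbounded range.
     */
    static List<KeyRange> toRanges(List<Long> sortedBoundaries) {
        List<KeyRange> ranges = new ArrayList<>();
        Long previous = null; // the first range is unbounded below
        for (Long boundary : sortedBoundaries) {
            ranges.add(new KeyRange(previous, boundary));
            previous = boundary;
        }
        ranges.add(new KeyRange(previous, null)); // the last range is unbounded above
        return ranges;
    }

    public static void main(String[] args) {
        // Assumed boundaries; in practice they would come from the record store.
        for (KeyRange range : toRanges(Arrays.asList(100L, 200L, 300L))) {
            System.out.println(range);
        }
    }
}
```

Each resulting range would then back one Spark input partition, with the pushed-down filter applied inside that range.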

Thanks for the help.
