
Is the compaction blog ready? #2

Open
iAziz786 opened this issue Jan 16, 2022 · 16 comments
@iAziz786

Hey @adambcomer, hope you are doing well! I have been following this blog series and it's really great.

I'm now waiting for the remaining parts. I know you might be busy, which is probably why you haven't posted them yet.

If possible, could you please discuss how to go about compaction? It's currently a blocker for finishing my database engine.

Thanks anyway :)

@adambcomer (Owner)

Hello @iAziz786,

Unfortunately, I have gotten very busy finishing university. I assure you that I haven't forgotten this project and will complete it.

Since you're asking about compaction, I'll assume you've built the Memtable, Write Ahead Log, and SSTables. Leveled Compaction, although scary-sounding, isn't very complex. It functions in three steps: joining SSTables with overlapping key ranges, removing overwritten records, and migrating the new SSTable up a level.

First, joining SSTables with common key ranges. Currently, I'm working on some graphics to easily explain this, but I'll try my best without them. To do this, iterate over each of the identified tables and take the union of the sets of records.

Second, when joining each set of records, you might encounter the same key in multiple SSTables. In my tutorial, I store the timestamp the record was written. Use this value to keep the latest record and discard the rest.

Third, the prior two steps will produce a new set of records that are written into a new SSTable. Put the new SSTable on the next level and delete the old SSTables.

Finally, repeat this process for each level.

You can see that this process will naturally move records into progressively larger files and clear out old records on each pass.
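
To make steps one and two concrete, here's a minimal sketch in Rust with illustrative names of my own (not the tutorial's actual types): it merges the records of overlapping SSTables and keeps only the newest copy of each key.

```rust
use std::collections::BTreeMap;

/// A simplified record, mirroring the fields in the tutorial's WAL format.
/// (Names here are illustrative, not the tutorial's actual types.)
#[derive(Clone, Debug)]
struct Record {
    key: Vec<u8>,
    value: Option<Vec<u8>>, // None represents a tombstone
    timestamp: u128,        // microseconds, used to pick the winner
}

/// Steps one and two: merge the records of overlapping SSTables and,
/// for each key, keep only the record with the newest timestamp. The
/// result is sorted and would be written out as the new SSTable on the
/// next level (step three).
fn compact(tables: Vec<Vec<Record>>) -> Vec<Record> {
    let mut latest: BTreeMap<Vec<u8>, Record> = BTreeMap::new();
    for table in tables {
        for record in table {
            match latest.get(&record.key) {
                // An equal-or-newer record already won; discard this one.
                Some(existing) if existing.timestamp >= record.timestamp => {}
                _ => {
                    latest.insert(record.key.clone(), record);
                }
            }
        }
    }
    // BTreeMap iterates in key order, so the output stays sorted.
    latest.into_values().collect()
}
```

One caveat worth knowing: tombstones can generally only be discarded when compacting into the deepest level, because an older copy of the key may still exist further down.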

Hope this helps,
Adam Comer

@iAziz786 (Author)

Hi @adambcomer, thanks for taking the time to explain it so well.

Basically, I'm at the point where the last tutorial ended, which means I haven't created the SSTables yet. I guess once they're ready, I'll be able to do Leveled Compaction. I'm working on making the SSTable work right now.

Will keep you updated,
Have a nice day :)

@iAziz786 (Author)

Hi @adambcomer,

Hope you are doing well :)

So basically I'm stuck implementing the sstable.rs file. I'm not able to figure out how to go about it 🤷‍♂️ Can I ask you for one big favor?

If you could let me know the methods this file might contain, with just pseudocode for each method, then I think I'll be able to complete the implementation. Just rough steps for when to write to the file, when to load into memory, etc.

I tried to comprehend RocksDB's block-based table format for SSTables but didn't understand it clearly.

I know you might be very occupied with college. Anyway, stay safe!

@iAziz786

@adambcomer (Owner) commented Jan 24, 2022

@iAziz786

A Sorted Strings Table, SSTable for short, is an immutable set of records from the Memtable. You can use any disk format you want. I plan on using the same format as the WAL since this is an explainer blog series. I've copied the WAL block structure for reference.

+---------------+---------------+-----------------+-...-+--...--+-----------------+
| Key Size (8B) | Tombstone(1B) | Value Size (8B) | Key | Value | Timestamp (16B) |
+---------------+---------------+-----------------+-...-+--...--+-----------------+
Key Size = Length of the Key data
Tombstone = Whether this record was deleted and therefore has no value
Value Size = Length of the Value data
Key = Key data
Value = Value data
Timestamp = Timestamp of the operation in microseconds
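
To make the layout concrete, here is a sketch of serializing one record in this format. The function name and the choice of little-endian integers are my assumptions; check the WAL code for the exact encoding.

```rust
use std::io::{self, Write};

/// Serialize one record using the block layout above. Field order and
/// sizes follow the diagram; little-endian integers are an assumption.
fn write_record<W: Write>(
    out: &mut W,
    key: &[u8],
    value: Option<&[u8]>, // None represents a tombstone
    timestamp: u128,
) -> io::Result<()> {
    out.write_all(&(key.len() as u64).to_le_bytes())?;   // Key Size (8B)
    out.write_all(&[value.is_none() as u8])?;            // Tombstone (1B)
    let value = value.unwrap_or(&[]);
    out.write_all(&(value.len() as u64).to_le_bytes())?; // Value Size (8B)
    out.write_all(key)?;                                 // Key
    out.write_all(value)?;                               // Value
    out.write_all(&timestamp.to_le_bytes())?;            // Timestamp (16B)
    Ok(())
}
```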

Building the SSTable requires integrating it with the Memtable, WAL, and the other SSTables, so it's not obvious in isolation what the methods on the SSTable and Database structs will be.

I'll look into getting the code on GitHub for you to preview, and publish the articles later.

adambcomer added the question (Further information is requested) label Jan 24, 2022
adambcomer self-assigned this Jan 24, 2022
@iAziz786 (Author)

@adambcomer

Just one clarification: what benefit do we get from an SSTable being sorted by keys? I know it can be useful for compaction, but how does it help when searching for a value?

Even if it's sorted by key, we'd have to do a linear search over each entry, right?

@adambcomer (Owner)

@iAziz786

Sorting the keys gives O(log n) runtime with binary search.
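
For intuition, here's a minimal sketch over records held sorted in memory (illustrative types, not the tutorial's code):

```rust
/// With records kept sorted by key, a lookup is a binary search:
/// O(log n) comparisons instead of a linear scan over every entry.
fn get<'a>(records: &'a [(Vec<u8>, Vec<u8>)], key: &[u8]) -> Option<&'a [u8]> {
    records
        .binary_search_by(|(k, _)| k.as_slice().cmp(key))
        .ok()
        .map(|i| records[i].1.as_slice())
}
```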

@iAziz786 (Author)

@adambcomer

Assuming the key is not in the Memtable, and the SSTables are structured like the WAL but sorted, how can we binary search over data that's stored in a file?

@adambcomer (Owner)

@iAziz786

The keys are indexed in memory. In the RocksDB BlockBasedTable Format, they use a block-based index. My project's SSTable is similar to the PlainTable Format without the hashing. In that doc, they explain how RocksDB builds an in-memory index using record offsets. Of course, they have many optimizations for a bunch of key-value patterns, but you can ignore those and still get an understanding of how the basic system functions.
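
As a rough sketch of that idea, with names of my own rather than RocksDB's or the tutorial's: the binary search runs over a small in-memory array of (key, offset) pairs, and the file is only touched once you know where to read.

```rust
use std::fs::File;
use std::io::{self, Seek, SeekFrom};

/// One sorted (key, byte offset) pair per record, built by scanning the
/// SSTable file once when it is opened. Only this index lives in memory;
/// the records themselves stay on disk.
struct SSTableIndex {
    entries: Vec<(Vec<u8>, u64)>, // sorted by key
}

impl SSTableIndex {
    /// Binary search the index, then seek the file to the matching
    /// record's offset so it can be decoded with the layout shown earlier.
    fn seek_to_record(&self, file: &mut File, key: &[u8]) -> io::Result<bool> {
        match self.entries.binary_search_by(|(k, _)| k.as_slice().cmp(key)) {
            Ok(i) => {
                file.seek(SeekFrom::Start(self.entries[i].1))?;
                Ok(true) // the caller decodes the record at this position
            }
            Err(_) => Ok(false), // the key is not in this table
        }
    }
}
```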

@iAziz786 (Author)

@adambcomer

Apparently RocksDB has something called FileMetaData, which stores important information like the largest and smallest keys of a file. I assume FileMetaData is an in-memory class created when the database starts, and that there is one FileMetaData per SST file in the database. When doing a Get() query, we would first find the correct file based on the FileMetaData and then do a linear search within that one file.

Does this sound good?

@adambcomer (Owner)

@iAziz786

I think you are missing some foundational knowledge about LSM databases. I highly recommend you read more about the subject to understand the theoretical underpinnings of this database. Reading the original paper should fill the holes in your understanding.

Original paper: https://www.cs.umb.edu/~poneil/lsmtree.pdf

Best of luck,
Adam Comer

@iAziz786 (Author)

@adambcomer

Yeah, that's correct. I lack some fundamental understanding of LSM databases. I'll get back to you after reading the paper.

Thank you!

@huangzixun123

Hey @adambcomer, I have been following this project and I want to know how long it will take to complete. I'm really looking forward to it!

@iAziz786 (Author)

Me too.

@huangzixun123 commented Jun 30, 2023 via email

@muqiuhan

You can check out @adambcomer's other project: WiscKey.
I read the WiscKey and original LSM-Tree papers, which can help you understand the unfinished chapters of this repo.

@huangzixun123

@muqiuhan good advice! thx
