Це відео не доступне.
Перепрошуємо.

The Secret Sauce Behind NoSQL: LSM Tree

Поділитися
Вставка
  • Опубліковано 29 сер 2022
  • Subscribe to our weekly system design newsletter: bit.ly/3tfAlYD
    Checkout our bestselling System Design Interview books:
    Volume 1: amzn.to/3Ou7gkd
    Volume 2: amzn.to/3HqGozy
    Digital version of System Design Interview books: bit.ly/3mlDSk9
    Animation tools: Illustrator and After Effects
    ABOUT US:
    Covering topics and trends in large-scale system design, from the authors of the best-selling System Design Interview series.

КОМЕНТАРІ • 189

  • @Kxneki2433
    @Kxneki2433 6 місяців тому +41

    IMPORTANT: Don't forget the Memtable is stored in memory, so if the system crashes, that data will be lost. To avoid losing data, we can maintain a separate log file on disk. Every time we write to the Memtable, we'll also append that write to the log file (no need to sort it as we just use it to restore after a crash). Then once the Memtable contents get written out to a SSTable file, we can erase the log file. That way the log helps us avoid losing writes stuck in memory when a crash happens.

    • @jerichaux9219
      @jerichaux9219 2 місяці тому

      This seems like a fairly important detail.

    • @sujithmenonp
      @sujithmenonp Місяць тому

      Kafka is used as the logging system by default in most scenarios

  • @sealuke2724
    @sealuke2724 Рік тому +93

    Bruh, this is just awesome... keep going

  • @JoelBrubaker
    @JoelBrubaker Рік тому +178

    This is the perfect amount of depth and overview I’m looking for. Great videos and visuals!!

  • @phanphong3533
    @phanphong3533 Рік тому +21

    This guy is better than my teamlead in term of explaining a concept on NoSQL, thank you , make my day

  • @CommandantNOVA
    @CommandantNOVA Рік тому +53

    LSM trees are actually used a lot in modern SQL databases as well. the idea is to represent a relational table as a series of packed records - [k: primary key v: field 1, field 2]. Indices can be created on other fields by creating additional k:v pairs - [k: field 1 v: primary key] so an index scan just becomes two NoSQL lookups for each result.

    • @romannasuti25
      @romannasuti25 Рік тому +11

      Spot on. One thing to keep in mind is that all databases, SQL or noSQL, are based on the principles of key value operations and the transaction consistency algorithms that allow for data integrity. The core calculus and proofs behind these systems are all done in the perspective of only two types of operation: read and write, each to some concept of a key.
      These algorithms vary wildly and come with varied pros and cons, some with very weird limitations that actually DO prevent SQL semantics (the FaunaDB and FoundationDB distributed algorithms require the W/R set to be known before starting the transaction, and SQL requires conditional and unbounded W/R sets to fully abide by the spec).

    • @greyowl3787
      @greyowl3787 Рік тому +2

      @@romannasuti25 what do you mean by “W/R set” ?

    • @romannasuti25
      @romannasuti25 Рік тому +1

      @@greyowl3787 write/read. Basically, Fauna and Foundation use a distributed transaction system that can only provide full ACID guarantees with exact read and write keys known before the transaction even starts. This means dependent querying is usually not possible, like "SELECT name IN users WHERE id = (SELECT friend IN users WHERE name = x);" simply because the result of the inner query changes the write set of the outer query based on its result. In some cases a good query engine can narrow down to a fixed "maybe W/R" set if the results are well bounded (either single element or known countable range), but generally they don't bother with SQL support for this reason.

    • @akash-kumar737
      @akash-kumar737 Рік тому +3

    • @ismailmo4
      @ismailmo4 Рік тому +1

      @@greyowl3787 write/read i assume

  • @mr_nature
    @mr_nature Рік тому +25

    I appreciate your efforts. Thanks for making system design more palatable than ever.

  • @maxil122
    @maxil122 Рік тому +15

    That's the best system design content I have ever seen on youtube ! This channel is absolutely amazing. It must be tough to squeeze all that valuable knowledge into less than a 10 minutes video. Keep up the excellent work!

  • @AminAramoon
    @AminAramoon Рік тому +12

    These videos are superb man, keep up the good work

  • @bazoo513
    @bazoo513 Рік тому

    This is the most information-dense video on what's so special about NoSQL databases I ever saw. Not a word was superfluous, but the key concepts were clearly transmitted.

  • @nishantgoel769
    @nishantgoel769 5 місяців тому

    One of the concise video to understand how elastic/lucene is using these things to fast write and read.
    Great work man!

  • @Andrew-rc3vh
    @Andrew-rc3vh Рік тому +11

    That's a cool trick. So what it is essentially doing is spreading out the computer's resources across time, where with a traditional database you will get lots of spikes in processing on the timeline when doing reads and writes. Mind you the attraction of SQL was that you could have multiple indexes and create custom views on the data in a highly relational way, but granted that was very expensive on resources. I think traditional databases were mostly optimised for the deficiencies of the mechanical hard disk drive. I think it may end up as redundant in future as we store our data more on chips.
    Thanks for the video. I like videos that don't waste your time with long BS intros.

    • @BRBearUSA
      @BRBearUSA Рік тому +1

      You read my mind when it comes to "spreading the computer's resources across time"... Which for some use cases makes perfect sense, but not all use cases.
      Not sure I agree with the "deficiencies of mechanical HDDs" part of your comment though. But great comment overall.

  • @arthursoares610
    @arthursoares610 Рік тому +12

    The dark mode was awesome. I think it could be the default from now on

    • @guptaanmol184
      @guptaanmol184 Рік тому

      I came here to say this! +1 for dark more, finally!

  • @rafaelacioly3252
    @rafaelacioly3252 Рік тому +1

    This channel is by far the best channel that I've found on yt about tech

  • @lechprotean
    @lechprotean Рік тому +20

    great, you make it sound so simple, I'm writing my own nosql db this weekend ;)

  • @danielgospodinow
    @danielgospodinow 5 місяців тому

    I can't recall the last time I stumbled upon such great material! Fascinating work!

  • @bardhan.abhirup
    @bardhan.abhirup Рік тому +8

    These videos are incredible! Very well paced and presented!

  • @frederickbarbarossa2746
    @frederickbarbarossa2746 Рік тому

    besides bloom filter, a sparse index is to help find a key quickly so we only look into a small number of sstables

  • @CppExpedition
    @CppExpedition Рік тому +2

    LOVE THE DARK BACKGROUND! :D:D:D:D (Also the video of course!!)

  • @BRBearUSA
    @BRBearUSA Рік тому

    VERY informative without going into too much complexity. THANKS and congrats for a great video. I'm an MS SQL Server DBA, and the high level explanation you provided was awesome. Thanks again. Best, R.

  • @DK-ox7ze
    @DK-ox7ze Рік тому +1

    Great explanation. Resolved all my doubts on how NoSql DBs work. However, I wanted to understand
    1) Whether the balanced tree and keys in sorted set is only the object key with pointer to data value or it also contains the actual data?
    2) Can a NoSql DB index multiple keys?
    3) Why can't SQL DB also implement flushing mechanism in order to speed up writes? I know that they are highly consistent so they need to persist data to disk, but they can simply append the entry in a log file just like NoSql DBs do, and in case of a network partition, first check the log file to sync data in actual database?

    • @bojandolinar1535
      @bojandolinar1535 Рік тому

      Re 3 afaik that's what they already do. But sooner or later it has to write them to b-tree, which I guess is the real bottleneck.

  • @LeoFuso
    @LeoFuso Рік тому +1

    Great work! I wish I had watched this video before trying to learn the architecture behind RocksDB by only looking their documentation, hahaha! Awesome work!

  • @richsadowsky8580
    @richsadowsky8580 Рік тому

    Really awesome overview of how SQL and NoSQL differ. Agree with Joel below me just the right level of detail to provide value.

  • @susingh2
    @susingh2 Рік тому

    I never experienced such a communicative video with such a simple and easy explanation.. Thanks Alex ..Please keep it up and upload more such videos. I have anyway bought both volume of System Design Book. Thank you so much !!!

  • @quirkyquerty
    @quirkyquerty 3 місяці тому

    in their book, bloom filters are stored on disk, but here they're shown to be in memory. Hopefully we'll get some eventual consistency

  • @anilkaliya3375
    @anilkaliya3375 8 місяців тому

    Nicely explained. I read all this information in designing data intensive application book. But few topics like memorable and sstables were still bit unclear to me. Got the whole idea now. Great stuff. Keep going

  • @Konservator69
    @Konservator69 Рік тому +3

    Good video. For the further topic development it'd be interesting to see a LSM tree vs Redis RDM/AOF persistence schema comparison.

  • @bobdinitto
    @bobdinitto Рік тому

    I've often wondered how NOSQL databases can achieve higher write throughput than relational DB. Thank you for sharing the techniques involved. Your explanation is clear and the graphics are excellent!

  • @tomtomsiesie5436
    @tomtomsiesie5436 Рік тому +1

    Another amazing video! This format is so vaiuable.

  • @carolinegr
    @carolinegr Рік тому +4

    One additional thing to mention (around ua-cam.com/video/I6jB0nM9SKU/v-deo.html perhaps) is that the writes are written to memory AND a transaction log to ensure durability. Otherwise whatever was in the MemTable will not persist after a crash. The transaction log can be replayed to rebuild the MemTable.

    • @ByteByteGo
      @ByteByteGo  Рік тому +5

      Thank you for the feedback.
      We did have that initially, but decided to take it out to focus on the LSM tree itself.
      We knew someone would bring it up, but didn't think it would take this long. 😂
      We are glad you did.

    • @ibgib
      @ibgib Рік тому +1

      I was actually wondering if this were a difference with what I understand of relational dbs. Thanks for pointing it out.

  • @arno.claude
    @arno.claude Рік тому

    This channel is such a gold mine!

  • @TheOnlyRealBreadIntheWorld
    @TheOnlyRealBreadIntheWorld 3 місяці тому

    amazing content, thank you for all the hard work you do!

  • @plussin2760
    @plussin2760 5 місяців тому

    LSM Tree에 대한 이해와 작동 방식에 대한 개요를 알 수 있었습니다. ㄳ 합니다

  • @modolief
    @modolief Рік тому +1

    Sublime, superb, excellent ... again.

  • @balaclava351
    @balaclava351 Рік тому

    Great video. I'm a junior dev that has to implement a chat feature in the next few weeks. This really helped me understand NoSQL. Thanks.

  • @sahilchadha9621
    @sahilchadha9621 14 днів тому

    Please create a video on size tiered and level compaction strategy as well

  • @eliyahubasa9401
    @eliyahubasa9401 Рік тому +1

    Great Content, thank you very much.
    I'm already waiting for Tuesday for a new video, as much as I wait for a new One Piece episode.

  • @shakedrosenblat1925
    @shakedrosenblat1925 7 місяців тому

    Thank you. great video, as always.
    I'd like if you guys could go into more detail

  • @pashachechehov3483
    @pashachechehov3483 Рік тому

    Great visualization of "Designing Data-Intensive Applications" book

  • @StephenGillie
    @StephenGillie Рік тому

    Very cool. Could have
    - one process just taking DB writes and putting them in memory
    - another writes too-big variables to files on disk
    - next would go through files and flatten them (like continuous truncate/shrink on a transaction log)
    - last would take DB reads and go through the memory, bloom filters, and file structure to find and return the requested data.

  • @sampleshawn5380
    @sampleshawn5380 Рік тому +1

    Man this video is awesome, so much information loved it!

  • @ibgib
    @ibgib Рік тому

    Like many have said, this is a great dense video at a good beginner depth for people like me. I hope there will be a relational db video that complements this if at all possible, since the quality level is so high 🤞

  • @singhsaubhik
    @singhsaubhik Рік тому +2

    This is an awesome overview of LSM tree. If someone wants dig deeper read "Design Data intensive applications".

  • @arunkutube
    @arunkutube 5 місяців тому

    great explanation for beginners

  • @shoobidyboop8634
    @shoobidyboop8634 Рік тому

    Stuff like this is the future of many forms of education.

  • @obedgennius1401
    @obedgennius1401 Рік тому +1

    I really appreciated these videos !! Thanks you very much
    I would to know what software are you using to produce such presentation 🙏

  • @KostiantynKostin
    @KostiantynKostin Рік тому +2

    6:56 correction regarding Bloom Filter
    If the Key doesn't exist, bloom filter will NOT return False Negative
    If the Key doesn't exist, bloom filter MAY return False Positive
    In other words.
    If bloom filter says there's NO key in page, there's definitely no key in page.
    If it says that there's the key in page, there MIGHT be the key in page with some probability.
    This probability data structure helps to reduce unnecessary reads.

    • @Winnetou17
      @Winnetou17 Місяць тому

      That's exactly what is said in the video

  • @doxologist
    @doxologist Рік тому

    Perhaps the best educational systems content on the whole of UA-cam right now. Great stuff

  • @AndyThomasStaff
    @AndyThomasStaff Рік тому +1

    Great explanation, thank you, this helped reinforce my learnings about LSM-trees from reading. The graphics were especially helpful

  • @aus10d
    @aus10d Рік тому

    Very interesting! Loved this video

  • @GyroCannon
    @GyroCannon Рік тому

    Not at all a question I had, but glad I watched because I extensively use Mongo for my own app

  • @sophiiisticated
    @sophiiisticated Рік тому

    I think the movie is recommended to left 30 seconds instead of 10s for the ending credit because the next suggestion video covers (in the left bottom side) in the summary time

  • @AllOfMyWat
    @AllOfMyWat Рік тому

    I can't wait for my next systems design interview!

  • @roct07
    @roct07 11 місяців тому

    This is so high quality. Thank you :)

  • @ThangNguyen-je8kv
    @ThangNguyen-je8kv Рік тому

    The next step after watching this video is to … watch it again 😂. Awesome as always.

  • @alamelu85
    @alamelu85 Рік тому

    Alex - Your lectures and contents are great assets to software engineering community.

  • @jiamingliu192
    @jiamingliu192 Рік тому

    Absolutely amazing. Thank you for making the video!

  • @paramvirsingh5640
    @paramvirsingh5640 Рік тому

    Looking at how beautifuly my tired brain understood this, this video deserves a noble prize.

  • @kwalter_6557
    @kwalter_6557 Рік тому +3

    You’ve been hitting it out of the park with these videos! Really enjoy the content.
    6:50 “(bloom filters) return a firm no if a key does not exist”. That’s not quite right. Bloom filters returns a firm yes if a key exists. If a key does not exist, it might still return yes with a low probability.

    • @santoshjoshi3396
      @santoshjoshi3396 Рік тому +3

      He is correct .. false positive is possible but not false negative

    • @kwalter_6557
      @kwalter_6557 Рік тому +2

      That’s exactly right!
      The subtlety here is in his claim that
      1. Key does not exist -> returns no
      2. Key “might” exist -> probably returns yes
      Whereas bloom filters really guarantees
      1. Returns no -> key does not exist
      2. Returns yes -> key probably exists
      i.e. “bloom filters returns a firm no (only) if a key does not exist”

    • @KostiantynKostin
      @KostiantynKostin Рік тому

      You are correct. Video didn't communicate properties of Bloom Filter well.

    • @yankhonski
      @yankhonski Рік тому

      in other words, a query returns either "possibly in set" or "definitely not in set".
      definitely not in set -> if the key does not exists, 100 % no answer.
      en.wikipedia.org/wiki/Bloom_filter#:~:text=A%20Bloom%20filter%20is%20a,a%20member%20of%20a%20set.

    • @Winnetou17
      @Winnetou17 Місяць тому

      What is wrong with you people ? That's exactly what's said in the video. And much simpler than what you try to explain. Am I missing something ? There's other comments like this too, arguing that somehow the video is incorrect, when in fact it is, or at least what they say is the same as what the video says.
      Which is if the bloom filter says there's no key, you can trust that and skip to the next level.

  • @thelonearchitect
    @thelonearchitect Рік тому +4

    Thanks for the video. Your explanation rises a concern to me : since the memtable is in memory, what happens if the server crashes before flushing ?
    Is that memtable distributed or replicated ?

    • @lwfeagan
      @lwfeagan Рік тому +2

      Cassandra, for example, still has a write ahead log.

    • @nitinagrawalbst
      @nitinagrawalbst Рік тому +1

      Generally for the memory table write ahead log is maintained. Once the memory table is moved to create SSDTable write ahead log is deleted. In case of crashes write ahead log can be used to restore the memory table.

  • @peepeepoopoo2243
    @peepeepoopoo2243 Рік тому +1

    Great video!

  • @mnchester
    @mnchester Рік тому +1

    amazing video!

  • @jiankuang9890
    @jiankuang9890 18 днів тому

    Question regarding the books: volume 1 and volume 2 are considered as two separate editions for the same content or the same edition for different contents? Another way to ask: Should I but both volumes or just volume 2?

  • @galeop
    @galeop Рік тому

    Amazing video
    1:15 what is that "object key" ? The row key that is being edited/added to the keyspace ?

  • @BiggRanger
    @BiggRanger Рік тому

    Excellent presentation!

  • @MostafaZeinali
    @MostafaZeinali Рік тому

    Thank you for the great video. Keep up the good work. ❤

  • @abhijit-sarkar
    @abhijit-sarkar Рік тому

    How is a key-value actually stored and retrieved from disk? Although a SSTable is sorted within itself, since the keys in different SSTables are not related, how does a read request find a key? Surely not by trying all SSTables one by one, that'd be too slow.

  • @caseyspaulding
    @caseyspaulding Рік тому

    Thank you!

  • @RandomNullpointer
    @RandomNullpointer 2 місяці тому

    Any idea how Domino databases work in this regard? Do they follow the same concepts? Is the compaction process "Tiering" or "Leveling"?

  • @DarknessGu1deMe
    @DarknessGu1deMe Рік тому

    What would be a good example of an application benefiting from "fast write slow read" property of NoSQL DB? Based on what's presented, I'd say most user-facing application, like a typical service a startup would build (e.g. personal calendar organization, etc) doesn't sound like a good fit given reads are pretty important in user-facing traffic.

  • @wave9303
    @wave9303 Рік тому

    hi , can you do a video on how Splunk is use for devops and how it storing its data ?

  • @jrabelo_
    @jrabelo_ Рік тому

    perfect explanation, thanks

  • @javisartdesign
    @javisartdesign Рік тому

    very well explained. Thanks

  • @chaoluncai4300
    @chaoluncai4300 Рік тому

    this is brilliant! I'm also wondering would this amount/level of knowledge for an advanced DS is enough for tech/system design interview? Obviously I think the interviewer won't ask for implementation so... ig im trying to know how deeper do we need to go than e.g. this channel's few minutes videos?

  • @willl0014
    @willl0014 Рік тому

    So much knowledge!!!

  • @rbelatamas
    @rbelatamas Рік тому

    great explanation ❤

  • @faris_id_music
    @faris_id_music Рік тому

    one of the best videos so far

  • @ziakhan-tk7rk
    @ziakhan-tk7rk Рік тому +4

    What software do you use for animation in your videos?
    I am very curious

    • @wrondonparticual5113
      @wrondonparticual5113 Рік тому

      I want to know as well. It is perfect!!!

    • @biwer-r
      @biwer-r Рік тому

      @@wrondonparticual5113 Maybe this is something :-) ua-cam.com/video/H5GETOP7ivs/v-deo.html

  • @abhinavsingh4221
    @abhinavsingh4221 Рік тому +2

    Why DynamoDB uses B-Tree?

    • @JohnDoe-my5ip
      @JohnDoe-my5ip Рік тому

      DynamoDB is highly optimized for low latency reads.

  • @aaronprindle385
    @aaronprindle385 Рік тому

    Awesome work, thanks for this!

  • @noahgsolomon
    @noahgsolomon 9 місяців тому

    awesome explanation

  • @paddyd7642
    @paddyd7642 Рік тому +1

    Thank you! When you say sorted, is it by some object id or time?

    • @big0bad0brad
      @big0bad0brad Рік тому +2

      Object ID, unless you design the system to use a high precision timestamp as the id, which could maybe be an interesting idea. If the Object ID is a timestamp, then object creations are already sorted which could further boost write performance, though there is probably no advantage in the case of updates or deletes.

  • @hamsalekhavenkatesh3440
    @hamsalekhavenkatesh3440 Рік тому +1

    Amazing!

  • @BuyHighSellLo
    @BuyHighSellLo Рік тому +2

    How does Mongodb compare to the two types explained in the video?

    • @ckirkyg
      @ckirkyg Рік тому +2

      This was my question as well. Does it also use an lsm tree or some other approach

  • @swaroopas5207
    @swaroopas5207 Рік тому +1

    When searching why does it need to search at all levels? It needs to search at the latest level right? Can someone help me understand this.

    • @karolinaa9478
      @karolinaa9478 Рік тому

      The keys, when compacted, are deleted from their original level, and added to the next level.

  • @thisisnotok2100
    @thisisnotok2100 Рік тому +1

    yeah I freakin love this channel

  • @dhpark7509
    @dhpark7509 Рік тому

    great explanation!

  • @chandragie
    @chandragie Рік тому

    I don't really understand them, but it's cool!

  • @KristjanKask
    @KristjanKask Рік тому

    Awesome video! I'd love to know how densely ordered vectors for database storage work next :)

  • @mohawkgwai
    @mohawkgwai Рік тому

    Cassandra also has Leveled Compaction Strategy so that slide comparing it to RocksDB is a little misleading

    • @raphaelcarvalho4288
      @raphaelcarvalho4288 Рік тому

      Cassandra initially had size tiered only and later borrowed leveled from RocksDB to solve the space amplification problem, so it's not completely misleading.

  • @axa993
    @axa993 Рік тому

    Awesome overview.

  • @adamaiken00
    @adamaiken00 Рік тому

    This is a great video. If I want ti go further is there any good reference for nosql?

  • @junkahoolik
    @junkahoolik Рік тому

    relational databases don't use btrees, they use b+ trees. the only db i know of that uses btrees is actually mongodb.

  • @raviv5109
    @raviv5109 Рік тому

    Thank you so much!

  • @vikasgupta1828
    @vikasgupta1828 Рік тому

    Thanks

  • @sanzharsuleimenov6380
    @sanzharsuleimenov6380 Рік тому

    Great explanation!

  • @allisonmachado
    @allisonmachado Рік тому

    awesome video indeed! thank you

  • @ethanmye-rs
    @ethanmye-rs Рік тому

    Thanks! One of the things I find difficult to find good information on is structuring data. Given I want these x properties, how do so arrange the information to get them, and what technologies are required to do it.

  • @VirgoBoy2991
    @VirgoBoy2991 Рік тому

    Hi, i just wonder why the second volume has not been available in kindle yet :( i'm from Vietnam and shipping is expensive

  • @AmrishPandey
    @AmrishPandey Рік тому

    This is amazing video

  • @Chauhannitin
    @Chauhannitin Рік тому

    Very good animation