S2024 #03 - Data Formats & Encoding Part 2 (CMU Advanced Database Systems)
Вставка
- Опубліковано 28 січ 2024
- Andy Pavlo (www.cs.cmu.edu/~pavlo/)
Slides: 15721.courses.cs.cmu.edu/spri...
Notes: 15721.courses.cs.cmu.edu/spri...
15-721 Advanced Database Systems (Spring 2024)
Carnegie Mellon University
15721.courses.cs.cmu.edu/spri... - Наука та технологія
Thanks for this! I've always found the semistructured stuff hard to understand. I just want to point out, though, that the example in the referenced paper for shredding has different values in the columnar decomposition. In particular, for value 'en' in Name.Language.Code, the repetition level is 2, because it is a repetition of the 2nd repeated field (according to the paper).
Great lecture! What would it take for a new file format becomes mainstream? Parquet/ORC are so popular, is it possible for a new format to rise?
Thank you.
The Link for the notes are not there