DP-203: 07 - Common file types (PARQUET)

Поділитися
Вставка
  • Опубліковано 30 лис 2024

КОМЕНТАРІ • 30

  • @pawegambus6083
    @pawegambus6083 10 місяців тому +16

    I can't state enough how helpful your videos are. I'm preparing for my certification and the way you explain the concepts is just out of this world. I really like the fact that you use your own notes and explain the ideas that led to building particular tools and processes happening behind the scenes.
    A lot of the concepts have been really abstract to me so far and, although I could explain them, I could not really "feel" them. It might sound silly in the context of IT topics but that's the way I perceive it.
    After listening to you, I feel like having physical tools which I know how to use and also how not to use. You gave me a whole lot of confidence. Thank you so much for your help and I keep my fingers crossed for your channel!

    • @TybulOnAzure
      @TybulOnAzure  10 місяців тому +9

      Wow. If I ever lost a motivation to record new videos, I'll just get back to this comment. Thanks!

  • @smbs47
    @smbs47 Рік тому +6

    Just finished. Looking forward to 08!

  • @prabhuraghupathi9131
    @prabhuraghupathi9131 8 місяців тому +4

    Earlier used to know only the Parquet format as just one of file format and now got to know how powerful to use it for analytics purpose!! Really helpful!

  • @PamTiwari
    @PamTiwari 2 місяці тому +1

    I can't express my gratitude in words to you, it was very in-depth knowledge about parquet file format and so valuable.

  • @melvin9993
    @melvin9993 7 місяців тому +1

    Your videos are like a big reference. Detailed and clear. Thank you for the time you put in for this series.

  • @fervabatool1037
    @fervabatool1037 5 місяців тому +3

    keep it up!! very sincere teacher. You love to pick the things from root that is very important.

  • @AkshayKokare-ci4lv
    @AkshayKokare-ci4lv 9 місяців тому +1

    Thank you!! Really appreciate for the knowledge you are sharing.
    Please keep posting more videos ☺

  • @onghuiling6937
    @onghuiling6937 3 місяці тому +1

    Really informative and good channel! Recommend to everyone who would like to kickstart the journey as an Azure Data Engineer!

  • @xelilxelilli2011
    @xelilxelilli2011 4 місяці тому +2

    Really great material , everything is explained with a lot of care. I am sure if you will be consistently uploading material this channel will grow. Bardzo panu dziękuję za wszystko :)

  • @pavans4014
    @pavans4014 15 днів тому

    never thought you would go so deep in subjects....... your a addiction

  • @geetanjali-xh4zz
    @geetanjali-xh4zz 8 місяців тому +3

    Hi piotr thanks for posting the videos. These are super helpful . I am waiting for more videos like these on stream analytics , event hubs ..

    • @TybulOnAzure
      @TybulOnAzure  8 місяців тому

      It will be there but I want to finish batch processing first.

  • @SarangKulkarni-b9i
    @SarangKulkarni-b9i 3 місяці тому +1

    Honestly, Idk if any other tutor has got me hooked up to a course like this. Damn you are amazing. I wanna cover everything and get my certificate asap!

  • @EternalEvanesce
    @EternalEvanesce Місяць тому +1

    Thank you for making this video! Are columnstore indexes similar to hybrid storage?? Minus metadata and dictionary encoding 😅

    • @TybulOnAzure
      @TybulOnAzure  Місяць тому +1

      Yes, the concept is pretty much the same.

  • @SAJO91
    @SAJO91 Місяць тому +1

    why the hybrid storage stores data in chunk of groups?
    can't we just index data row by row so it will be more optimized than grouping(we will make sure that we will read the needed rows only)?

    • @TybulOnAzure
      @TybulOnAzure  Місяць тому

      It depends on type of queries that are run on the data. If there are also WHERE predicates used (and very often they are), then row groups will be faster as optimizer can eliminate the whole row group.

  • @PumpkinCigar
    @PumpkinCigar Рік тому +1

    Parquet files are really powerful

  • @darshakkumarranpariya867
    @darshakkumarranpariya867 Рік тому +2

    Operation Parquet - The Untold Truth About Parquet :)

  • @leonardomelia92
    @leonardomelia92 9 місяців тому

    HI, with parquetViewer can i explore the domain (distinct values) of each field? i would need a solution with that

    • @TybulOnAzure
      @TybulOnAzure  9 місяців тому +1

      I don't think so. Take a look at documentation: github.com/mukunku/ParquetViewer/wiki/Running-Queries

  • @zouhair8161
    @zouhair8161 11 місяців тому

    hello thanks for your efforts, can you make a video about azure blobs please because it seems more comlicated than ADLS

    • @TybulOnAzure
      @TybulOnAzure  11 місяців тому

      What do you mean by Azure blobs?

    • @zouhair8161
      @zouhair8161 11 місяців тому

      @@TybulOnAzure in azure torage account you can choose to store data as blobs and you have choise between blocks blobs, append blobs and page blobs and you can create by example python app that consume api client library for uploading files as blobs container and downloading them. in brief i have some imbiguity in this topic

  • @LATAMDataEngineer
    @LATAMDataEngineer 8 місяців тому

    Excellent explain thanks.