Set Up and Use Apache Iceberg Tables on Your Data Lake - AWS Virtual Workshop

Поділитися
Вставка

КОМЕНТАРІ • 14

  • @che5ari
    @che5ari Рік тому +1

    Thanks for this very clear presentation on more of the details of Iceberg. Whilst there are lot of talks about Iceberg they gloss over the details which are quite important for those who need them.

  • @AnNguyen-en3tz
    @AnNguyen-en3tz 6 місяців тому +1

    thanks. easy to understand and follow it
    that saved my day

  • @user-uf7ie5pt9e
    @user-uf7ie5pt9e 7 місяців тому

    Hi, excellent video about iceberg. I have a question, i have a datalake with many parquet files and i want to use iceberg tables. what is the correct way to deals with this parquet files, do i read all parquet files and insert data into iceberg table? or is there any to link iceberg table to existing parque files without copy then into iceberg table?

  • @anandsharma213
    @anandsharma213 Рік тому

    Lovely presentation. Thanks for sharing!

  • @senro3960
    @senro3960 Рік тому

    When you add a new column for instance, it create a new snapshot and you can query the snapshot you want. But how performant is it ? Let's say our team use iceberg and over a year, 1000 snapshots were created, with some time the create of a new column added or the deletion of another.
    If the snapshots store the transactions, does it means that when we are going to query the first snapshot, it reapply all the 1000 modifications done, and then query this version of the table ? Or does it create new data file each time that copy our table with the modification ?

  • @hariporandla8044
    @hariporandla8044 Рік тому

    great information. very clear demo. thanks

  • @amirabraham100
    @amirabraham100 Рік тому

    excellent presentation !

  • @tranminhhaifet
    @tranminhhaifet Рік тому

    thank you, very clear and easy to understand

  • @nagusameta366
    @nagusameta366 Рік тому

    I created iceberg tables inside an EMR notebook, and while they do show up in Athena, the columns do not load. When I went to view the table in Glue, well the columns are also not there. Why does this happen? I can only interact with the table within the Spark session, but in Athena or in Glue, it's just an empty table with the name but no columns nor the data.

    • @awssupport
      @awssupport Рік тому +1

      Sorry about this inconvenience you've faced here. I recommend reaching out via our re:Post forum and posting your question there for more visibility & insight from our tech community. You can do that via this link: go.aws/aws-repost. ^BG