ETL | Incremental Data Load from Amazon RDS MySQL to Amazon Redshift Using AWS Glue | Datawarehouse

Поділитися
Вставка
  • Опубліковано 4 лис 2024

КОМЕНТАРІ • 23

  • @liubrian6843
    @liubrian6843 29 днів тому +1

    In your case, in each job run, it will grab all data(instead of new data only) from the rds table to redshift and then do the merge. Let's say the table is very big ---over several hundreds of Gigbytes, the operation will be very expensive. Correct? Can you add a sql filter transformation step in between to grab only the new data changed since the last job run, so that only the new data is merged?

    • @cloudquicklabs
      @cloudquicklabs  26 днів тому

      Thank you for watching my videos.
      Indeed , I shall make videos on this point.
      This real very good input.

  • @JothiLakshmi-j7v
    @JothiLakshmi-j7v 3 місяці тому +1

    Thank you so much for the session. Its really helpful for the beginner like me..

    • @cloudquicklabs
      @cloudquicklabs  3 місяці тому

      Thank you for watching my videos.
      Glad that it helped you.

  • @abhishekanand2782
    @abhishekanand2782 3 місяці тому +1

    The reason it got appended into the target table is because, the "Matching Keys" involves all of the column. Had it been just the "industry_name_anzsic" in matching keys. It would have updated it. Actually, I think you assumed that the just the leftmost column is the Matching key which happens most of the time as left is usually the primary key column and we do merges and joins on it. Hence, This was a honest mistake happened due to old habits. Old habits die hard.

    • @cloudquicklabs
      @cloudquicklabs  3 місяці тому

      Thank you for watching my videos.
      It's built on capability for Glue that I have used. But I am happy to explore more about it.

  • @preetybisht6618
    @preetybisht6618 5 місяців тому +1

    hello, will it move the whole data from rds to Redshift or only a copy of rds data to Redshift?

    • @cloudquicklabs
      @cloudquicklabs  5 місяців тому

      Thank you for watching my videos.
      It's copy of Data will be moved not back and lift and shift.

  • @udaynayak-o4f
    @udaynayak-o4f 8 місяців тому +1

    much thankful video, can you please share the script/code which was generated in etl glue

    • @cloudquicklabs
      @cloudquicklabs  8 місяців тому

      Thank you for watching my videos.
      Glad that it helped you.
      This time I did not collect the scripts.
      But if you follow the scenarios as explained in video , you would getting required script there.

  • @senhuayu
    @senhuayu 6 місяців тому +1

    Can we do the opposite way, that is, load data from Redshift to RDS postgreSQL? I tried but it doesn't work. Can you make it work and make a video?

    • @cloudquicklabs
      @cloudquicklabs  6 місяців тому

      Thank you for watching my videos.
      It's unique requirement though , I shall try creating a videos on this soon.

  • @akshaygarg9150
    @akshaygarg9150 9 місяців тому +1

    please tell me what are the policies you have attach in iam role

    • @cloudquicklabs
      @cloudquicklabs  9 місяців тому

      Thank you for watching my videos.
      As this is a demo video I am using 'admin' access which is not recommended in production.

    • @akshaygarg9150
      @akshaygarg9150 7 місяців тому

      policies in iam role as i am facing timeout error in aws glue@@cloudquicklabs

  • @ashishkamble2530
    @ashishkamble2530 3 місяці тому +1

    hi brother
    im able to collect data one by one through table but when im trying to establish connection through crowler its says unable to connect or establish connection then its unable to connect is that possible to add all tables at a time

    • @cloudquicklabs
      @cloudquicklabs  3 місяці тому

      Thank you for watching my videos.
      There could be multiple reasons like below.
      1. Check if vpc endpoints for rds
      2. Check if inbound security group has required ports enabled here.
      3. Check if credentials are correctly provided.

  • @faisalmali3809
    @faisalmali3809 8 місяців тому +1

    Please make a video on the pyspark script.

    • @cloudquicklabs
      @cloudquicklabs  8 місяців тому

      Thank you for watching my videos.
      Indeed I shall make pyspark script videos

  • @tataatreya
    @tataatreya 3 місяці тому +1

    How to get classes ?

    • @cloudquicklabs
      @cloudquicklabs  3 місяці тому

      Thank you for watching my videos.
      I don't take classes but help through my videos , let me know if you have any topic to cover in videos.

  • @dimba8119
    @dimba8119 10 днів тому +1

    That clicking sound from windows 98 is very distracting.

    • @cloudquicklabs
      @cloudquicklabs  10 днів тому

      It was my old system.
      New videos are being built from new Windows 11.
      Hope you Appreciate it.