AWS Tutorials - AWS Glue Pipeline to Ingest Multiple SQL Tables

  • Published 22 Jul 2024
  • Code Location - github.com/aws-dojo/analytics...
    There are scenarios where data from multiple SQL tables has to be ingested into the data lake. This raises the question of whether to use an individual Glue job and pipeline per table or a single Glue job and pipeline for all of them. This tutorial discusses the trade-offs in detail and demonstrates the single-pipeline, single-job scenario.
  • Science & Technology
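
The single-job approach described above can be pictured as one parameterized job that is started once per table. What follows is a minimal sketch under stated assumptions, not the code from the video: the table names, the bucket, the argument keys (`--source_table`, `--target_path`) and the job name `ingest-sql-table` are all hypothetical.

```python
from typing import Dict, List


def build_job_arguments(tables: List[str], target_bucket: str) -> List[Dict[str, str]]:
    """Return one Glue job-argument dict per source table.

    The argument keys are hypothetical; a Glue script would typically
    read them with awsglue.utils.getResolvedOptions.
    """
    runs = []
    for table in tables:
        runs.append(
            {
                "--source_table": table,
                "--target_path": f"s3://{target_bucket}/raw/{table}/",
            }
        )
    return runs


if __name__ == "__main__":
    # Hypothetical table names and bucket, for illustration only.
    for arguments in build_job_arguments(["customers", "orders"], "my-data-lake"):
        # With boto3, each dict could be passed as
        # glue.start_job_run(JobName="ingest-sql-table", Arguments=arguments)
        print(arguments)
```

Keeping the per-table differences in the arguments rather than in the script is what lets one job definition serve every table; the orchestrator (Glue Workflow or Step Functions) only has to iterate over the list.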

COMMENTS • 17

  • @terrcan1008
    @terrcan1008 2 years ago +3

    Thanks for the great content. The videos are relatable to real-world problems, which is great. Looking forward to more like this, and if possible, please put all these steps on your website as well, so they are easy to compare during practice sessions. 😀

  • @durgarasane-kolapkar1842
    @durgarasane-kolapkar1842 1 year ago

    very nice explanation and implementation... thank you so much !

  • @sonynavi
    @sonynavi 2 years ago

    Nice content. A video on CDC would be great!

  • @khandoor7228
    @khandoor7228 2 years ago +1

    Great content from new sub. Please do more big data stuff!

  • @cassianocalimansantos5258
    @cassianocalimansantos5258 1 year ago

    Hello sir.
    Do you have any content on how to ingest from an external DB in the Glue ingestion job using a VPC (for example, using a connection to a MySQL or SQL Server data source instead of an AWS Redshift source)?

  • @nagarjunau1803
    @nagarjunau1803 1 year ago

    Hi, if orderdata fails to write to the destination, will the others fail too, or does the flow keep running?

  • @deveshv9273
    @deveshv9273 5 months ago

    How do you create a parameterized AWS Glue job for CDC ingestion? In that case the job runs continuously, every 5 minutes, to update the data (or do an upsert). Is there a way to do upserts in a generic (or parameterized) way?

  • @debaratiaich16
    @debaratiaich16 2 months ago

    Is it more cost-effective to have a single job that runs multiple times, or multiple jobs that each run once?

  • @mohammedkandelhassan
    @mohammedkandelhassan 2 years ago +1

    Superb👍

  • @veerachegu
    @veerachegu 2 years ago +2

    Can you please do a concurrent run on a workflow as well?

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  2 years ago

      Yes, both Glue Workflow and Step Functions can run concurrently.

  • @Nobrezando
    @Nobrezando 2 years ago

    I created a Step Function just like yours, but my Step Function is running forever.

    • @AWSTutorialsOnline
      @AWSTutorialsOnline  2 years ago +1

      Not sure why. Did you refresh the browser?

    • @Nobrezando
      @Nobrezando 2 years ago +4

      @AWSTutorialsOnline I found out: when you create the IAM role for Step Functions, you also need to grant awsglue:getjobstatus in the policy.
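
The action name quoted in the comment above does not appear to match a documented Glue IAM action. For the synchronous Step Functions integration (`glue:startJobRun.sync`), the actions listed in the AWS documentation are, as far as can be confirmed, the four below. This is a minimal sketch of such a policy built in Python; the `"*"` resource is a placeholder assumption and would normally be narrowed to specific job ARNs.

```python
import json


def step_functions_glue_policy(resource: str = "*") -> dict:
    """Build an IAM policy document for running Glue jobs synchronously.

    The resource default is a placeholder; restrict it in real use.
    """
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": [
                    "glue:StartJobRun",     # start the job
                    "glue:GetJobRun",       # poll the status of one run
                    "glue:GetJobRuns",      # list runs of the job
                    "glue:BatchStopJobRun", # stop runs if the execution is aborted
                ],
                "Resource": resource,
            }
        ],
    }


if __name__ == "__main__":
    print(json.dumps(step_functions_glue_policy(), indent=2))
```

The status-polling actions are the ones most relevant to the symptom in this thread, since the state machine has to read the job run state to know when the task has finished.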