How to import CSV file from Amazon S3 to Redshift using AWS Glue Jobs

Поділитися
Вставка
  • Опубліковано 23 лип 2024
  • In this video, I teach you how to create an AWS Glue Job to perform the transfer of CSV files from Amazon S3 to Amazon Redshift.
    In the next video I tell you how to connect this database on Redshift to Power BI.

КОМЕНТАРІ • 23

  • @yadi4diamond
    @yadi4diamond 8 місяців тому

    Very much helpful , simple, short and covered all details. Thank you!

  • @asishb
    @asishb 10 місяців тому +1

    Very very underrated video. I hope it reaches everyone.
    A question - what is the use of the classifier ?

  • @antonietapintorebelo5020
    @antonietapintorebelo5020 Рік тому +1

    Thank you so much! This was a very helpful video. Keep it up!

    • @cafeconlechecodes
      @cafeconlechecodes  Рік тому +1

      Hello there! I'm really glad that I helped you. Thanks for your kind words!

    • @denisgomonov2320
      @denisgomonov2320 Рік тому

      @@cafeconlechecodes Hey !
      What is the difference between doing this via Glue or DataPipeline, can both be automated in the same fashion?

    • @cafeconlechecodes
      @cafeconlechecodes  Рік тому

      @@denisgomonov2320 Hello! The main difference for me is that with Glue you "only" focus on the ETL job whereas DataPipeline allows you to manage even the smallest instance details (like VPC in EC2 for example).
      Yes, both can be automated to do the same job.
      Fun fact: I have even replaced Glue with only Lambda for some specific tasks.

  • @rajatpathak4499
    @rajatpathak4499 Рік тому

    great job

  • @rajatpathak4499
    @rajatpathak4499 Рік тому

    if you can add more videos with some transformation in glue then load data, and some workflow setup that would be great

  • @alexissebastian7614
    @alexissebastian7614 7 місяців тому

    how can you assign a dynamic name in glue??

  • @roshinidrakshapally5011
    @roshinidrakshapally5011 8 місяців тому

    Hi, I am getting an error "An error occurred while calling o128.pyWriteDynamicFrame. Exception thrown in awaitResult:" when I run the job. Please help me with this. Thanks in advance!

  • @neon_horizon
    @neon_horizon 11 місяців тому

    I got an error when creating crawler-redshift. I found VPC S3 endpoint validation failed in cloud watch log.

    • @asishb
      @asishb 10 місяців тому

      You need to add a VPC endpoint connection - repost.aws/knowledge-center/glue-s3-endpoint-validation-failed

  • @AmolModhekar
    @AmolModhekar Рік тому

    Can you tell me if it is we have 1000 table in S3 then how we dealing with redshift special schema issue if possible suggest proper way to resolve this

    • @cafeconlechecodes
      @cafeconlechecodes  Рік тому +1

      Hello. If you have 1000 csv in S3, you can run a first crawl to enable the crawler to record the initial schema. If every time you run a Job, you have 1000 new csv files, then I suggest setting up your crawler for New files only.

    • @AmolModhekar
      @AmolModhekar Рік тому

      @@cafeconlechecodes thanks for reply

  • @user-fb2ly9gd2c
    @user-fb2ly9gd2c Рік тому

    Am not able to have successful connection established to redshift. Please help?

    • @cafeconlechecodes
      @cafeconlechecodes  Рік тому

      Hello. Which error are you getting? And also tell me in which part of the video you are getting the error.

  • @SreevishakhBC-yo4kv
    @SreevishakhBC-yo4kv Місяць тому

    Can you send that CSV file ?

  • @fawadh
    @fawadh Рік тому

    I'm Having an error in a job!
    Can't fix it!
    can you help?
    An error occurred while calling o131.pyWriteDynamicFrame. Exception thrown in awaitResult:

    • @cafeconlechecodes
      @cafeconlechecodes  Рік тому

      Please, tell in which part of the video you are getting the error. I guess it's in the Glue Job?