The Future of Data Engineering

  • Published 14 Jan 2025

COMMENTS • 61

  • @thedatajanitor9537
    @thedatajanitor9537  2 years ago +17

    Stop learning the wrong shit. There are no entry-level jobs in this space. Start focusing on a more entry-level career like data analyst.
    LogikBot has everything you need. You just need to do the work.
    www.logikbot.com

  • @andreyandrey852
    @andreyandrey852 6 months ago +3

    A great breakdown of the data engineer path. I’ve been a BI MicroStrategy developer for 2.5 years. We use a Snowflake warehouse. I'm considering switching to data engineering.

  • @tomastruchly9484
    @tomastruchly9484 2 years ago +25

    I've been working with Snowflake as a DE for the last 5 months and must say it's great. I worked with Oracle before and that was also great. A great job for good money, however not easy to get and even harder to do. Therefore I love it 😝

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +15

      The level of technical acumen needed keeps increasing.

    • @tomastruchly9484
      @tomastruchly9484 2 years ago +14

      Yes, therefore, as you already mentioned in previous videos, a person needs to focus on certain skills & technologies. E.g. choose 1 cloud provider from the big 3, choose ideally between Snowflake & Databricks (major modern DWH players; mastering both would be overkill and extremely difficult), learn Git (an absolute must), choose maybe 1-2 scripting languages (definitely recommend Python & JavaScript because of Snowflake). Also there are a ton of ETL tools; choose 1 and learn dbt! That's enough for a start (learning curve of 2-3 years to get a normal DE job if you are a maniac :D)

    • @scrotiemcboogerballs2133
      @scrotiemcboogerballs2133 1 year ago +1

      @@tomastruchly9484 Was your Oracle job based in the US? When I look at Oracle postings a lot of their data jobs are for India

    • @akindia8519
      @akindia8519 5 months ago

      @@thedatajanitor9537 hi. If we could hypothetically (not in real life, just taking an extreme scenario) assume that AI replacement is happening on a mass scale in the IT industry, then, *out of AI Engineer, ML Engineer, and Data Engineer roles, which will be less likely to be replaced?* I'm asking this because many people say that it's extremely hard (very rare) to build models better than AutoML, which could put even data scientist/ML Engineer roles at risk of automation in the future. Even though you might say that the ML Engineer role can't be replaced, I would still appreciate knowing which of Data Engineer and ML Engineer would be in a better position.

  • @mattiaslp9645
    @mattiaslp9645 2 years ago +9

    thanks data janitor. you always give top notch information. i’ll get right to it!

  • @palomarAI
    @palomarAI 1 year ago +1

    Good suggestions for considering niches.

  • @marciofernandes7091
    @marciofernandes7091 2 years ago +4

    You missed the streaming niche, the Kafka niche. Although I believe it will be a short-lived trend.
    Love your videos, no exception so far.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +2

      Thanks. Yeah, simply haven't seen a ton of jobs for them. Thanks for the compliment. Much appreciated.

    • @br.3250
      @br.3250 6 months ago

      Why a short-lived trend? Doesn't it have many applications?

  • @majedabdulla3739
    @majedabdulla3739 2 years ago +4

    Here in Egypt one man does data analysis and DBA work, and probably business intelligence too!

  • @theunrulycat
    @theunrulycat 2 years ago +5

    Hey Mike, I’m interested in your thoughts on the mass tech layoffs and their impact on the job market. I understand data-anything is growing as a field and “the jobs will always be there”, but there must be a lot of competition once the ex-FAANGers start looking for work again; the prospects for new grads and newbs in the data/tech space look pretty bleak, especially here in the Bay Area.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +3

      The prospects for new grads have always been bleak. Now, they're just more bleak. However, you aren't competing with someone from any top tech company as a recent grad. You are starting out much lower on the totem pole than any of these top tech types will.

  • @mklarso9570
    @mklarso9570 1 year ago +3

    DE or ML: which one is more promising? Which one pays better in the long run? Which one is more "top tier"? Which one would you suggest, setting aside the criteria of reports and meetings? Thanks!

    • @thedatajanitor9537
      @thedatajanitor9537  1 year ago +2

      It won't matter. It's about the same. There are more DE jobs. Both are top tier jobs that pay really well once you're skilled at them.

  • @Iva087
    @Iva087 1 year ago +2

    I hope you are very well, Mr. Mike. I've been following your answers on Quora for a long time; as soon as I realized you have a YouTube channel, I immediately subscribed. Mr. Mike, I am from Colombia, I am 19 years old and I am in my second year of computer science. I wanted to ask you a question: I have seen that the fields that really interest me are software engineering (Java backend, distributed systems and architecture) and data engineering. For you, which of the two roles could have the best future in the long term? Thank you very much for all your contributions.

    • @thedatajanitor9537
      @thedatajanitor9537  1 year ago +7

      The top job on earth is the data engineer. Google said nothing else is close. They said it will be the top job for decades to come.

    • @Iva087
      @Iva087 1 year ago +1

      @@thedatajanitor9537 Decades, great.
      Thank you so much!!

    • @arasukiasyan4808
      @arasukiasyan4808 1 year ago +1

      Hi Mike, I see you talk about data engineer as a job, but what about the versatility of starting your own business as a software engineer? Are you taking this into consideration as well?

  • @dexterslab7750
    @dexterslab7750 1 year ago +1

    It's now been 11 months since you posted this video.
    Do you still hold the same views you shared in it?

  • @philippebolduc4057
    @philippebolduc4057 2 years ago +2

    Hello sir! I would like to know more about your statement that data engineer is the top job in tech over ai engineer! This could also make a good video! Thanks :)

  • @1220MrCool
    @1220MrCool 2 years ago +2

    Hey Mike,
    As always, I am appreciative of your content. Thank you!
    I work as a faker scientist right now (as of 5 months), and I just passed AZ-900 (Fundamentals). I am considering going into Data Engineering within the next one to two years. I already work with data sourcing in SQL and use git to push and pull code that is being tested and/or developed. Besides this work experience and passing AZ-900, would AZ-104 (Administrator Associate) be a good Microsoft certification to aid in becoming a Data Engineer? I know Microsoft provides a dedicated DE certification, but is AZ-104 a good stepping stone to becoming a DE?

    • @ladistar
      @ladistar 2 years ago

      lmao faker scientist hahaha I can see Mike West is starting to rub off on you lol

    • @1220MrCool
      @1220MrCool 2 years ago

      @ladistar yeah lol. But I mean he has a point; DS tend to be applied statisticians who are not generally well-versed in software development and IT. So I am trying to learn as much as I can about IT through experience and certifications on technology I am or will be using.

    • @ladistar
      @ladistar 2 years ago +2

      @@1220MrCool nice! That's great to hear. I'm currently a reporting and data analyst for an insurance company, been doing analyst-type work for close to 5 years now and am now trying to break into data engineering. I'm currently studying for the GCP Data Engineer exam next month. Hope everything works out for you man.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +3

      Yes. Looks like you've chosen to focus on Azure. I do a lot of work on AWS but the best overall experience for you and me is Azure. The best cloud interface hands down. Microsoft is simply a better engineering company than Amazon.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +1

      @@ladistar Don't forget about the exam simulator. Half of the questions on the Google DE Cert are machine learning questions. Also, a ton on BigQuery.

  • @risaiahgamers5686
    @risaiahgamers5686 2 months ago +1

    How would we learn Snowflake?

    • @thedatajanitor9537
      @thedatajanitor9537  2 months ago

      Take a course on it. Start learning SQL. Learning SQL is more important than SF.

  • @darrenching8351
    @darrenching8351 2 years ago +3

    Hi Mike, what do you think of business intelligence as an entry-level career?

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +2

      It's not entry level at all but if you can go that route, hell yeah. Most BI jobs are senior level roles. BI is almost the same job as a machine learning engineer.

    • @darrenching8351
      @darrenching8351 2 years ago +2

      @@thedatajanitor9537 Interesting, what would your opinion be for Business Analyst?

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +4

      @@darrenching8351 Not technical. Most don't live in IT. Not a fan. :)

  • @ryansandy2433
    @ryansandy2433 1 year ago +1

    Most people say data engineering is a stressful job, would you also consider machine learning engineering to be stressful work or less so?

    • @thedatajanitor9537
      @thedatajanitor9537  1 year ago +8

      Both can be stressful. However, most of that stress comes when you don't know what to do. After you're all skilled up, it's not that stressful. There's always another job with your skill set!! :)

    • @ryansandy2433
      @ryansandy2433 1 year ago

      Thanks, so after you learn the tools you use, it’s less stressful. But are the toolsets constantly evolving and growing?

  • @dingding4898
    @dingding4898 2 years ago +1

    How to find the right niche?

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +2

      You spend a few years in an entry-level role and learn the ropes. Then you decide what you want to do. Data engineering is a top-tier role; there are very few to zero entry-level roles.

  • @emrec.7433
    @emrec.7433 2 years ago +1

    Mike, do you have a recommended course/book for studying system design?
    Topics such as real time data analysis with distributed computing are very confusing.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +1

      Real time data analysis just means the data is being replicated from production to another environment like Snowflake. There are tools like Fivetran to do this. With the advent of the cloud, we don't really worry about distributed computing.

    • @emrec.7433
      @emrec.7433 2 years ago +1

      @@thedatajanitor9537 What do you think about Open Source?
      Cloud makes a lot of things really easy. However, it was the Open Source-based tools that I specifically mentioned here.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago

      @@emrec.7433 Sorry. I'm not understanding the question. What open source tools do you mean?

    • @emrec.7433
      @emrec.7433 2 years ago +1

      For example, I'm talking about end-to-end architecture examples that go like Kafka-Flink-Kubernetes-Neo4j... architectures where a lot of technologies are used together.
      Articles about how such an architecture is designed and built to fit the problem seem very confusing.

    • @emrec.7433
      @emrec.7433 2 years ago +1

      @@thedatajanitor9537 Example article on Medium: Airbnb System Architecture
      Look at the image about Hadoop, Cassandra, Redis, Kafka

  • @patrickchan2503
    @patrickchan2503 1 year ago

    Is data engineering repetitive and therefore boring? The data cleaning exercises I'm doing seem mundane. Hopefully the course will get more exciting. What are the most exciting DE tasks in your opinion? Thanks.

  • @ManPursueExcellence
    @ManPursueExcellence 2 years ago

    So, become a Data Analyst for a few years. While working as a Data Analyst, learn Snowflake and become a Snowflake Data Engineer. You said Snowflake works with Azure, AWS, and GCP.
    *Does that mean I have to know ALL 3 of those cloud providers first if I want to go down the Data Engineer Warehouse specialty?*

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +7

      You can't really know all three. Simply too much shit to know. You pick one, I'd suggest AWS if you are going the SF route. After you learn basic AWS stuff, you only focus on the data movement tools.

    • @ManPursueExcellence
      @ManPursueExcellence 2 years ago +1

      @@thedatajanitor9537 Ok. Interesting.

    • @ManPursueExcellence
      @ManPursueExcellence 2 years ago

      @@thedatajanitor9537 Following up on our last convo here, would you get the AWS Big Data Certification, and then the Snowflake premium Certification (assuming I’m taking the SF Data Warehousing route)?
      In your video “Top 3 Data Engineering Certifications,” you mention the AWS cert, but you have a problem with it because it’s not strictly DE. It’s Big Data.
      Your video: ua-cam.com/video/QXGQ9H2xoCw/v-deo.html&feature=share

  • @jacksmith7160
    @jacksmith7160 2 years ago +1

    I think tools will decrease the importance of data engineers.
    We can see how the tools improve day by day.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +5

      Nope. Someone has to be able to use the tools. The job hasn't "decreased" in over a decade. Sorry. The DE is the top job now and going forward and no other job is close.

  • @smeetchaturvedi9006
    @smeetchaturvedi9006 2 years ago

    I think blockchain developer is a better job than data engineer.
    It is in more demand than any other job, be it data engineering or machine learning.

    • @thedatajanitor9537
      @thedatajanitor9537  2 years ago +12

      LMAO. You have no clue what you're talking about. Please don't post shit on my channel or I'll need to delete the post and you. I know what the numbers are, I've worked at the top tech companies. Blockchain developers aren't even on the top ten list.