Full Python Portfolio Project! Create a smart program to download & transcribe top podcasts.

  • Published Jun 7, 2024
  • Check out www.assemblyai.com/?... to start transcribing as many podcasts as your heart desires!
    In this video we create a Python program that can automatically scrape the RSS feeds of your favorite podcasters, pulling out the episodes you’ll find most interesting, and downloading + transcribing them.
    This project exercises a wide range of Python skills, which makes it a good portfolio project. You'll use the BeautifulSoup + requests libraries to scrape RSS feeds and download MP3 podcast files. You can use the regex library (re) and other NLP libraries to search for specific episodes that you'll enjoy. Next you'll see how to use AssemblyAI's speech-to-text API to transcribe all of the episodes that you download. This code will be used to build a text corpus for language analysis in upcoming tutorials.
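The scraping and search steps described above could be sketched roughly as follows. This is a minimal illustration, not the code from the video: it swaps BeautifulSoup and requests for the standard library's `xml.etree.ElementTree` and `urllib` so it runs without extra installs. The sample feed and the names `find_episodes`, `safe_filename`, and `download_episode` are hypothetical and exist only for this example.

```python
import re
import urllib.request
import xml.etree.ElementTree as ET

# Hypothetical two-episode feed, shaped like a standard RSS 2.0 podcast feed.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Sleep and recovery basics</title>
      <description>How sleep affects training.</description>
      <enclosure url="https://example.com/ep1.mp3" type="audio/mpeg" length="123"/>
    </item>
    <item>
      <title>Listener mailbag</title>
      <description>Assorted questions.</description>
      <enclosure url="https://example.com/ep2.mp3" type="audio/mpeg" length="456"/>
    </item>
  </channel>
</rss>"""


def find_episodes(feed_xml, pattern):
    """Return (title, mp3_url) pairs for items whose title or description matches."""
    root = ET.fromstring(feed_xml)
    matcher = re.compile(pattern, re.IGNORECASE)
    matches = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        description = item.findtext("description", default="")
        enclosure = item.find("enclosure")  # holds the audio file URL
        if enclosure is None:
            continue  # skip items without an audio attachment
        if matcher.search(title) or matcher.search(description):
            matches.append((title, enclosure.get("url")))
    return matches


def safe_filename(title):
    """Turn an episode title into a filesystem-friendly MP3 filename."""
    slug = re.sub(r"[^A-Za-z0-9]+", "_", title).strip("_")
    return f"{slug}.mp3"


def download_episode(mp3_url, title):
    """Download one episode to the current directory (needs network access)."""
    path = safe_filename(title)
    urllib.request.urlretrieve(mp3_url, path)
    return path


# Search the sample feed for episodes that mention "sleep".
for title, url in find_episodes(SAMPLE_FEED, r"\bsleep\b"):
    print(title, "->", url)
```

For a real feed, the XML text would come from fetching the podcast's RSS URL first, and `download_episode` would be called for each match.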
    To view/download source code for project: github.com/keithgalli/podcast...
    If you have any questions, let me know in the comments!
    Make sure to smash like + subscribe if you enjoyed this video :)
    -------------------------
    Follow me on social media!
    Instagram | / keithgalli
    Twitter | / keithgalli
    TikTok | / keithgalli
    -------------------------
    If you are curious to learn how I make my tutorials, check out this video: • How to Make a High Qua...
    Practice your Python Pandas data science skills with problems on StrataScratch!
    stratascratch.com/?via=keith
    Join the Python Army to get access to perks!
    YouTube - / @keithgalli
    Patreon - / keithgalli
    *I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
    -------------------------
    Video timeline!
    0:00 - Video Introduction
    1:19 - How podcasts work (RSS feeds overview)
    5:11 - How can we utilize the XML webpages? (breakdown of RSS feed information & how we’ll use it to create a smart program)
    7:47 - Accessing this project on GitHub
    9:22 - Writing Python code to download podcasts locally (requests & beautifulsoup libraries)
    18:10 - Modify our script to be able to download many podcasts
    22:51 - Building in smart search capabilities to grab podcasts we’ll find most interesting!
    31:00 - Using the AssemblyAI API to transcribe the podcasts we’ve downloaded
    1:06:08 - Cleaning our code with functions & classes and putting everything into Python scripts.
    1:18:09 - Portfolio project extension ideas! (Spotify API, NLP semantic search)
    1:19:56 - Smash like & subscribe pretty please :)
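The transcription step (31:00 in the timeline) could look something like the sketch below. It is not the code from the video. It assumes AssemblyAI's REST flow of uploading audio to `/v2/upload`, creating a job at `/v2/transcript`, and polling `/v2/transcript/{id}` with the API key in an `authorization` header; check the current AssemblyAI docs before relying on it. The helper names `_call` and `transcribe`, and the injectable `call` and `sleep` parameters, are hypothetical and exist so the polling logic can be tried without network access.

```python
import json
import time
import urllib.request

# Assumed base URL for AssemblyAI's REST API; verify against their docs.
API_BASE = "https://api.assemblyai.com/v2"


def _call(method, url, api_key, data=None, content_type=None):
    """Send one HTTP request and return the decoded JSON response."""
    headers = {"authorization": api_key}
    if content_type:
        headers["content-type"] = content_type
    request = urllib.request.Request(url, data=data, headers=headers, method=method)
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def transcribe(mp3_path, api_key, call=_call, poll_seconds=5, sleep=time.sleep):
    """Upload a local MP3, start a transcription job, and wait for the text."""
    with open(mp3_path, "rb") as audio_file:
        audio = audio_file.read()

    # 1. Upload the raw audio bytes; the response carries a URL for the upload.
    upload = call("POST", f"{API_BASE}/upload", api_key, audio,
                  "application/octet-stream")

    # 2. Create a transcription job that points at the uploaded audio.
    body = json.dumps({"audio_url": upload["upload_url"]}).encode("utf-8")
    job = call("POST", f"{API_BASE}/transcript", api_key, body,
               "application/json")

    # 3. Poll until the job finishes or fails.
    while True:
        result = call("GET", f"{API_BASE}/transcript/{job['id']}", api_key)
        if result["status"] == "completed":
            return result["text"]
        if result["status"] == "error":
            raise RuntimeError(result.get("error", "transcription failed"))
        sleep(poll_seconds)


# Example usage (needs a real API key and a downloaded episode):
# text = transcribe("Sleep_and_recovery_basics.mp3", "YOUR_API_KEY")
```

Passing `call` and `sleep` as parameters is a design choice for this sketch only: it lets the upload, submit, and poll sequence be exercised with a fake HTTP function before spending API credits.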

COMMENTS • 35

  • @andyn6053
    @andyn6053 1 year ago

    I absolutely loved your videos! Please make more data science projects!

  • @abhaytiwari5991
    @abhaytiwari5991 1 year ago +3

    Well-done Keith 👏🔥

  • @professuh2618
    @professuh2618 1 year ago +1

    Awesome video idea! Can't wait to watch on lunch break

  • @wiz8058
    @wiz8058 1 year ago +2

    💪🔥🔥 you’re doing the most Keith.

  • @Dividendology
    @Dividendology 1 year ago +1

    Awesome!!

  • @heyits_sree
    @heyits_sree 1 year ago +1

    Good to have you back Keith...

  • @chillydoog
    @chillydoog 1 year ago

    Whoa. This is rad. I'm so pumped to build this!
    thanks dog!

  • @zahra_koohestani
    @zahra_koohestani 1 year ago

    very interesting! thank u 🙏🏼

  • @jonpounds1922
    @jonpounds1922 1 year ago

    love it

  • @mirshodoripov1035
    @mirshodoripov1035 1 year ago

    great video as always

  • @nolimit167
    @nolimit167 1 year ago +1

    I recommend Darknet Diaries if you like computer crime podcasts. Also, thanks for this video!

  • @ukaszdugozima816
    @ukaszdugozima816 1 year ago +1

    Hello! Thank you for your invaluable video! I find it extremely useful for beginners! I would like to ask about one thing regarding data. I learnt Pandas in terms of Data Wrangling and Transformation. So how about Pandas for Data Engineers? Is it a useful tool for ETL/ELT transformations? Obviously, the next step will be PySpark, but I would like to start by learning Pandas. It seems like a good path toward PySpark. What do you think about it? I would appreciate it if you could share your views.

  • @konstantinpluzhnikov4862
    @konstantinpluzhnikov4862 1 year ago

    The return of the legend!

  • @robertleechford4250
    @robertleechford4250 4 months ago

    Great

  • @drakeweissman6499
    @drakeweissman6499 1 year ago +2

    Really cool! Could you do more on model deployment?

    • @KeithGalli
      @KeithGalli 1 year ago

      What types of things are you trying to learn about model deployment?? Happy to look into that further.

    • @drakeweissman6499
      @drakeweissman6499 1 year ago

      @KeithGalli The CS side of things! I just deployed my first model with Streamlit but want to take it up a notch

  • @nelsonbeneche2372
    @nelsonbeneche2372 1 year ago

    Great videos..... wawww

  • @tokyofamily8536
    @tokyofamily8536 1 year ago

    could you explain how this can be done locally using OpenAI's Whisper?

  • @fiefiego2298
    @fiefiego2298 1 year ago

    awesome! i listen to podcasts a lot as well! that is so exciting to learn this topic with python!! at the same time, is it possible to analyze individual podcast preference?

    • @KeithGalli
      @KeithGalli 1 year ago

      Glad to know you are excited about the video! Yeah I think it should be possible to analyze individual podcast preference. What types of things are you trying to analyze??

    • @fiefiego2298
      @fiefiego2298 1 year ago

      @KeithGalli happy to get your message! I'm thinking about something like an annual summary, e.g. how many podcasts you listen to in a certain period of time; which episode/podcast is your favourite? your total listening time? what do you think? and is it possible to get that data?

  • @sebastianalvarez1537
    @sebastianalvarez1537 1 year ago

    holy moly

  • @amrak2010
    @amrak2010 1 year ago

    Please make a video about ChatGPT

  • @mirshodoripov1035
    @mirshodoripov1035 1 year ago +1

    are you working for a company right now?

    • @KeithGalli
      @KeithGalli 1 year ago

      I'm not working for a specific company, but I'm working on a couple freelance data science consulting projects for companies as well as helping run an analytics bootcamp.

  • @bennguyen1313
    @bennguyen1313 6 months ago

    Do you have a list of the podcasts that you like to listen to?

    • @KeithGalli
      @KeithGalli 5 months ago

      Definitely! Here are a few:
      - Joe Rogan Experience
      - Huberman Lab
      - Tim Ferriss Show
      - Lex Fridman Podcast
      - Morning Brew Daily
      - Business Wars