Full Python Portfolio Project! Create a smart program to download & transcribe top podcasts.
- Published Jun 7, 2024
- Check out www.assemblyai.com/?... to start transcribing as many podcasts as your heart desires!
In this video we create a Python program that can automatically scrape the RSS feeds of your favorite podcasters, pulling out the episodes you’ll find most interesting, and downloading + transcribing them.
This project leverages a wide range of Python skills, making it a good portfolio project. In it you’ll use the BeautifulSoup + requests libraries to first web scrape & download MP3 podcast files. You can use the regex library (re) and other NLP libraries to smart search for specific episodes that you'll enjoy. Next you'll see how you can use AssemblyAI's speech-to-text API to transcribe all of the episodes that you download. This code will be leveraged to create a text corpus that will be used for language analysis in upcoming tutorials.
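To make the scrape-then-search idea concrete, here is a minimal sketch of the first two steps. It is not the code from the video: it swaps in the standard library's xml.etree.ElementTree for BeautifulSoup so it runs without extra installs, and it parses a small made-up feed instead of fetching a real one with requests. The feed contents, the example.com URLs, and the function names parse_episodes and search_episodes are all assumptions for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample feed. A real RSS feed would be fetched over HTTP first
# (the video uses the requests library for that step).
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example Podcast</title>
    <item>
      <title>Episode 1: Sleep and Recovery</title>
      <description>How sleep affects learning.</description>
      <enclosure url="https://example.com/ep1.mp3" type="audio/mpeg" length="123"/>
    </item>
    <item>
      <title>Episode 2: Startup Stories</title>
      <description>Founders talk about fundraising.</description>
      <enclosure url="https://example.com/ep2.mp3" type="audio/mpeg" length="456"/>
    </item>
  </channel>
</rss>
"""


def parse_episodes(feed_xml):
    """Return one dict (title, description, mp3_url) per <item> in the feed."""
    root = ET.fromstring(feed_xml)
    episodes = []
    for item in root.iter("item"):
        enclosure = item.find("enclosure")
        if enclosure is None:
            # Skip items that carry no audio file.
            continue
        episodes.append({
            "title": item.findtext("title", default=""),
            "description": item.findtext("description", default=""),
            "mp3_url": enclosure.get("url"),
        })
    return episodes


def search_episodes(episodes, pattern):
    """Keep episodes whose title or description matches the regex (case-insensitive)."""
    regex = re.compile(pattern, re.IGNORECASE)
    return [
        ep for ep in episodes
        if regex.search(ep["title"]) or regex.search(ep["description"])
    ]


episodes = parse_episodes(SAMPLE_FEED)
matches = search_episodes(episodes, r"\bsleep\b")
for ep in matches:
    print(ep["title"], "->", ep["mp3_url"])
```

Each matching mp3_url could then be downloaded and the saved file passed to a transcription service; those steps need network access and an API key, so they are left out of this sketch.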
To view/download source code for project: github.com/keithgalli/podcast...
If you have any questions, let me know in the comments!
Make sure to smash like + subscribe if you enjoyed this video :)
-------------------------
Follow me on social media!
Instagram | / keithgalli
Twitter | / keithgalli
TikTok | / keithgalli
-------------------------
If you are curious to learn how I make my tutorials, check out this video: • How to Make a High Qua...
Practice your Python Pandas data science skills with problems on StrataScratch!
stratascratch.com/?via=keith
Join the Python Army to get access to perks!
YouTube - / @keithgalli
Patreon - / keithgalli
*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
-------------------------
Video timeline!
0:00 - Video Introduction
1:19 - How podcasts work (RSS feeds overview)
5:11 - How can we utilize the XML webpages? (breakdown of RSS feed information & how we’ll use it to create a smart program)
7:47 - Accessing this project on GitHub
9:22 - Writing Python code to download podcasts locally (requests & BeautifulSoup libraries)
18:10 - Modify our script to be able to download many podcasts
22:51 - Building in smart search capabilities to grab podcasts we’ll find most interesting!
31:00 - Using the AssemblyAI API to transcribe the podcasts we’ve downloaded
1:06:08 - Cleaning our code with functions & classes and putting everything into Python scripts.
1:18:09 - Portfolio project extension ideas! (Spotify API, NLP semantic search)
1:19:56 - Smash like & subscribe pretty please :)
I absolutely loved your videos! Please make more data science projects!
Well-done Keith 👏🔥
Awesome video idea! Can't wait to watch on lunch break
Hope you enjoy it!!
💪🔥🔥 you’re doing the most Keith.
Thanks man!!
Awesome!!
Good to have you back Keith...
Good to be back!
Whoa. This is rad. I'm so pumped to build this!
thanks dog!
Enjoy!!
very interesting! thank u 🙏🏼
love it
great video as always
Thank you :)
I recommend Darknet Diaries if you like computer crime podcasts. Also, thanks for this video!
Hello! Thank you for your invaluable video! I find it extremely useful for beginners! I would like to ask about one thing regarding data. I learned Pandas for data wrangling and transformation. How about Pandas for data engineers? Is it a useful tool for ETL/ELT transformations? Obviously, the next step will be PySpark, but I would like to start by learning Pandas. It seems like a good path toward PySpark. What do you think? I would appreciate it if you could share your views.
The return of the legend!
😎
Great
Really cool! Could you do more on model deployment?
What types of things are you trying to learn about model deployment?? Happy to look into that further.
@KeithGalli The CS side of things! I just deployed my first model with Streamlit but want to take it up a notch
Great videos..... wawww
could you explain how this can be done locally using openai's whisper?
awesome! i listen to podcasts a lot as well! that is so exciting to learn this topic with python!! at the same time, is it possible to analyze individual podcast preference?
Glad to know you are excited about the video! Yeah I think it should be possible to analyze individual podcast preference. What types of things are you trying to analyze??
@KeithGalli happy to get your message! im thinking about something like annual summary, e.g. how many podcasts you listen to in a certain period of time; which episode/ podcast is your favourite? your total consuming time? what do you think? and is it possible to get those data?
holy moly
Please make a video of ChatGPT
are you working for any company right now?
I'm not working for a specific company, but I'm working on a couple freelance data science consulting projects for companies as well as helping run an analytics bootcamp.
Do you have a list of the podcasts that you like to listen to?
Definitely! Here are a few:
- Joe Rogan Experience
- Huberman Lab
- Tim Ferriss Show
- Lex Fridman Podcast
- Morning Brew Daily
- Business Wars