How To Scrape Reddit & Automatically Label Data For NLP Projects | Reddit API Tutorial

  • Published 29 Jan 2025

COMMENTS • 46

  • @dabunnisher29
    @dabunnisher29 3 years ago +7

    You are certainly one of my favorite Python Masters. I really needed to learn how to do this for stocks. Thank you sooooooo much! You are AWESOME!!!!!

    • @patloeber
      @patloeber 3 years ago +1

      Glad you like it :)

    • @dabunnisher29
      @dabunnisher29 3 years ago

      I looked all around today and I couldn't find how to search a subreddit by a keyword like "PLTR", get the results, and use the NLTK library. If anyone can help, I would appreciate it.

  • @NoIntroTutorials
    @NoIntroTutorials 2 years ago

    MAGNIFICENT! I just needed the first part, getting the post titles!, thank you man!

  • @carfromcars3679
    @carfromcars3679 1 year ago

    wowwww easiest tutorial to follow by FAR. thank you!!!!

  • @dodgewagen
    @dodgewagen 3 years ago +5

    Thanks! Definitely, do more of these API consumption/analysis videos.

  • @bthapa94
    @bthapa94 3 years ago +17

    Great video and well explained! How do you scrape ALL the posts for a certain time period? I am looking at a small subreddit and require a lot of data.

  • @Asianyoungman22
    @Asianyoungman22 7 months ago

    Thank you very much, you saved my life and the dissertation for my master's degree.

  • @chasengonzales85
    @chasengonzales85 2 years ago

    This is really awesome, thank you for taking the time to put this together.

  • @paulsastre9833
    @paulsastre9833 1 year ago +1

    Thank you for this wonderful video. But how did you get the URL used in the beginning?

  • @moy92
    @moy92 3 years ago

    Thanks for doing this! I have been wanting to scrape reddit for a while as exploratory analysis

  • @ElectroCoderEC
    @ElectroCoderEC 1 year ago

    woooow amazing. You save my life. very useful. Thanks a lot! :)

  • @catalina5382
    @catalina5382 3 years ago

    This is exactly what I wanted. I would like to know what modifications I have to make in order to get the headlines with the flair as well.

  • @marizapapdrs
    @marizapapdrs 2 months ago

    What function can be used in R instead of praw? Great video btw, ty!

  • @anny23108
    @anny23108 3 years ago +1

    Could you do a tutorial for mining historical data as well? thank you

  • @prajjwalsinha1187
    @prajjwalsinha1187 1 year ago +1

    How do I scrape comments from reddit posts?

  • @Probly
    @Probly 2 years ago

    Do you know how to scrape a specified time period, so I can compare sentiment towards a stock within r/wallstreetbets or r/investments against the historical stock price for the same period?

  • @varinderjitkaur3656
    @varinderjitkaur3656 2 years ago

    Great video. I am trying to get the historical daily number of members of a subreddit. Is that possible using praw?

  • @blancaherrerosdetejada7160
    @blancaherrerosdetejada7160 1 year ago

    Is there a way to automatically scrape any new posts in a subreddit (without having to re-run the program)?

  • @basemgoueli
    @basemgoueli 3 years ago +2

    I have a project I could use the help of someone of your caliber with. I want to determine the five stocks mentioned most frequently on Reddit's WallStreetBets page on each given day from January 2022 to August 2022 (I have the CSV file for this). After that, I want to take the five most commonly mentioned stocks, ranked by the number of days each spent in the daily top 5 from the aforementioned analysis. I would like to plot the number of mentions of each of those stocks per day against its stock price for the designated time frame. Any help you can offer would be greatly appreciated.
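A minimal sketch of the counting step described in the comment above, assuming the CSV has already been loaded into (date, text) pairs and that the set of ticker symbols to look for is known in advance. The function name `count_mentions`, the sample rows, and the ticker set are hypothetical, and plain word matching will miscount tickers that are also ordinary English words.

```python
from collections import Counter, defaultdict


def count_mentions(rows, tickers, top_n=5):
    """Count ticker mentions per day and the number of days each ticker is in the daily top N.

    rows: iterable of (date_string, text) pairs (assumed layout, not from the original CSV).
    tickers: set of upper-case ticker symbols to look for.
    """
    daily = defaultdict(Counter)
    for day, text in rows:
        for word in text.upper().split():
            # Drop a leading "$" and common trailing punctuation before matching.
            symbol = word.strip("$.,!?:;()")
            if symbol in tickers:
                daily[day][symbol] += 1

    days_in_top = Counter()
    for counts in daily.values():
        for symbol, _ in counts.most_common(top_n):
            days_in_top[symbol] += 1
    return daily, days_in_top


# Hypothetical sample data standing in for the CSV rows.
rows = [
    ("2022-01-03", "PLTR to the moon, $GME too"),
    ("2022-01-03", "pltr again"),
    ("2022-01-04", "GME GME AMC"),
]
tickers = {"PLTR", "GME", "AMC"}

daily, days_in_top = count_mentions(rows, tickers, top_n=1)
print(days_in_top.most_common(5))
```

The per-day counts in `daily` are what would be plotted against the stock price; `days_in_top.most_common(5)` picks the five tickers that appeared in the daily top list most often.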

  • @miaoinperth680
    @miaoinperth680 2 years ago

    Thanks so much for your video. Will you share the code on GitHub or somewhere?

  • @wasgeht2409
    @wasgeht2409 3 years ago

    Hey, thanks for the video :) I read below that you are from Germany. I have a question: is it also possible to use LDA to assign incoming text messages to topic areas?

  • @JackFrost1206
    @JackFrost1206 3 years ago +4

    Maybe you can scrape the subreddit wallstreetbets :D

  • @fernandosantos3576
    @fernandosantos3576 3 years ago

    Yes, I would love it if you published a video on a complete project.
    Thank you.

  • @prod.kashkari3075
    @prod.kashkari3075 3 years ago

    Wow push and praw!

  • @tazrinkhan1297
    @tazrinkhan1297 3 years ago

    Thank you for this video. This is really helpful. I am trying to get data for a particular time period (March 2020- November 2020). Can you please tell me how to write the code for this?

  • @mealone007
    @mealone007 3 years ago

    Great video! Quick question, how to scrape the historical headlines with date stamp?

    • @bitsinbytes9002
      @bitsinbytes9002 3 years ago

      The created_utc attribute will give you the Unix timestamp, then you just have to convert it. Getting historical headlines may be a little trickier, as the PRAW API only lets you iterate through the following "submission" listings: controversial, gilded, hot, new, rising, top.
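A short sketch of the timestamp conversion mentioned in the reply above. In PRAW the attribute is `created_utc` on each submission, a Unix timestamp in seconds; the helper names `to_datetime` and `in_period` are hypothetical, and the date range is the one asked about elsewhere in the thread (March 2020 to November 2020).

```python
from datetime import datetime, timezone


def to_datetime(created_utc):
    """Convert a Unix timestamp (seconds) to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(created_utc, tz=timezone.utc)


def in_period(created_utc, start, end):
    """Return True if the timestamp falls in the half-open interval [start, end)."""
    return start <= to_datetime(created_utc) < end


# Example value standing in for submission.created_utc.
example_created_utc = 1600000000
print(to_datetime(example_created_utc))  # 2020-09-13 12:26:40+00:00

start = datetime(2020, 3, 1, tzinfo=timezone.utc)
end = datetime(2020, 12, 1, tzinfo=timezone.utc)
print(in_period(example_created_utc, start, end))  # True
```

Filtering this way only works on submissions a listing actually returns, so it does not by itself reach posts older than what the listings expose.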

  • @limjuroy7078
    @limjuroy7078 3 years ago

    Why is the user_agent not "Example"?

  • @selcukturk3550
    @selcukturk3550 1 year ago

    how can i get this code?

  • @gsom2000
    @gsom2000 3 years ago

    Great tutorial! Thanks a lot! Is there any way to do the same with Twitter data?

    • @patloeber
      @patloeber 3 years ago

      I already have 2 tutorials using the twitter API (tensorflow NLP and flask Twitter bot). Maybe you can apply the knowledge from these videos here

    • @gsom2000
      @gsom2000 3 years ago

      @@patloeber nice! Apparently I just missed them! Thanks!

  • @fintech1378
    @fintech1378 1 year ago

    why is it always 401?

  • @samarendrapradhan5067
    @samarendrapradhan5067 2 years ago

    I'm using Python 3.9, so older versions may behave differently from what I describe in my comment below. Thanks.

  • @gardnmi
    @gardnmi 3 years ago +4

    Just went to that politics subreddit. It's laughably biased. Thanks for the tutorial.

  • @knowledgeshack5040
    @knowledgeshack5040 3 years ago

    First!

  • @JackFrost1206
    @JackFrost1206 3 years ago

    Are you German?

  • @samarendrapradhan5067
    @samarendrapradhan5067 2 years ago

    Please add the following:
    import matplotlib.pyplot as plt
    import seaborn as sns
    import nltk
    nltk.download('vader_lexicon')
    Use "from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer as SIA" in place of "from nltk.sentiment.vader import SentimentIntensityAnalyzer as SIA".
    Any suggestions? width=100 is raising the error "'width' is an invalid keyword argument for print()".
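One possible explanation for the `width` error reported above, offered as an assumption about the commenter's code rather than a confirmed diagnosis: the built-in `print` has no `width` parameter, while `pprint.pprint` from the standard library does. The `results` list below is hypothetical sample data shaped like sentiment scores.

```python
from pprint import pformat, pprint

# Hypothetical sample of scored headlines; the values are made up for illustration.
results = [
    {"headline": "Example headline one", "neg": 0.0, "neu": 1.0, "pos": 0.0, "compound": 0.0},
    {"headline": "Example headline two", "neg": 0.2, "neu": 0.8, "pos": 0.0, "compound": -0.3},
    {"headline": "Example headline three", "neg": 0.0, "neu": 0.6, "pos": 0.4, "compound": 0.5},
]

# pprint accepts width; print(results, width=100) would raise a TypeError instead.
pprint(results, width=100)

# pformat returns the same formatted text as a string, which is handy for logging.
formatted = pformat(results, width=100)
```

If the script imports `pprint` with `from pprint import pprint`, calling `pprint(...)` rather than `print(...)` is enough for the `width` argument to be accepted.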