Andrius Knispelis
Andrius Knispelis
  • 1
  • 182 203
LDA Topic Models
LDA Topic Models is a powerful tool for extracting meaning from text. In this video I talk about the idea behind the LDA itself, why does it work, what are the free tools and frameworks that can be used, what LDA parameters are tuneable, what do they mean in terms of your specific use case and what to look for when you evaluate it.
Переглядів: 182 208

Відео

КОМЕНТАРІ

  • @lioncaptive
    @lioncaptive 18 днів тому

    This video topic on LDA modeling is the best layman's presentation thus far, but even more so applicable in the recent state of AI modeling. Thank you dearly.

  • @tharushapiyumal
    @tharushapiyumal 2 місяці тому

    Thanks a lot

  • @bhuvandwarasila
    @bhuvandwarasila 3 місяці тому

    Thank you!!! Great help!!!

  • @shakibomrani9830
    @shakibomrani9830 Рік тому

    This is great, clear, and practical. Excellent package in 21 minutes. Thank you!

  • @jumanakhrais1065
    @jumanakhrais1065 2 роки тому

    Thank you, very useful

  • @eyabaklouti9066
    @eyabaklouti9066 2 роки тому

    amazing !!!!

  • @stuartxu1509
    @stuartxu1509 2 роки тому

    That's a pretty great video for LDA! Thank you sir for making this. I just have a simple question. Given a magazine, can we name its topic using the words that have high prob in the highest prob topic associated to this magazine?

  • @yagangchen916
    @yagangchen916 2 роки тому

    Thank you soooooooooooooo much. The best topic modelling presentation ever!!!!!! Many thanks from a linguistic student.

  • @aasifjavid471
    @aasifjavid471 2 роки тому

    One of the best presentations I've ever seen .. Thanks

  • @bosorensen
    @bosorensen 2 роки тому

    Fantastic! Thank you!

  • @raWan1409
    @raWan1409 2 роки тому

    Iiiiiii

  • @darwin6984
    @darwin6984 2 роки тому

    could you share us another new video how topic models applying to real world problems or solutions

  • @matthewjohncoffey8790
    @matthewjohncoffey8790 2 роки тому

    Really clear and concise way to explain this complex topic. Thanks

  • @pallavic2012
    @pallavic2012 3 роки тому

    can we apply LDA modeling on images (not text image).

  • @8eck
    @8eck 3 роки тому

    Instantly can see, when someone is talking about a topic in which he was involved himself. Great job! You definitely know what you are talking about. “If you can't explain it to a six year old, you don't understand it yourself.” ― Albert Einstein.

  • @8eck
    @8eck 3 роки тому

    This one was super helpful, thank you very much!

  • @fitrazak5147
    @fitrazak5147 3 роки тому

    Compact, crisp and strong narrative video presentation...I watched it only 2 times and understand the process thoroughly....1 question, just to get your insight..Is LDA can be combined with systematic literature review protocol (SLR) and the produced model in LDA is similar with structural equation modeling (SEM) model?

  • @toddgm8031
    @toddgm8031 3 роки тому

    wow ,this is a great video!! and i like the color scheme you used, could you share it or where i can download the similar styles? thanks!

    • @andriusknispelis8627
      @andriusknispelis8627 3 роки тому

      hi Todd, happy to hear you liked it :) for colors i used one of the styles that i found here: flatuicolors.com (my go to place for color schemes). For a font i used "Helvetica Neue".

  • @alexwutzke7459
    @alexwutzke7459 3 роки тому

    Great video! One question - what is a magazine?

    • @andriusknispelis8627
      @andriusknispelis8627 3 роки тому

      A question for future historians! :) Magazine is 40% articles of a certain, usually shared, topic, and 60% adds. I know, i didnt belive it either, but 20th century was a wild one in terms of ideas like this

  • @zahra_az
    @zahra_az 3 роки тому

    great job. but why didn't you describe the prior distribution this model uses?

    • @andriusknispelis8627
      @andriusknispelis8627 3 роки тому

      Thanks Zahra, happy you liked it! I didnt go into priors since i didnt touch them - i left them as their default values in gensim. I mostly experimented preprocessing of text corpus, number of topics and interpretation of the results :)

  • @ShahzadKhan-zq9ty
    @ShahzadKhan-zq9ty 3 роки тому

    one of the best presentation for beginner like me

  • @theq18
    @theq18 3 роки тому

    Amzing presentation and explanation

  • @dollbaby665
    @dollbaby665 3 роки тому

    Amazing!!!

  • @simpliexplained8519
    @simpliexplained8519 4 роки тому

    This presentation is so much informative. Thanks!

  • @lexli5515
    @lexli5515 4 роки тому

    thank you so much for making such a complex concept relatively easy to comprehend

  • @ktrades2898
    @ktrades2898 4 роки тому

    Make more content!

  • @xuequan0913
    @xuequan0913 4 роки тому

    This is really great! Love the excellent visualization and methodical explanation.

  • @drunknoodle3188
    @drunknoodle3188 4 роки тому

    amazing explanation!

  • @mark_grey
    @mark_grey 4 роки тому

    This is a lot of help! Many thanks~

  • @LRth3KING
    @LRth3KING 4 роки тому

    Man I loved this video. It helped me so much!! Really apreciated it. Now my master degree is on the right track once again!!!

  • @leromerom
    @leromerom 4 роки тому

    The best presentation I’ve seen for LDA’s and most other themes. Outstanding work thank you for producing it!

  • @derekxiaoEvanescentBliss
    @derekxiaoEvanescentBliss 4 роки тому

    Found this trying to learn about linear discriminant analysis, stayed because it was a week put together presentation on an interesting topic

  • @zahraamirmahani4828
    @zahraamirmahani4828 4 роки тому

    it was as help full as possible thank you so much

  • @Anirudhlohia1
    @Anirudhlohia1 4 роки тому

    A slightly different question. Which tool was used to create this presentation?

    • @andriusknispelis8627
      @andriusknispelis8627 3 роки тому

      THANK YOU! :) Everything was done in Keynote (mac version of a Powerpoint). It comes with all the animations you see, and it lets you record sound as well and then export everything straight into video.

  • @Anirudhlohia1
    @Anirudhlohia1 4 роки тому

    Wow! Blown over by the video. It was easy to follow and gained a lot of information for implementing my model. Thank you

  • @sasankv9919
    @sasankv9919 4 роки тому

    This presentation is excellent. Thanks.

  • @1996ashishs
    @1996ashishs 4 роки тому

    Thanks a lot sir

  • @SogaMaplestory
    @SogaMaplestory 4 роки тому

    Amazing intro to LDA, thank you very much

  • @svengunther7653
    @svengunther7653 4 роки тому

    Dude, such a great presentation. Thank you very much for this superb explanation! :)

  • @mmishrafaculty
    @mmishrafaculty 4 роки тому

    Very well explained. Did you use Mallet along with Gensim or was it only Gensim?

  • @PrimevalShrimp
    @PrimevalShrimp 4 роки тому

    Important question: I build a model like this on my own using simple SVD and vectorizing my documents. At the moment I have around 10k documents and searching for the closet 10 documents or so is quite fast. (~0.001s). However, at this moment, I am just brute-forcing getting the closest vector in linear time and if my database will be bigger (over 1M or even 10M documents) I need some kind of indexing to make the search much faster. Do you have any suggestions? I feel confident to design an approximate algorithm by myself, but I would prefer an exact solution running in log(n) time.

    • @andriusknispelis8627
      @andriusknispelis8627 3 роки тому

      what i did was use K means to cluster the whole space of documents. Then the number of clusters will depend on number of themes (not the documents). Each cluster is then represented by a single LDA distribution, and then the search was split into two spets: 1) find a cluster, 2) find nearest neighbors in that cluster That helped us a lot. Well, that and the fact that while LDA model was trained and processed in Python, we used faster and more efficient languages for finding similarities

  • @rishabhag1949
    @rishabhag1949 4 роки тому

    Hey Andrius, great job explaining the topic modeling concept and relating it with the use-case of magazines. I really enjoyed the presentation, the graphics, video, and the entire layout plan. Kudos. Once again, thank you for posting it here.

  • @hydraze
    @hydraze 4 роки тому

    Thank you for that beautiful presentation! I learnt a lot from it and enjoyed it immensely

  • @rohitupadhyay4665
    @rohitupadhyay4665 4 роки тому

    Amazing explanation! Thank you for this.

  • @wezside
    @wezside 4 роки тому

    Wow. Fantastic explanation. Thanks so much.

  • @mahmoueco1200
    @mahmoueco1200 4 роки тому

    wow just wow dude i like ur explanation very much

  • @sigmarulesog984
    @sigmarulesog984 4 роки тому

    best explanation

  • @ArtArchitectureAndAI
    @ArtArchitectureAndAI 4 роки тому

    What an amazing presentation Andrius. Very well explained and nicely crafted. Clearly demonstrates your deep understanding of LDA and fantastic communication skills!

  • @tserenpurevchuluunsaikhan821
    @tserenpurevchuluunsaikhan821 4 роки тому

    [Remove if a word appears in more than 10% of the articles. Remove if a word appears in less than 20 articles.] Can someone explain these actions in detail? Why do we need this?

    • @andriusknispelis8627
      @andriusknispelis8627 4 роки тому

      This is the part when dictionary is built, it's about removing words that appear too many times or too few times. Check out the tutorials here radimrehurek.com/gensim/auto_examples/index.html Hope it helps :) cheers!

  • @peetali7131
    @peetali7131 4 роки тому

    OMG literally every single word you said in this video is super helpful to me. Thanks a ton!!