Movie Recommendation System with Collaborative Filtering

Поділитися
Вставка
  • Опубліковано 15 вер 2024

КОМЕНТАРІ • 147

  • @iqrasadia5602
    @iqrasadia5602 2 роки тому +10

    For someone who doesn't even have good coding experience. I had to finish a demo for the work. I was able to complete it with your tutorial. Excellent work.

  • @kaushiknippanikar7021
    @kaushiknippanikar7021 4 роки тому +26

    DAMN.....!!!!!!!!!!
    I tried so hard to learn ML on other channels.
    All of them were so confusing and longer than my attention span.
    But your videos keep me concentrated all the way.
    Great job man, hope UA-cam search engine will be more generous on you.

  • @aparanapant1512
    @aparanapant1512 3 роки тому +9

    This tutorial had really helped me a lot ! Thank you so much for this wonderful content and explaining it so clearly ! I wish you all the best !!

  • @roopagaur8834
    @roopagaur8834 5 років тому +6

    Such a great explaination. You need many more subscribers.

    • @CodeHeroku
      @CodeHeroku  5 років тому

      Hi! thanks for your motivating words and support 😀

  • @ashishbisht9552
    @ashishbisht9552 5 років тому +3

    interesting project.... your way of teaching is very simple and explanations are really good... looking forward for more projects to ad in my resume...😅😅

    • @CodeHeroku
      @CodeHeroku  5 років тому

      Thanks for your kind words :)

  • @sarabhian2270
    @sarabhian2270 3 роки тому

    such a simple teacher , he made hard stuff very easy

  • @simkort5799
    @simkort5799 Рік тому

    Hi great video! I personally think the axises during the 'Quantifying the similarity' part can be a bit confusing. If you are measuring the similarities of the Users, then u are comparing the User's rating score on different movies. So it should be U1 and U2 marked on the dots instead of M1 and M2, and M1 & M2 on the axises. Because when it expands from 2D to nD, the vector will be combinations of U1 and U2's rating on different movies, and each of the axis will be M1, M2....Mn

  • @juandiegorodriguez7583
    @juandiegorodriguez7583 2 роки тому +1

    Great Tutorial, it helps me to understand how to implement a recommendation system in my projects about anime

    • @dhaanish0264
      @dhaanish0264 2 роки тому

      Did you complete your project??

    • @juandiegorodriguez7583
      @juandiegorodriguez7583 2 роки тому

      @@dhaanish0264 Yep, It usually takes a long time (10 min) to run the algorithm because the amount of data but it works

    • @dhaanish0264
      @dhaanish0264 2 роки тому

      @@juandiegorodriguez7583 could you share me the code , I want a reference as I'm working on the same project but with different website

  • @ShikshaMishraMTCS
    @ShikshaMishraMTCS 2 роки тому +2

    Hi, can make a video of how after training a model which predicts rating by a user of an item, multiple recommendations for a particular user based on their item rakings are generated?? It would be really helpful

  • @ankan1627
    @ankan1627 3 роки тому +2

    There is an error in how the non-rated entries are treated. specifically, if nans are replaced by 0 BEFORE mean normalization, you are basically penalising that unrated item. The right thing to do is to take the unrated items as 0 AFTER mean normalization, therefore giving them the same user rating as the mean.

    • @swhemerling
      @swhemerling 2 роки тому

      i was also wondering whether there were other approaches... in a large normal dataset, most (the vast majority) of the dataset would wind up being values that are NaNs converted to means... having such a large proportion of imputed values must impact the quality of the predictions, no?

    • @CongNguyen-og3iz
      @CongNguyen-og3iz 2 роки тому

      Can we use the built in standard scaler instead of creating that standardize helper method?

  • @abhishekpalkar52
    @abhishekpalkar52 5 років тому

    You are the best sir, teaching method really helps!

  • @dinsfire4628
    @dinsfire4628 2 роки тому

    This was so incredibly helpful, thank you so much!

  • @tuliomoreira1992
    @tuliomoreira1992 2 роки тому +2

    Thank you for the video! One question, though.. would it be better to use df.mean().mean() as the mean rating of all users instead of 2.5? Many thanks!

  • @Maddy44979
    @Maddy44979 4 роки тому +12

    Very well explained! Thanks. Could you please explain me how the mean 2.5 was subtracted from user's rating which you mentioned as a trick @21:02? I was trying out this on books recommendation dataset.

    • @Just_Moh_it
      @Just_Moh_it 2 роки тому +1

      Because all the ratings at that time were on a scale of 0 to 5, and he converted it to the scale of -2.5 to 2.5 for obvious reasons. Hope this helps 😃

    • @Maddy44979
      @Maddy44979 2 роки тому

      @@Just_Moh_it Thanks!

  • @quangdang633
    @quangdang633 4 роки тому +3

    It would be so great if you can do something more about evaluation metrics to evaluate your model
    Thank you

    • @elianchristian8531
      @elianchristian8531 3 роки тому

      you all prolly dont care at all but does anyone know a method to get back into an Instagram account??
      I was stupid forgot the password. I love any help you can give me

    • @alejandrooliver7197
      @alejandrooliver7197 3 роки тому

      @Elian Christian Instablaster :)

    • @elianchristian8531
      @elianchristian8531 3 роки тому

      @Alejandro Oliver I really appreciate your reply. I found the site through google and I'm waiting for the hacking stuff now.
      I see it takes quite some time so I will reply here later with my results.

    • @elianchristian8531
      @elianchristian8531 3 роки тому

      @Alejandro Oliver it worked and I actually got access to my account again. Im so happy!
      Thank you so much you really help me out !

    • @alejandrooliver7197
      @alejandrooliver7197 3 роки тому

      @Elian Christian You are welcome :)

  • @ravi99217
    @ravi99217 3 роки тому +1

    very Nice, easy to understand all points

  • @shikhasinghal5713
    @shikhasinghal5713 2 роки тому

    The session was really helpful.Thankyou!

  • @Darazfinds4350
    @Darazfinds4350 Рік тому

    Thank you 🙌
    Kindly tell me the accuracy...

  • @wanmuz86
    @wanmuz86 4 роки тому

    Thanks. Very helpful I managed to redo your code without problem

  • @garimaojha3403
    @garimaojha3403 4 роки тому +3

    Hello sir, I have some confusion, our csv will not be the same as in this tutorial, i.e movies in column and ratings in rows our data will be like i'll have three column one is user_id , ratings, product_id how will I apply this filtering on such data, please guide me

    • @roshanmishal8866
      @roshanmishal8866 4 роки тому

      notebooks.azure.com/hello-codeheroku/projects/collab-filtering here u will get

  • @satyapragyandas811
    @satyapragyandas811 3 роки тому +1

    how to evaluate such type of recommendation system?

  • @jaggyjut
    @jaggyjut 3 роки тому

    Nice, could you please shown how to deploy the model and connect with a front end react app

  • @Michael-jt4sb
    @Michael-jt4sb 2 роки тому

    HI, could you please do a video on creating a hybrid recommendation system

  • @vikashkumarchaurasia1299
    @vikashkumarchaurasia1299 3 роки тому +1

    Very nice explanation thanks alot

  • @arnabpersonal6729
    @arnabpersonal6729 3 роки тому +1

    Shouldn't it be axis=1 in standardize method

  • @mirsahib596
    @mirsahib596 3 роки тому +1

    can you please suggest how should we split the data into train/test and evalute our model

  • @ryyesterday
    @ryyesterday 2 роки тому

    For your standardize function, why do you include the 0s in your calculation of the mean when normalizing?

  • @steffm.4884
    @steffm.4884 3 роки тому +2

    Hi, thanks for the great tutorial. In the toy dataset example, what do I need to do in order to find similar users (not movies) based on action_lover's ratings?

  • @sheruloves9190
    @sheruloves9190 4 роки тому +1

    Does standardise() function remain same? eg. if we wish to perform user-user CF, do we need to standardise column instead?
    amazing tutorial, thanks.. :))

  • @tuetran9212
    @tuetran9212 3 роки тому +2

    13:49
    I'm not sure if that's the correct way to standardize data. The correct formula should be:
    x_new = (x - mean)/std

  • @mianmuhammadnouman8700
    @mianmuhammadnouman8700 3 роки тому

    really appreciated. helps me a lot. thank you. Good luck for channel

  • @manishbolbanda9872
    @manishbolbanda9872 3 роки тому

    you teaching skills are way better...thanks for this...also i have a doubt. suppose i wanna build a hybrid system(content based+collaberative) the way to do is that get recommendations(content based+collaberative) from both and combine to form one .example: recommendation1=[m1,m3,m6] and recom2=[m4,m5,m6,m8,m9] then final recomndation=[m6,m1,m3,m6,m4,m5,m8,m9] . is this the way how it should be done or there are other better way please tell.

  • @joeljoseph26
    @joeljoseph26 3 роки тому +1

    I have a small doubt when you penalize the user rating by -2.5 for the case of romantic then won't that affect the scores for action movies.

    • @apostolismoschopoulos1876
      @apostolismoschopoulos1876 2 роки тому

      There is no such problem because ONLY if (user_Ratings-2.5) is negative (i.e small rating) the multiplication will move it even more negative when if it is positive the product will sure be a positive number.

  • @gpstech3048
    @gpstech3048 2 роки тому

    Nice demonstration. Question: i have developed model based cf and predicted values. How can i recommend items from this model?please help me

  • @tanushreekurup1251
    @tanushreekurup1251 4 роки тому +1

    According to your steps shown in the website for application development on medium,when we are trying to open test.html file it appears blank.Can you please explain the testing API step again.

    • @VivekYadav-lw7ln
      @VivekYadav-lw7ln 3 роки тому

      Yes, there is no API located at /movies/default/call/json/get_recommendations

  • @sanskaarpatni9137
    @sanskaarpatni9137 4 роки тому

    Great explanation!
    Thank You Sir

  • @sarabhian2270
    @sarabhian2270 3 роки тому

    I have a dataset and I want to create a model which will predict top3 movies to user, such that final output should be CSV file with columns user_id and top3 how to do this , pls help me

  • @khushalvyas5633
    @khushalvyas5633 5 років тому +4

    very well explained... Thank you!

  • @shivagitam6332
    @shivagitam6332 3 роки тому

    how to split the training and test data set and measure the mean squared error, please help me in this problem

  • @deandroid6008
    @deandroid6008 2 роки тому +1

    Great tutorial! how can I deploy it in the android app? can we convert it into TFlite?

    • @karuneshpalekar5212
      @karuneshpalekar5212 2 роки тому

      Yes you can . It has been already done . Check the official repo of tf

    • @deandroid6008
      @deandroid6008 2 роки тому

      ​@@karuneshpalekar5212 Thanks!! but i want to make tf model using this tutorial approch.

  • @sercanyildirimtugcann
    @sercanyildirimtugcann 2 роки тому

    I understood the tutorial but how we can integrate this system into real website ?nobody explains it in detailed. Just Jupiter notebooks.!!!

  • @kayk137
    @kayk137 5 років тому

    In the third cell why the output is not displaying all the user id's?!!!.... And why it displays Only Five user id's

  • @princeakhoury4294
    @princeakhoury4294 4 роки тому +2

    Please please please help urgently
    When i get to the recommendation part... I'm getting an error
    No matter what movie i say it gives me a KeyError and says there has been an exception handling error
    Please help Asap

    • @snehasingh127
      @snehasingh127 4 роки тому

      you might have written: user_ratings=rating.pivot_table(index=['userId'],columns=['title'],values=['rating'])
      instead of: user_ratings=rating.pivot_table(index=['userId'],columns=['title'],values='rating')
      remove the braces from rating

  • @torrestam8527
    @torrestam8527 4 роки тому

    hmm how can I get list similar movies afterall? It seems like the final list we have is the list of rating values, not the movieId, so I don't know how to get information of the movies from the final recommended list?

  • @parthdesai1108
    @parthdesai1108 4 роки тому

    Sir, while working with a bigger dataset of movielens , dataframe of shape(22884377, 4), i am unable to create a pivot tabel. The error is: Unstacked DataFrame is too big, causing int32 overflow
    PLEASE PROVIDE A SOLUTION TO THIS SIR!!

  • @parisanejatian8940
    @parisanejatian8940 4 роки тому

    thanks a lot for your really really useful tutorial. Keep going .....

  • @jeelkadivar3080
    @jeelkadivar3080 Рік тому

    can you please upload the toy dataset again, the given link is not working. If anyone else has it then pls reply.

  • @madcapPOG
    @madcapPOG Рік тому

    So this is user based collaborative filtering??

  • @hungly549
    @hungly549 2 роки тому +1

    Very good and funny videos bring a great sense of entertainment!

  • @Dollyrana2498
    @Dollyrana2498 3 роки тому

    error in
    ----> 3 from sklearn.metrics.pairwise import consine_similarity
    ImportError: cannot import name 'consine_similarity' from 'sklearn.metrics.pairwise' (/usr/local/lib/python3.7/dist-packages/sklearn/metrics/pairwise.py)
    ---------------------------------------------------------------------------

  • @lavanyamaradani473
    @lavanyamaradani473 4 роки тому

    I really liked the way you implemented and i started using them but i got stuck at a point when i am using the pivot_table. My jupyter notebook kernel dies every time when i try running it. Can someone please help me how to overcome that.

  • @kirtik9233
    @kirtik9233 3 роки тому

    Hi, Could you please share Car Recommendation system

  • @flamingflamingo4021
    @flamingflamingo4021 3 роки тому

    Were the issues/challenges dealt with in any other tutorial or video?

  • @elmerjr.gapuzan9653
    @elmerjr.gapuzan9653 2 роки тому

    is it possible to include an evaluation metrics?

  • @andyslovakia
    @andyslovakia 5 років тому +1

    hi what would you do if you had 278k users and 250k books and the pivot gets out bounds so you have ratings in three columns - user movie rating? thanks

    • @CodeHeroku
      @CodeHeroku  5 років тому

      You will have to use a distributed / parallel approach. Have a look at this:
      endymecy.gitbooks.io/spark-ml-source-analysis/content/%E6%8E%A8%E8%8D%90/papers/Large-scale%20Parallel%20Collaborative%20Filtering%20the%20Netflix%20Prize.pdf

  • @abhi_2104
    @abhi_2104 3 роки тому

    I liked it??? I loved it!!!

  • @HumaRizwan
    @HumaRizwan 3 роки тому

    I subscribed man......great content

  • @arpanroychowdhury2961
    @arpanroychowdhury2961 4 роки тому

    Plase help me here....Why there is a use of Standardize function? please anyone!!

  • @vijayverma5270
    @vijayverma5270 4 роки тому

    Sir how we predict . If one user gives ratings one movie1 to 5 then how we predict for movie2.

  • @alexsr9697
    @alexsr9697 5 років тому +2

    really good video and explanation, +1 sub bro

    • @CodeHeroku
      @CodeHeroku  5 років тому

      Thanks for your motivation :)

  • @sumitvaise5452
    @sumitvaise5452 3 роки тому

    Loved it.

  • @mukundkrishna2789
    @mukundkrishna2789 4 роки тому +2

    Can I take this code for my final year project?

  • @prepptspsc4476
    @prepptspsc4476 4 роки тому

    I just loved it

  • @garimaojha3403
    @garimaojha3403 4 роки тому

    Can I Have more than 1 indexes because I want to filter movie on user and genres basis

  • @iasupsc100
    @iasupsc100 Рік тому

    Can you make me a collaborative recommendations system project

  • @nabdeep28
    @nabdeep28 4 роки тому

    But what if I want to find out the NAN values using python?

  • @sandeepv2441
    @sandeepv2441 3 роки тому

    Sir how can we deploy this model in flutter application

  • @myopinio7056
    @myopinio7056 4 роки тому

    Item based : movie 6

  • @michaelomosebi6374
    @michaelomosebi6374 2 роки тому

    really helpful...

  • @anwarshadaab
    @anwarshadaab 4 роки тому

    What if in my dataset, instead of NaN, all the values are already 0. Then should I use the fillna(0) function?

    • @mirsahib596
      @mirsahib596 3 роки тому

      function call will have no effect on your code because your dataset already contain 0 value

  • @mukundkrishna2789
    @mukundkrishna2789 4 роки тому

    When I ran the final cell it was coming series object has no attribute sort

  • @maryammohseni4507
    @maryammohseni4507 10 місяців тому

    it was great

  • @mohammadrezarashidiyan2058
    @mohammadrezarashidiyan2058 2 роки тому

    hello , how can i access to the power point of this video

  • @abhishek-singh31
    @abhishek-singh31 2 роки тому

    How to convert this to a web application?

  • @Mansi-te8js
    @Mansi-te8js 10 місяців тому

    Does it not have ui ???

  • @priyanshneema
    @priyanshneema 5 років тому

    Hlo mihir sir, the cource is very good i want to suggest you please update code heroku I want to become campus ambassador of this platform can you provide me.

  • @lokeshpaladugula5793
    @lokeshpaladugula5793 5 років тому

    sir, nice video .
    but i want to know how to do user similarity recommended system of videos, here the data consists only users and they liked videos (no rating required), can u help me

    • @MrAnandml
      @MrAnandml 3 роки тому

      Based on the duration of a particular genre that a user watched ....

  • @chetanmundhe8619
    @chetanmundhe8619 4 роки тому

    hi,, can i get the codes for same example in R

  • @sameerpokhrel8554
    @sameerpokhrel8554 4 роки тому

    Which algorithm is implemented here?

  • @qaisarabbas6090
    @qaisarabbas6090 5 років тому

    What if we have multiple criteria for movie? Then how can we find similarities.

    • @CodeHeroku
      @CodeHeroku  5 років тому

      Hi Qaisar, could you please elaborate on what you mean by multiple criteria for a movie? Do you mean having different features such as movie director, tags etc.?

  • @akshayjagtap7834
    @akshayjagtap7834 3 роки тому

    Where can I find python file for this

  • @Dollyrana2498
    @Dollyrana2498 3 роки тому

    i want dataset of movie and rating i didn't found can you send me the link here

    • @CodeHeroku
      @CodeHeroku  3 роки тому +1

      drive.google.com/file/d/1WWQCl9w52M1sXNWd4JSKL7q-HHywk03p/view

  • @firdausisuzuki9097
    @firdausisuzuki9097 3 роки тому

    how to do the rmse?

  • @sreelayavuyyuru2319
    @sreelayavuyyuru2319 3 роки тому

    The website is not working!!!

  • @wardatariq3612
    @wardatariq3612 4 роки тому

    can we make recommender system using collaborative filtering in simple Visual C#

    • @rohansingh1057
      @rohansingh1057 4 роки тому

      You can, however, you will have to find some frameworks and tools to do so.
      Python and R are used most of the times as they have wonderful tools to help out as sci-kit learn.
      If you understood the maths completely and can implement it by yourself go for it. However, I still will recommend at least looking up for a matrix multiplication library for C#.
      This is vital as multiplying millions of numbers is really time-consuming and your code may take days to train. Or may not train at all.
      Libraries have really efficient implementations of matrix multiplication and even utilise hardware acceleration with GPU sometimes.

  • @olaitanafolabi4856
    @olaitanafolabi4856 2 роки тому

    Please how can I join your whatsapp group ? I need further guide on a project I am working on

  • @erenterlikli2821
    @erenterlikli2821 4 роки тому +1

    Can we do it with free Azure account?

    • @CodeHeroku
      @CodeHeroku  4 роки тому

      Yes. And you don't need an Azure account, you just need a Microsoft account.

  • @monikamayer6279
    @monikamayer6279 4 роки тому

    Hi, i have used your approach for my master thesis, but there is a big question, how the prediction will make? the prediction is the last step in collaborative filtering. How can i cover your approach in my letter?

  • @ЭльвираШебанова
    @ЭльвираШебанова 2 роки тому

    Specify the goal and need money

  • @torrestam8527
    @torrestam8527 4 роки тому

    I have a question, is "Action_lover" a new user to the system? Or this user has already been in the system?

    • @omkarshete185
      @omkarshete185 4 роки тому +1

      It's a new user

    • @torrestam8527
      @torrestam8527 4 роки тому

      @@omkarshete185 so what if "action_lover" is old user, should I add his ratings to the matrix before calculating cosine similarity?

    • @omkarshete185
      @omkarshete185 4 роки тому

      @@torrestam8527 I think it doesn't matters if you didn't add that user to matrix ,because you are finding the similar movie on the rating given to previous movies as we are using the item centered method.

    • @omkarshete185
      @omkarshete185 4 роки тому

      @@torrestam8527 are you working on this project ?

    • @torrestam8527
      @torrestam8527 4 роки тому

      @@omkarshete185 yes I am :)) I'm 90% through it, thanks to this video.

  • @Hafiyyanabdulaziz
    @Hafiyyanabdulaziz 3 роки тому

    14:39 where did you get the number 5?

    • @CodeHeroku
      @CodeHeroku  3 роки тому +1

      Ratings are on the scale of 1 to 5

  • @utkarshsalvi518
    @utkarshsalvi518 3 роки тому

    toy_dataset is not there in the google drive

    • @CodeHeroku
      @CodeHeroku  3 роки тому

      Please get it from here: raw.githubusercontent.com/codeheroku/Introduction-to-Machine-Learning/master/Collaborative%20Filtering/dataset/toy_dataset.csv

  • @farizsutedy7155
    @farizsutedy7155 4 роки тому

    hebat lu bro

  • @giannismaris13
    @giannismaris13 3 роки тому

    How can we find the accuracy score here?

    • @anishkc3234
      @anishkc3234 2 роки тому

      i need help with that too did u do it?

    • @giannismaris13
      @giannismaris13 2 роки тому

      @@anishkc3234 Hi, I used RMSE , and MAE (aka root mean square error and mean absolute error)

    • @anishkc3234
      @anishkc3234 2 роки тому

      @@giannismaris13 can i get the code??i need it

  • @55535y
    @55535y 3 роки тому

    Not able to find the data😭

    • @CodeHeroku
      @CodeHeroku  3 роки тому

      drive.google.com/file/d/1WWQCl9w52M1sXNWd4JSKL7q-HHywk03p/view
      raw.githubusercontent.com/codeheroku/Introduction-to-Machine-Learning/master/Collaborative%20Filtering/dataset/toy_dataset.csv

  • @iramansari3625
    @iramansari3625 4 роки тому

    Toydataset link?

    • @CodeHeroku
      @CodeHeroku  4 роки тому

      It's in the Azure Notebook project
      notebooks.azure.com/hello-codeheroku/projects/collab-filtering

    • @md.fazlulkarim8847
      @md.fazlulkarim8847 4 роки тому

      @@CodeHeroku The Microsoft Azure Notebooks preview website will be retired on October 9th, 2020. Please transfer the course file to another platform.

  • @snandi1603
    @snandi1603 3 роки тому +1

    Item to Item collaborating filtering explanation was bad

  • @simranshresthacomputing23
    @simranshresthacomputing23 3 роки тому

    which algorithm is this??

  • @sriram1120
    @sriram1120 3 роки тому

    Which algorithm is used here?