Great talk! I've got two questions: 1. Is it a real-time recommendation because upon user query it is enriched with historical data as well as real-time data(stored after each interaction)? Or something else like you are training embedding at near real time? 2. How often do you train Ranking model and does it only train with candidates?
From my understandings: 1. It's real time because it's capable to do one recommendation per user query without much delay. Instead of spotify that just gives you one recommendation once a week. So it's essential the same thing except the infrastructure required to be able to provide "real time" recommendation is different. But the ML models and processes you'd use to do one or the other would be largely the same. It's not about training it's about how speedy your evaluation is. Also, adding filters and so on, so overall it's a more complex form of interaction with the ML system. 2. You'd want to train your ranking model whenever you feel you've gotten enough data. There's not really a magic number here. As users interact with your website you'll get better data for positive/negative examples; it also depends on how much it costs for you to re-train your algo as well; if it's too expensive then maybe you just want to be re-training when necessary.
@@MrNagano00 how is the infra different between batch recommendation and realtime recommendation? also what is the model serving doing in the real-time recommendation system - is it for ranking only? why you need a vector database for the embeddings? dont you need to train an embedding model too?
can someone explain to me, how come you calculate USER embedding when training. And when searching for similar embeddings, you actually get ITEMS embeddings???
how does the model get generated for the batch recommendation service? does the embedding model for the realtime-recommendation only use userdata and not item data?
Wow. Amazing presentation on Machine Learning Infrastructure at scale! Thank you.
That was so good. Nice detail and example thank you so much for putting this up.
Incredible lecture! Thanks
Hello, could you share the code with me? There is no such codes in the github now
Great talk! I've got two questions:
1. Is it a real-time recommendation because upon user query it is enriched with historical data as well as real-time data(stored after each interaction)? Or something else like you are training embedding at near real time?
2. How often do you train Ranking model and does it only train with candidates?
From my understandings:
1. It's real time because it's capable to do one recommendation per user query without much delay. Instead of spotify that just gives you one recommendation once a week. So it's essential the same thing except the infrastructure required to be able to provide "real time" recommendation is different. But the ML models and processes you'd use to do one or the other would be largely the same. It's not about training it's about how speedy your evaluation is. Also, adding filters and so on, so overall it's a more complex form of interaction with the ML system.
2. You'd want to train your ranking model whenever you feel you've gotten enough data. There's not really a magic number here. As users interact with your website you'll get better data for positive/negative examples; it also depends on how much it costs for you to re-train your algo as well; if it's too expensive then maybe you just want to be re-training when necessary.
@@MrNagano00 how is the infra different between batch recommendation and realtime recommendation? also what is the model serving doing in the real-time recommendation system - is it for ranking only? why you need a vector database for the embeddings? dont you need to train an embedding model too?
Can't you get the score (ranking score | similarity score) while fetching items from the Vector DB? ..
what model does the batch recommendation use? some content based, collaborative filtering model?
can someone explain to me, how come you calculate USER embedding when training.
And when searching for similar embeddings, you actually get ITEMS embeddings???
how does the model get generated for the batch recommendation service? does the embedding model for the realtime-recommendation only use userdata and not item data?
Amazing, thank you