Metadata Filtering for Vector Search + Latest Filter Tech

3 Vector-based Methods for Similarity Search (TF-IDF, BM25, SBERT)

Mean Average Precision (mAP) | Explanation and Implementation for Object Detection

«Просив пробачення, що не уберіг Діму» - історія братів Василя Репчука і Дмитра Мурару #shorts

Тайское мороженое в Калининграде

ふわふわシフォン大作戦🩷スイーツ戦隊のキラキラミッション✨【銀座コージーコーナー】 #shorts #シフォンケーキ #クリスマスケーキ #クリスマス #ケーキ #チョコケーキ #christmas

Evaluation Measures for Search and Recommender Systems

James Briggs

Переглядів 12 591

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 8 лют 2025

КОМЕНТАРІ • 25

@aminghaderi1902 11 місяців тому ⁺³
Probably best explanation out there.
@parsakhavarinejad Рік тому ⁺¹
Clearly explained. Thank you
@goelnikhils 2 роки тому ⁺¹
Amazing Explanation. So clear. Very helpful
@anujlahoty8022 Рік тому ⁺¹
What a video, hats off!
@sriks4003 8 місяців тому ⁺¹
Very helpful, thank you!
@shrar837 2 роки тому ⁺⁴
Your videos are impressive and very informative mate. 👌
@jamesbriggs 2 роки тому
thanks!
@sumantjha8392 2 роки тому ⁺¹
Super informative and great..thanks
@Han-ve8uh 2 роки тому
1. I got confused at 18:29 when predicted is a nicely increasing sequence making me think are those ranks or item ids. I was also thinking whether the len of intersection act_set & pred_set could simply be len(act_set), then i realized this example here is a very special case where act_set is subset of pred_set. If act_set contains value 9, then we can't use len(act_set) alone and the formula in video is required.
2. Similar to question nikhil goel asked in comments section 2 weeks before this, where does 13:46 actual_relevant data come from? It looks manually labelled, and this labelling occurs per query making it super unscalable?.
3. Assuming we accept manual labelling how is the 0-4 range determined? I feel like drift is a problem, when todays 4 becomes tomorrows' 3 as value judgements change, does this mean relabelling all results again?
4. I noticed some metrics aggregate across queries and k, and some are only within 1 query across k, in what scenarios do we use each?
5. I didn't expect a *relk in AP@K formula, why do we ignore certain precision at certain k? Feels like artificially increasing metrics for the sake of it, which becomes ineffective if every query does it
@goelnikhils 2 роки тому ⁺²
Hi James, I have a question on NDCG or any other ranking aware metrics. How does these metrics work where you have millions of products/items. What I mean is if we have millions of items, then it means we have to first label (manually) all the million items for relevance /rank. And then when our model predicts we use NDCG. Isn't this a big drawback of NDCG. Can you please suggest what is better approach to rank if we don't have relevance labeled data. Thanks in
@miguelfsousa Рік тому
This video is great.
@HazemAzim Рік тому
Super nice .. Thanks
@vishalwaghmare3130 2 роки тому
Very helpful ❣️
@Data_scientist_t3rmi 2 роки тому
Good video !
@preetimehta1247 11 місяців тому
Hi , I have a query If I am working on a song recommendation project by using Spotify API data set, I have used models like cosine similarity, matrix factorization, knn , Latent Semantic Analysis (LSA) model, Correlation Distance method. Now I am confused about how should I approach for evaluation metric in this system.
@morannechushtan2101 Рік тому ⁺¹
21:23 Statistically there is probably a cat in the box on image 3
@tarikkarakas587 2 роки тому ⁺¹
Biggest problem is labeling the product whether it is relevant or not. It is not possible to label each search. Meanless if you can't handle with that.
@jamesbriggs 2 роки тому ⁺¹
Yeah data prep as usual with ML is the hard part, if you're interested in evaluation methods for IR *without* labeled data look into online metrics for eval (and training)
@Data_scientist_t3rmi 2 роки тому
IN MRR, when our search result doesnt inclued the result that we want, for your example if we want to search for cats and we find only dogs, how can we calculate MRR ? can we give it a big number for exemple rank 20 for all Not included results? 1/20
@jamesbriggs 2 роки тому ⁺¹
yes as you said - or use another metric that better fits to your scenario
@Data_scientist_t3rmi 2 роки тому
@@jamesbriggs Thank you for your answer
@joyeetamallik5063 2 роки тому
Hi James! can u make some vedios of updating Models if we Keep on getting data(e.g Biweekly)
@jamesbriggs 2 роки тому ⁺¹
cool idea! I'll add to the list :)
@mattygrows7667 2 роки тому ⁺¹
love your videos but why do you always seem so sad
@jamesbriggs 2 роки тому ⁺⁴
thanks! idk I'm happy I promise lol

Наступне

Автоматичне відтворення

Metadata Filtering for Vector Search + Latest Filter Tech

Metadata Filtering for Vector Search + Latest Filter Tech

3 Vector-based Methods for Similarity Search (TF-IDF, BM25, SBERT)

3 Vector-based Methods for Similarity Search (TF-IDF, BM25, SBERT)

Mean Average Precision (mAP) | Explanation and Implementation for Object Detection

Mean Average Precision (mAP) | Explanation and Implementation for Object Detection

«Просив пробачення, що не уберіг Діму» - історія братів Василя Репчука і Дмитра Мурару #shorts

«Просив пробачення, що не уберіг Діму» — історія братів Василя Репчука і Дмитра Мурару #shorts

Тайское мороженое в Калининграде

Тайское мороженое в Калининграде

ふわふわシフォン大作戦🩷スイーツ戦隊のキラキラミッション✨【銀座コージーコーナー】 #shorts #シフォンケーキ #クリスマスケーキ #クリスマス #ケーキ #チョコケーキ #christmas

ふわふわシフォン大作戦🩷スイーツ戦隊のキラキラミッション✨【銀座コージーコーナー】 #shorts #シフォンケーキ #クリスマスケーキ #クリスマス #ケーキ #チョコケーキ #christmas

Уличный боец с ДУХОМ воина

Уличный боец с ДУХОМ воина

Transformers (how LLMs work) explained visually | DL5

Transformers (how LLMs work) explained visually | DL5

Intro to Computer Science and Programming in Python 6 of 12

Intro to Computer Science and Programming in Python 6 of 12

Statistical Rethinking 2023 - 01 - The Golem of Prague

Statistical Rethinking 2023 - 01 - The Golem of Prague

Every Ranking Metric : MRR, MAP, NDCG

Every Ranking Metric : MRR, MAP, NDCG

Trends in Recommendation & Personalization at Netflix

Trends in Recommendation & Personalization at Netflix

Wayfair Data Science Explains It All: Evaluating Recommender Systems

Wayfair Data Science Explains It All: Evaluating Recommender Systems

François Chollet on OpenAI o-models and ARC

François Chollet on OpenAI o-models and ARC

Investigating the Periodic Table with Experiments - with Peter Wothers

Investigating the Periodic Table with Experiments - with Peter Wothers

Product Quantization for Vector Similarity Search (+ Python)

Product Quantization for Vector Similarity Search (+ Python)

🔥"СВОшник" РОЗНОСИТЬ шоу путіністів! Ведучий ШОКОВАНИЙ від цих СЛІВ #shorts

🔥"СВОшник" РОЗНОСИТЬ шоу путіністів! Ведучий ШОКОВАНИЙ від цих СЛІВ #shorts

СПОРИМ ТЫ НЕ ЗНАЕШЬ ТРИ СЛОВА НА БУКВУ О? #shortsvideo #юмор #катяклон #comedy #прикол #мамадочка

СПОРИМ ТЫ НЕ ЗНАЕШЬ ТРИ СЛОВА НА БУКВУ О? #shortsvideo #юмор #катяклон #comedy #прикол #мамадочка

"ВСЯ УЛИЦА полетела" - курянка про обстріли рф

"ВСЯ УЛИЦА полетела" — курянка про обстріли рф

МАФИЯ в РЕАЛЬНОЙ ЖИЗНИ: Дубровский, Позов, Мамикс, Катя Клэп, Егорик, Кадрол, Столяров, Масленников

МАФИЯ в РЕАЛЬНОЙ ЖИЗНИ: Дубровский, Позов, Мамикс, Катя Клэп, Егорик, Кадрол, Столяров, Масленников

СКОЛЬКО ИХ...?! #Shorts #Глент

СКОЛЬКО ИХ...?! #Shorts #Глент

ФИЛЬМ! НЕВИНОВНЫЙ ГОТОВИТ ДЕРЗКИЙ ПОБЕГ С НЕПРИСТУПНОГО ОСТРОВА-ТЮРЬМЫ! Мотылёк! Русский фильм

ФИЛЬМ! НЕВИНОВНЫЙ ГОТОВИТ ДЕРЗКИЙ ПОБЕГ С НЕПРИСТУПНОГО ОСТРОВА-ТЮРЬМЫ! Мотылёк! Русский фильм

Гениальное изобретение из обычного стаканчика!

Гениальное изобретение из обычного стаканчика!

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент