Lecture 10: Evaluation of Language Models, Basic Smoothing

  • Published 25 Aug 2024

COMMENTS • 16

  • @pawanchoure1289 · 2 years ago · +1

    One solution to probability density estimation is Maximum Likelihood Estimation, or MLE for short. It involves defining a parameter theta that specifies both the choice of probability density function and the parameters of that distribution, and then picking the value of theta that maximizes the likelihood of the observed data.
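
A minimal sketch of the idea in Python (not from the lecture): for a unigram language model, theta is just the vector of word probabilities, and the maximum likelihood solution has the closed form of relative frequency.

    from collections import Counter

    def mle_unigram(tokens):
        """MLE of a unigram model: theta_w = count(w) / total tokens."""
        counts = Counter(tokens)
        total = sum(counts.values())
        return {w: c / total for w, c in counts.items()}

    theta = mle_unigram("the cat sat on the mat".split())
    print(theta["the"])  # 2/6 ≈ 0.333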

  • @pawanchoure1289 · 2 years ago · +1

    Traditionally, language model performance is measured by perplexity, cross-entropy, and bits-per-character (BPC). As language models are increasingly being used as pre-trained models for other NLP tasks, they are often also evaluated based on how well they perform on downstream tasks.
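
As a rough illustration of how these three metrics relate (toy numbers, not from the lecture): perplexity is 2 raised to the cross-entropy in bits per token, and bits-per-character is the same negative log-likelihood divided by the character count instead of the token count.

    import math

    def cross_entropy_bits(token_probs):
        """Average negative log2 probability per token."""
        return -sum(math.log2(p) for p in token_probs) / len(token_probs)

    probs = [0.1, 0.2, 0.05]       # made-up model probabilities of each test token
    H = cross_entropy_bits(probs)  # cross-entropy, in bits per token
    print(2 ** H)                  # perplexity = 2^H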

  • @louerleseigneur4532 · 4 years ago · +2

    Thanks sir

  • @pawanchoure1289 · 2 years ago

    Perplexity is a measurement of how well a probability model predicts a sample. In the context of Natural Language Processing, perplexity is one way to evaluate language models.

  • @pawanchoure1289 · 2 years ago

    A 2-gram (or bigram) is a two-word sequence, like “I love”, “love reading”, or “Analytics Vidhya”, and a 3-gram (or trigram) is a three-word sequence, like “I love reading”, “about data science”, or “on Analytics Vidhya”.
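
A minimal sketch of n-gram extraction (Python; the function name is mine):

    def ngrams(tokens, n):
        """All contiguous n-word sequences in a token list."""
        return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

    words = "I love reading about data science".split()
    print(ngrams(words, 2))  # bigrams: ('I', 'love'), ('love', 'reading'), ...
    print(ngrams(words, 3))  # trigrams: ('I', 'love', 'reading'), ...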

  • @pawanchoure1289 · 2 years ago

    In information theory, perplexity is a measurement of how well a probability distribution or probability model predicts a sample. It may be used to compare probability models. A low perplexity indicates the probability distribution is good at predicting the sample.

  • @pawanchoure1289 · 2 years ago

    The Shannon Visualization Method:
    1. Choose a random bigram (<s>, w) according to its probability.
    2. Now choose a random bigram (w, x) according to its probability.
    3. And so on, until we choose </s>.
    4. Then string the words together.
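
A minimal sketch of this sampling procedure (Python; the bigram probabilities are made up for illustration):

    import random

    # Toy bigram model: P(next word | current word).
    bigrams = {
        "<s>":     {"I": 0.6, "the": 0.4},
        "I":       {"love": 1.0},
        "love":    {"reading": 0.5, "</s>": 0.5},
        "reading": {"</s>": 1.0},
        "the":     {"cat": 1.0},
        "cat":     {"</s>": 1.0},
    }

    def shannon_sentence(model):
        word, out = "<s>", []
        while True:
            # Steps 1-2: pick the next bigram according to its probability.
            nxt = random.choices(list(model[word]), weights=model[word].values())[0]
            if nxt == "</s>":          # Step 3: stop at the sentence-end token.
                return " ".join(out)   # Step 4: string the words together.
            out.append(nxt)
            word = nxt

    print(shannon_sentence(bigrams))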

  • @pawanchoure1289 · 2 years ago

    The term smoothing refers to the adjustment of the maximum likelihood estimator of a language model so that it will be more accurate. ... When estimating a language model based on a limited amount of text, such as a single document, smoothing of the maximum likelihood model is extremely important.
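
The simplest such adjustment is add-one (Laplace) smoothing; a minimal sketch in Python (names are mine):

    from collections import Counter

    def laplace_unigram(tokens, vocab):
        """Add one to every vocabulary word's count, so unseen words
        get nonzero probability instead of the MLE's zero."""
        counts = Counter(tokens)
        total = sum(counts.values()) + len(vocab)
        return {w: (counts[w] + 1) / total for w in vocab}

    p = laplace_unigram("the cat sat".split(), vocab={"the", "cat", "sat", "dog"})
    print(p["dog"])  # (0 + 1) / (3 + 4) ≈ 0.143, not zero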

  • @pawanchoure1289 · 2 years ago

    Perplexity is the inverse probability of the test set, normalized by the number of words. In the case of unigrams: PP(W) = P(w_1 w_2 … w_N)^(-1/N) = (Π_i 1/P(w_i))^(1/N). Now, suppose you have already constructed the unigram model, meaning that for each word you have the relevant probability.
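
A minimal sketch of that computation (Python; the probabilities are made up), working in log space for numerical stability:

    import math

    def perplexity(test_tokens, unigram_probs):
        """PP(W) = (product over i of 1/P(w_i))^(1/N)."""
        N = len(test_tokens)
        log_sum = sum(math.log(unigram_probs[w]) for w in test_tokens)
        return math.exp(-log_sum / N)

    model = {"the": 0.4, "cat": 0.3, "sat": 0.3}     # made-up unigram probabilities
    print(perplexity("the cat sat".split(), model))  # ≈ 3.03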

  • @pawanchoure1289 · 2 years ago

    What is extrinsic and intrinsic evaluation?
    In an intrinsic evaluation, the quality of an NLP system's outputs is evaluated against predetermined ground truth (reference text), whereas an extrinsic evaluation aims to evaluate the outputs based on their impact on the performance of other NLP systems.

  • @pawanchoure1289 · 2 years ago

    Unigram prior smoothing: instead of adding the same constant to every bigram count, add pseudo-counts in proportion to each word's unigram probability: P(w_i | w_{i-1}) = (c(w_{i-1} w_i) + m · P(w_i)) / (c(w_{i-1}) + m).
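
A minimal sketch of that formula for bigrams (Python; the pseudo-count m and the helper names are mine):

    from collections import Counter

    def unigram_prior_bigram(tokens, m=1.0):
        """P(w2 | w1) = (c(w1 w2) + m * P(w2)) / (c(w1) + m),
        so unseen bigrams fall back on the unigram distribution."""
        uni = Counter(tokens)
        bi = Counter(zip(tokens, tokens[1:]))
        total = len(tokens)
        return lambda w1, w2: (bi[(w1, w2)] + m * uni[w2] / total) / (uni[w1] + m)

    p = unigram_prior_bigram("the cat sat on the mat".split())
    print(p("the", "cat"))  # seen bigram
    print(p("the", "sat"))  # unseen bigram still gets mass from the prior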

  • @divyanshukumar2605 · 3 years ago · +5

    Never goes into depth on any concept; he just says a bunch of technical words without explaining them explicitly, and even the explanations are copied word for word from Dan Jurafsky's lectures.

  • @sumonchakrabarty6805 · 1 year ago · +3

    Worst teacher I have ever seen in my life. He doesn't even know English properly, and his vocabulary is worse. Professors like this should be fired from the IITs immediately. They are polluting the teaching process...

  • @divyanshukumar2605 · 3 years ago · +1

    A third-grade teacher; he should be teaching 5th graders.