Part Of Speech POS Tagging: NLP Tutorial For Beginners - S1 E11

AI Is Making You An Illiterate Programmer

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

вернулись в ПРОШЛОЕ 🔃 | WICSUR #shorts

Сестра обхитрила!

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Stemming and Lemmatization: NLP Tutorial For Beginners - S1 E10

codebasics

Переглядів 70 244

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 1 лют 2025

КОМЕНТАРІ • 43

@codebasics 9 днів тому
Folks, here's a link to our bootcamp for learning AI and Data Science in the most practical way: tinyurl.com/395u4mnm
@Breaking_Bold Рік тому ⁺²
I love the way you explain - other NLP concepts - customizing the pipeline for example !!!
@ayushgupta80 10 місяців тому ⁺⁵
Stemming (removing something) vs Lemmatization ( mapped with base word) 4:50
Note : Spacy don't have support of stemming .
Code : stemming
import nltk
import spacy
from nltk.stem import PorterStemmer
stemmer = PorterStemmer()
words = ["eating","eats","eat","ate","adjustable","rafting","ability","meeting"]
for word in words:
print(word,"|",stemmer.stem(word))
--------------------------------------------------------------------------------
Code : lemmatization
nlp = spacy.load("en_core_web_sm")
doc = nlp("eating eats eat ate adjustable rafting ability meeting better")
for token in doc:
print(token,"|",token.lemma_,"|",token.lemma)
-----------------------------------------------------------------------------------------
Custom lemmatization
Code :
ar = nlp.get_pipe('attribute_ruler')
ar.add([[{"TEXT":"Bro"}],[{"TEXT":"Brah"}]],{"LEMMA":"Brother"})
doc =nlp("Bro, you wanna go ? Brah , don't say no ! I am exhausted")
for token in doc:
print(token.text,"|",token.lemma_)
@jatinnandwani6678 Рік тому
Thanks so much
@amandaahringer7466 2 роки тому ⁺¹
Very helpful! Looking forward to the rest of the series! Thank you!
@belfloretkoriciza5279 2 роки тому ⁺¹
you are my teacher and i am proud of you
@codebasics 2 роки тому
Thanks 🙏
@pphantom5037 5 місяців тому ⁺¹
There is a quiz now!! thank your for your awsome work♥♥♥
@Breaking_Bold Рік тому
Fantastic ...you make complex NLP topics simple. !!!
@aintgonhappen 2 роки тому
This is some quality content.
Thank you!
@amandaahringer7466 2 роки тому ⁺¹
8:36 I noticed that the prebuilt language pipelines return an unexpected lemma for "ate". I assumed that lg and trf pipelines would produce ate -> eat while the sm and md pipelines would produce ate -> ate, but that doesn't seem to be the case.
def eat_lemma(lang_pipeline):
nlp = spacy.load(lang_pipeline)
doc = nlp("ate")
print(lang_pipeline, '|', doc[0].lemma_)
lp = ["en_core_web_sm", "en_core_web_md", "en_core_web_lg", "en_core_web_trf"]
for lang_pipeline in lp:
eat_lemma(lang_pipeline)
en_core_web_sm | ['eat']
en_core_web_md | ['ate']
en_core_web_lg | ['eat']
en_core_web_trf | ['ate']
Update: I see that when "ate" is used in the context of a sentence each pipeline produces a lemma of "eat".
doc = nlp("The person ate an apple.")
en_core_web_sm | ['the', 'person', 'eat', 'an', 'apple', '.']
en_core_web_md | ['the', 'person', 'eat', 'an', 'apple', '.']
en_core_web_lg | ['the', 'person', 'eat', 'an', 'apple', '.']
en_core_web_trf | ['the', 'person', 'eat', 'an', 'apple', '.']
@arnavverma8622 2 роки тому
Excellent Series👌👌🔥🔥
@Malayalam_learner 3 місяці тому
Thanks a bunch ❤
@apurav363 3 місяці тому
Very helpful
@raphayzia9214 2 роки тому
Sir it will be very helpful if you make a NLP project like a Chatbot at the end of the series and thanks for making this series
@codebasics Місяць тому
In this playlist, end to end chatbot development project (using Google's Dialogflow framework) is added.
@rohanthite9711 23 дні тому
Sir will you please share ppts also , that will help in clearing the concepts
@aashishmalhotra 2 роки тому
If possible try to come with live sessions it would be helpful
@rajiv7 7 місяців тому
You are the excellent. Fullstop.
@sandeepnaik6437 2 роки тому ⁺⁵
What is Behavioural data science?
@MuhammadIBRAHIM-iy3rg 10 місяців тому
amazing videos
@vivekthumu8992 24 дні тому
❤Nice
@omarsalam7586 2 роки тому
thank you, sir
@muzaffariqbalraja6464 Рік тому
very nice
@berkayates6254 11 місяців тому
Hey Guys when we used stemming and lemmatizing before training the data we just change the words. After training the model model could generate words that are different from lemmatized words. I mean we teach the model `eat` however the model learn also `ate` how?
@JayShah-m1v Рік тому
Hey!
Firstly, this is a very good series. But for the exercise, in the last part using lemmatization, some of my words such as cooking were converted into cook and playing to play while running stayed as it is. Do you know what could be the issue?
Or do you have any explanation to this?
Thank you.
@agastyabose1645 11 місяців тому
it just might be how that specific model of nlp you used, performs. maybe idk
@GAURAVRAUL95 2 роки тому ⁺¹
Which one are you? Marc Spector or Steven Grant??
@codebasics 2 роки тому ⁺⁷
I am Dhaval, Marc and Steven are my alter egos 😎
@anaschoudhari511 2 роки тому
Hi sir a request for you to make some videos on python
@codebasics 2 роки тому ⁺¹
I have a python tutorial playlist with more than 40 videos. in youtube search "codebasics python tutorial"
@zaytech528 2 роки тому
hello sir, if i want to stem and lemmatize my string at the same time, how'd i do that? as spacy doesn't allow stemming. and nltk doesn't allow lemmatization. pls answer asap
@firdospathan3700 2 роки тому
I could not unable to install Ai4bharat package in PC.
Is there solution. For that error
@muradmammedzade2885 Рік тому
How to write Lemmatizer from scratch?
@Pride_Of_Ultras 2 роки тому
🤩
@Telugu-Tech-suport 2 роки тому
Sir last 1year EGO my pc hacked .gujd ransomwer please huw to get back my data 🙏 help mee please sum important data is ther
@leoxu1299 2 роки тому
Hey, aren't you the moon knight?
@codebasics 2 роки тому ⁺¹
Ha ha you are the third person to say this 🤣😎😎😎
@thoughtofme8263 2 роки тому
pleeeeeeeeeease try hindi speaking

Наступне

Автоматичне відтворення

Part Of Speech POS Tagging: NLP Tutorial For Beginners - S1 E11

Part Of Speech POS Tagging: NLP Tutorial For Beginners - S1 E11

AI Is Making You An Illiterate Programmer

AI Is Making You An Illiterate Programmer

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

o3-mini is the FIRST DANGEROUS Autonomy Model | INSANE Coding and ML Abilities

вернулись в ПРОШЛОЕ 🔃 | WICSUR #shorts

вернулись в ПРОШЛОЕ 🔃 | WICSUR #shorts

Сестра обхитрила!

Сестра обхитрила!

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

ВОТ ПОЧЕМУ Япония живет в будущем 🤫 Утилизация масла #япония #токио #путешествия #shorts

ВОТ ПОЧЕМУ Япония живет в будущем 🤫 Утилизация масла #япония #токио #путешествия #shorts

Tokenization in Spacy: NLP Tutorial For Beginners - S1 E8

Tokenization in Spacy: NLP Tutorial For Beginners - S1 E8

What is BERT? | Deep Learning Tutorial 46 (Tensorflow, Keras & Python)

What is BERT? | Deep Learning Tutorial 46 (Tensorflow, Keras & Python)

AI Battle Heats Up: West Tries to Crush China's DeepSeek | Vantage with Palki Sharma | N18G

AI Battle Heats Up: West Tries to Crush China's DeepSeek | Vantage with Palki Sharma | N18G

China’s Artificial Sun Creates Nuclear Fusion World Record | Vantage with Palki Sharma | N18G

China’s Artificial Sun Creates Nuclear Fusion World Record | Vantage with Palki Sharma | N18G

Language Processing Pipeline in Spacy: NLP Tutorial For Beginners - S1 E9

Language Processing Pipeline in Spacy: NLP Tutorial For Beginners - S1 E9

NVIDIA CEO Jensen Huang's Vision for the Future

NVIDIA CEO Jensen Huang's Vision for the Future

Text Representation Using Word Embeddings: NLP Tutorial For Beginners - S2 E7

Text Representation Using Word Embeddings: NLP Tutorial For Beginners - S2 E7

Text Preprocessing | tokenization | cleaning | stemming | stopwords | lemmatization

Text Preprocessing | tokenization | cleaning | stemming | stopwords | lemmatization

СПОРИМ ТЫ НЕ ЗНАЕШЬ ТРИ СЛОВА НА БУКВУ О? #shortsvideo #юмор #катяклон #comedy #прикол #мамадочка

СПОРИМ ТЫ НЕ ЗНАЕШЬ ТРИ СЛОВА НА БУКВУ О? #shortsvideo #юмор #катяклон #comedy #прикол #мамадочка

«Шнурки не зрізайте, акуратненько»: медик про реакцію військових на поранення #shorts

«Шнурки не зрізайте, акуратненько»: медик про реакцію військових на поранення #shorts

ФИЛЬМ! НЕВИНОВНЫЙ ГОТОВИТ ДЕРЗКИЙ ПОБЕГ С НЕПРИСТУПНОГО ОСТРОВА-ТЮРЬМЫ! Мотылёк! Русский фильм

ФИЛЬМ! НЕВИНОВНЫЙ ГОТОВИТ ДЕРЗКИЙ ПОБЕГ С НЕПРИСТУПНОГО ОСТРОВА-ТЮРЬМЫ! Мотылёк! Русский фильм

“Don’t stop the chances.”

“Don’t stop the chances.”

Нельзя смеяться | Смех с водой | 97 #shorts

Нельзя смеяться | Смех с водой | 97 #shorts

"Бажано відбити посадку без втрат": військовий розповів, як загибель побратимів впливає на психіку

"Бажано відбити посадку без втрат": військовий розповів, як загибель побратимів впливає на психіку

Удержаться на воде?? 🌊 #симбочкапимпочка #симбочка #симба

Удержаться на воде?? 🌊 #симбочкапимпочка #симбочка #симба

Wall Rebound Challenge 🙈😱

Wall Rebound Challenge 🙈😱