Hotel Reviews Sentiment Analysis In python|NLP Sentiment analysis in Python

Поділитися
Вставка
  • Опубліковано 27 сер 2024
  • Hotel Reviews Sentiment Analysis In python|NLP Sentiment analysis in Python
    #SentimentAnalysisInPython #NLPSentimentAnalysis #UnfoldDataScience
    Hi ,
    This is Aman and I am a data scientist.
    About this video:
    This video is about sentiment analysis of hotel reviews in python. Below questions are answered in this video:
    1. How to do sentiment analysis in python
    2. What is Sentiment analysis in python
    3. Text Sentiment analysis in python
    4. Sentiment analysis using python for text data
    5. NLP sentiment analysis in python
    About Unfold Data science: This channel is to help people understand basics of data science through simple examples in easy way. Anybody without having prior knowledge of computer programming or statistics or machine learning and artificial intelligence can get an understanding of data science at high level through this channel. The videos uploaded will not be very technical in nature and hence it can be easily grasped by viewers from different background as well.
    Join Facebook group :
    www.facebook.c...
    Follow on medium : / amanrai77
    Follow on quora: www.quora.com/...
    Follow on twitter : @unfoldds
    Get connected on LinkedIn : / aman-kumar-b4881440
    Follow on Instagram : unfolddatascience
    Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
    Watch python for data science playlist here:
    • Python Basics For Data...
    Watch statistics and mathematics playlist here :
    • Measures of Central Te...
    Watch End to End Implementation of a simple machine learning model in Python here:
    • How Does Machine Learn...
    Learn Ensemble Model, Bagging and Boosting here:
    • Introduction to Ensemb...
    Access all my codes here:
    drive.google.c...
    Have question for me? Ask me here : docs.google.co...
    My Music: www.bensound.c...

КОМЕНТАРІ • 235

  • @UnfoldDataScience
    @UnfoldDataScience  2 роки тому

    Access Hindi, English courses here- www.unfolddatascience.com/s/store
    Plz register on the website

  • @siddheshshanker4162
    @siddheshshanker4162 3 роки тому +3

    Very well explained in short and concise manner. Didn’t knew about the pipeline module. Learnt an extra thing along with NLP implementation.

  • @yash422vd
    @yash422vd 3 роки тому +3

    Your videos are so straight to the point, the concept is nailed so very well, that it is registered in mind at once. Thanks!

  • @nevilparekh6400
    @nevilparekh6400 3 роки тому +1

    I dont usually like or comment any video but after going through yours i cant resist myself to appreciate your efforts.
    Short and sweet... very good work... keep it up...

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому +1

      Thanks a lot Nevil. Your comments motivate me.
      Please share within your data science groups if you find it useful.

  • @thshantakumar
    @thshantakumar Рік тому +8

    The reason for the last incorrect prediction is that 'not' and 'happy' are considered as separate words. Hence, the word 'happy' has not been negated which is expected. If we consider 2-gram then it might improve the prediction. Amazing video. You just got a subscriber. Keep on Unfolding Data Science.

    • @UnfoldDataScience
      @UnfoldDataScience  Рік тому

      Thanks a lot. When audience gets involved with the content, that's real happiness for me.

    • @shironpereira2641
      @shironpereira2641 Рік тому

      @@UnfoldDataScience do you have any lessons on how to implement 2 gram

  • @bennurakash
    @bennurakash 3 роки тому +1

    Thank you very much and the way you teach us is mind blowing.

  • @vishnujatav6329
    @vishnujatav6329 3 роки тому +2

    Really helpful

  • @antoniusarya
    @antoniusarya 3 роки тому +1

    Thanks a lot, man, this video helps me so much for my project!

  • @akashprabhakar6353
    @akashprabhakar6353 3 роки тому +1

    thanks...nice and simply explained

  • @anirbansarkar6306
    @anirbansarkar6306 3 роки тому +1

    Thanks a lot Aman. I would request you to please come up with more such tutorials (if your time permits) wherein you can show your thought process behind ML models that you create with kaggle datasets or for kaggle competitions. I am requesting this to you, as I think your videos is helping me a lot in learning and developing concepts in data science. Thanks Again😊😊

  • @GenerativeAI-AI
    @GenerativeAI-AI Рік тому

    We can use Negation handling and BERT technique like word flipping for to avoid "not","never" ...

  • @princevegeta7921
    @princevegeta7921 4 роки тому +3

    Bro why did you not use stop word removal, lemmatization and stemming?
    Other than that, a really good video.
    Thank you for this.

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      Thanks prince for your feedback. Tf idf does few things internally but yes advisable to do by self. Happy learning, stay safe!!

  • @dipannita7436
    @dipannita7436 4 роки тому

    No doubt, very clear and nice...

  • @vladimir_egay
    @vladimir_egay 4 роки тому +1

    Yo! Nice tutorial bro!
    And thanks for sharing your code.
    Make more videos on a similar concept.
    Respect!

  • @salikmalik7631
    @salikmalik7631 3 роки тому

    Awesome video, I am doing project on sentiment analysis it helped me alot.

  • @__mothership__8475
    @__mothership__8475 2 роки тому

    Here you did cleaning and feature engineering first then train test split but in the recent video you said we should first do train test split then all the cleaning and FE to protect from Data leakage ?

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому

      Just to save time, Ideally, break first then do anything.

  • @kaleabwoldemariam4288
    @kaleabwoldemariam4288 3 роки тому

    Very good, excellently presented. I wish you could have explained behind the scene activities of vectorizer and logistic classifier. I like it!

  • @manojmholagi7035
    @manojmholagi7035 2 роки тому

    When u do cleaning of description column using function, how does it exactly take the description column??

  • @yonasgido7560
    @yonasgido7560 3 роки тому +1

    Thank you,its best video.I have one question for you how i can compare more than one text vectors by using cosine similarity ,and display the cosine score in pd.dataframe.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому +1

      Hi Yonas, use the cosine similarity formula from python

  • @sadhnarai8757
    @sadhnarai8757 4 роки тому +1

    Very nice AMan

  • @ayencoscolfield3312
    @ayencoscolfield3312 3 роки тому +1

    Nice video and nice work ,well-done Please can you make video on sentiment analysis on suicidal ideation(classification) using twitter data

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Thanks. Similar concept can be applied for your problem statement as well.

    • @ayencoscolfield3312
      @ayencoscolfield3312 3 роки тому

      @@UnfoldDataScience oh that is good, thanks man

  • @preranatiwary7690
    @preranatiwary7690 4 роки тому +1

    Good one!

  • @sandipansarkar9211
    @sandipansarkar9211 2 роки тому

    FINISHED WATCHING

  • @moosa1003
    @moosa1003 2 роки тому +1

    Can you tell me which algorithms and which libraries are used in this project?

  • @harini2933
    @harini2933 2 роки тому +1

    Please let me know why didn't you do Tokenization and Stemming/Lemmatization in this project even though it is an NLP proj? Thanks in advance :)

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому +1

      Here just to keep video short, j may have skipped but it's good practice to do it always.

  • @anshuaravaryan2842
    @anshuaravaryan2842 Місяць тому

    What it means when length of test, train & split comes (1,1,1) ? how to fix that. Please help.

  • @beib2281
    @beib2281 4 роки тому +1

    Hello, thank you for your video and for sharing your knowledge
    Can you please explain why use a random state of 225 ? why not only put 1 ?

  • @nikhilgupta4859
    @nikhilgupta4859 3 роки тому

    Hi Aman - I have started this playlist today and I am highly impressed with the way you explain things.
    I have one query.. We have text data as well as corresponding ratings on websites.We use this data as training data.
    So why we need sentiment analysis tool??
    For which data we need predictions, if we already have text & ratings on every website.
    Please respond.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Hi Nikhil, this is a good question. In this example, we have the rating however we might not have rating in all real world use case.
      We create rating based on our business understanding and then use it to train model.
      Later, we can use trained model for rating prediction for new examples.

    • @nikhilgupta4859
      @nikhilgupta4859 3 роки тому

      @@UnfoldDataScience Thank You Aman for responding. As per my limited knowledge I can only see that in every field we have ratings data such as retail, movie, songs, even after customer care call we give ratings.
      Still not clear on which data a company need to predict ratings.
      Its a request if you can answer me or provide some use case on my email nikhilgpt23@gmail.com.
      I have completed your timeseries playlist as well. Will post my questions there. Superb work!!

    • @nishah4058
      @nishah4058 2 роки тому

      @@nikhilgupta4859 hii..I also had the same qsn...did u get the answer of it?

    • @nikhilgupta4859
      @nikhilgupta4859 2 роки тому

      @@nishah4058 No I haven't got any response..If you get it please post it here as well :)

    • @nishah4058
      @nishah4058 2 роки тому

      @@nikhilgupta4859 I get to know by thinking on it... actually it's for hotels chain(who own the hotel) in case of hotel review that what is their market reputation...how customers like their services etc...if the company itself read no of reviews by itself which can't be practical so there is a need of sentiment analysis when one business or organisation want to know their reputation,why they are lacking with others,what they can improve in their system...this is the main reason to do sentiment analysis.i think so

  • @GopiKumar-ny3xx
    @GopiKumar-ny3xx 4 роки тому

    Very nice presentation..

  • @attaulhaq459
    @attaulhaq459 Рік тому

    plz give the roadmap of data science?

  • @shailendra9292
    @shailendra9292 3 роки тому

    Thanks Aman for the wonderful explaination !
    One question to ask, why did you use 'lbfgs' within Logistic model not 'newton-cg' or any other method?

  • @pavanjoshi7578
    @pavanjoshi7578 4 роки тому

    Thanks.easily explained.

  • @arjunrampal3274
    @arjunrampal3274 4 роки тому +1

    I'm complete ur task..
    If you are increasing train dataset..
    " I'm not happy" this will give u "not happy " result.

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому +2

      Hi Arjunram, Even it it gives, the approach that we took in this analysis is a basic one. Our solution should take care of many things in real world applications, hence better approach should be taken.
      I am very happy that you tried the use case and coming up with queries. Happy Learning. Tc

  • @robotdream8355
    @robotdream8355 2 роки тому

    Hi Great Explanation! Any update for "I am not happy?" such kind of reviews?

  • @ashokpal-gw7yc
    @ashokpal-gw7yc 4 роки тому

    Very good sir

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      Thanks Ashok, Keep watching. Happy learning. stay Safe!

  • @RAY-fk3tv
    @RAY-fk3tv 4 роки тому +1

    I think using a neural network for the same can improve accuracy. Can you make a video for that?

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      HI Ray, thaks for your feedback. Yes definitely it can help. I will create a video on this topic using same data and check.

    • @ayencoscolfield3312
      @ayencoscolfield3312 3 роки тому

      @@UnfoldDataScience Yea will surely love to see from you video on this because you are such a great teacher, besides with neural network we can get such a great result in terms of accuracy

  • @shubhamrauniyar5641
    @shubhamrauniyar5641 2 роки тому

    kindly provide the link next part of this video where u solved this problem of strong word ..

  • @hashirsheikh8386
    @hashirsheikh8386 2 роки тому

    i can't find this code in your drive. there is a code in npl hybrid model which is related to semtiment analysis but the dataset you used in that is a customer review from a flight.

  • @farhadkhan3893
    @farhadkhan3893 2 роки тому

    Hello sir, why didn't you apply steaming and tokenization ?

  • @ShaidaMuhammad
    @ShaidaMuhammad 4 роки тому

    Very nice

  • @nriezedichisom1676
    @nriezedichisom1676 4 місяці тому

    Why didn't you change the independent variables to numbers eg 0 and 1

  • @miandaniyalkhan
    @miandaniyalkhan Рік тому

    The link doesn’t have this file, please update the link, i am waiting

  • @nithyashreev3404
    @nithyashreev3404 4 роки тому

    Thank you very much for the video!Which algorithm have you used for sentiment analysis here?

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      You are welcome, I have created data using TF-IDF and then use classification algorithm :)

    • @hashirsheikh8386
      @hashirsheikh8386 2 роки тому

      @@UnfoldDataScience hello sir can you please tell me which classification algorithm you have used in it?

  • @chemistryforbeginners4709
    @chemistryforbeginners4709 19 днів тому

    I don't think you used TF-IDF function, hence, happy was not masked very well in the data

  • @olaali6920
    @olaali6920 Рік тому

    could you please make video on project of analysing google play store using LDA for text mining and sentiment analysis? As I work on same project and face some problems in applying

  • @apurvtanay6191
    @apurvtanay6191 3 роки тому +1

    What if I have a dataset which only has reviews and not description. For example, how do I go about doing sentiment analysis when I am given only the reviews in the dataset.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      You can create a label by yourself for a subset of reviews and use it for model training.

  • @Vipulghadi
    @Vipulghadi Рік тому

    we can use n grams

  • @vinaytandel4539
    @vinaytandel4539 3 роки тому

    Very well explained. Thank you. I am just wondering that how can perform the operations in python by triggering it from my angular project and display result there. May I seek assistance on it please?

  • @VinayKumar-gw6dd
    @VinayKumar-gw6dd 4 роки тому

    Nice video

  • @kranthi9314
    @kranthi9314 3 роки тому

    How to get the Is_Response===> happy or unhappy in the dataset if that doesn't predefined in dataset already

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому

      Good question, we need to create a target varibale based on many things. sometimes manual process

  • @Shrey-Patel
    @Shrey-Patel 2 роки тому

    can you make this for multiple languages? btw this tutorial was awesome

  • @dhilipmaharish
    @dhilipmaharish 4 роки тому +1

    Hello u said applying sentiment analysis but u applied only TFidf vector alone

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      Hi Dhilip, yes good point. I have not used any existing packages but derived features from text and then training model.

  • @sandipsinha2537
    @sandipsinha2537 11 місяців тому

    in the google drive i cannot find the code of this sentiment analysis using logistic Regression. please specify me the path

  • @AK-ud4ur
    @AK-ud4ur 3 роки тому

    Hi Aman, Is there a way to handle "NOT GOOD" as "BAD". i.e negation of words.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому +1

      Yes Abhijit, that is where "context based " learning comes into picture.

  • @SaiKrishna-fs8oo
    @SaiKrishna-fs8oo 3 роки тому +1

    Bro I don't know location of file to download hotel review project , please help me, today I want to submit my project in my clg

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      drive.google.com/drive/folders/1XdPbyAc9iWml0fPPNX91Yq3BRwkZAG2M

  • @tonymathewjohn5958
    @tonymathewjohn5958 2 роки тому

    which algorithm is used?

  • @dhanashribhoite8493
    @dhanashribhoite8493 2 роки тому

    In NLP classification problem sampling is required

  • @harini2933
    @harini2933 4 роки тому

    please do more videos on NLP . thank you

  • @shravan949
    @shravan949 4 роки тому

    Hello, can you explain the reason behind, why we are getting as happy result on providing the review as "I'm not happy".
    I have seen in the comments, better approach should be taken, can you please explain which is the better approach.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому +1

      Hi Shravan, that is a very good question. What i have demonstrated here is a very simple approach for NLP problems. This is not what is recommended for real world use cases. Here we are taking "frequency based approach" where the semantic meaning of word is sentence is not captured hence model gives result unexpected way as the example you gave.
      Better approach will be, using Word embeddings/Prediction based models/sequence based Deep learning models where we can capture sematic meaning of a sentence.

  • @yagyarajbhatta2444
    @yagyarajbhatta2444 4 роки тому

    Sir why you won't use tokenization and lemmatization in the preprocessing step?🤔
    Can you guide?

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Hi Yagya, here I might not have used however those are recommended way of cleaning text data. good point noticed.

  • @trivenikohale9428
    @trivenikohale9428 3 роки тому +1

    sir which algorithms did you used here

  • @akshaysawant9518
    @akshaysawant9518 3 роки тому

    Please make a video on hybrid features we have a BE project - sentimental analysis using hybrid feature

  • @sahitkeshatti5463
    @sahitkeshatti5463 10 місяців тому

    i am not able to access to data set can u attach a link

  • @JP-fi1bz
    @JP-fi1bz 3 роки тому +1

    Why didn't you balance the label data?

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Good question, it should be done. Here in this demo I did not follow all steps as I wanted to demonstrate things at high level.

    • @JP-fi1bz
      @JP-fi1bz 3 роки тому

      @@UnfoldDataScience ok please tell the technique which can be used to balance text data/in nlp

  • @misbahuddinmahmud5456
    @misbahuddinmahmud5456 3 роки тому

    You should have to handle negation scope to improvise accuracy

  • @attaulhaq459
    @attaulhaq459 Рік тому

    from which file this project is saved?

  • @anirbansarkar6306
    @anirbansarkar6306 3 роки тому

    I think to improve we can go for bi-gram or tri-gram to get better understanding of the concept

  • @MrAnandml
    @MrAnandml 3 роки тому

    Can we use countvectorizer instead of TfidfVectorizer

  • @saimanikanta2091
    @saimanikanta2091 Рік тому

    hello sir,i got this error AttributeError: 'DataFrame' object has no attribute 'Is_Response' when i am doing this

  • @faiitech5185
    @faiitech5185 3 роки тому

    how to display model performance metrics

  • @maksudalom8627
    @maksudalom8627 3 роки тому

    The system understand the meaning of "Don't" but Don't understand the meaning "do not". I find that reason. The maim fact is Don't is represent as a single word which obtained a single index in vector but do not split out in vector and abstain separate index. That's why that kind of problem arise

  • @anuragchandnani8037
    @anuragchandnani8037 4 роки тому +1

    my session crashed (utilized all the RAM) after the confusion matrix line, is there any particular reason ? how do i resolve the issue ?

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому +1

      Hi Anurag, can u share the screenshot in email?

    • @anuragchandnani8037
      @anuragchandnani8037 4 роки тому

      @@UnfoldDataScience I have shared the code to this email - amanrai77@gmail.com

  • @SureshBabu-wi4kx
    @SureshBabu-wi4kx 3 роки тому

    Will help me to know sentence classification using rnn

  • @sanaasherif9134
    @sanaasherif9134 2 роки тому

    got an error: AttributeError: 'Series' object has no attribute 'lower' while doing the first layer of cleaning

  • @pursueweazmarecoverall486
    @pursueweazmarecoverall486 2 роки тому

    Greetings Sir, do you still take questions?

  • @shiridisaivadla2235
    @shiridisaivadla2235 4 роки тому +1

    Cant we do the same thing with ML ??

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      It is ML only. The only difference is how we extract features from text.

  • @yogeshpandey730
    @yogeshpandey730 4 роки тому

    amazing explaination, whatever you explain it got stuck to my mind. Thank you sir ,go hard

  • @anunoy2011
    @anunoy2011 3 роки тому

    Getting an error "'Series' object has no attribute 'lower'
    " line 34, in text_clean_1
    text = text.lower().

  • @colabwork1910
    @colabwork1910 3 роки тому

    Hey, great video. Can you please upload this code file in your google drive? there is not in your google drive.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      drive.google.com/drive/folders/1XdPbyAc9iWml0fPPNX91Yq3BRwkZAG2M

  • @onyee0170
    @onyee0170 3 роки тому

    Hi, Please I am carrying out a research on online reviews for hotels. Please how do I pull these data for data collection, since copy and paste wouldn't work? Thanks.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Hi, you need to use some web scrapping method. Take this code as reference and change based on your need.

    • @onyee0170
      @onyee0170 3 роки тому

      @@UnfoldDataScience Hello, thank you for your response. Please I cannot find the code. Kindly assist.

  • @theknightsoldier
    @theknightsoldier 2 роки тому

    Can we get over all project to download link sir....

  • @vishalrai2859
    @vishalrai2859 3 роки тому

    great video sir
    can you give me the link of kaggle dataset

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      drive.google.com/drive/folders/1XdPbyAc9iWml0fPPNX91Yq3BRwkZAG2M

  • @janakirajkumar636
    @janakirajkumar636 4 роки тому

    What is the next level for research work...in this sir?

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому

      Hi Janaki, many research work is going on, Join me live sunday 4PM IST, we can discuss mre.

  • @rajatbadade20
    @rajatbadade20 2 роки тому

    How to check the real and fake reviews

  • @rachnamalik7605
    @rachnamalik7605 Рік тому

    where i can get train dataset

  • @kumarashutosh5098
    @kumarashutosh5098 4 роки тому

    Sir, need some guidance on NLP.

  • @shilpatb9166
    @shilpatb9166 3 роки тому

    Sir can we do this for regional language?

  • @NotoriousGarage
    @NotoriousGarage 3 роки тому

    ⭕️⭕️❌❌‼️‼️5:03 I am getting Empty in column of cleaned description AFTER applying text clean 1‼️‼️‼️⁉️⁉️⭕️⭕️🚫🚫❌❌

  • @colabwork1910
    @colabwork1910 3 роки тому

    I am facing same problem. (I am not good) give me positive sentiment in my project. How to solve it?
    Thank you

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      That is exactly where sequence based models like LSTM comes into picture :)

  • @akkuakbar4450
    @akkuakbar4450 3 роки тому

    Hello sir... Is it possible to scrap data from google play store app reviews

  • @salikmalik7631
    @salikmalik7631 3 роки тому

    but is it necessary to create a new column?
    I am confused

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Which part of the video you are talking Salik? Can u point me pls.

    • @salikmalik7631
      @salikmalik7631 3 роки тому

      @@UnfoldDataScience you create a new column after cleaning the text. that's what I am talking about...

  • @rajeshbhattacharjee2811
    @rajeshbhattacharjee2811 3 роки тому

    how to apply here SVM?

  • @maYYidtS
    @maYYidtS 3 роки тому

    can you please reply with an exact answer...if possible.
    i have around 27000 customer reviews. but I don't have ratings for those reviews.
    i need to generate ratings or sentiment(+ve or -ve) for each review. or I need to classify those reviews.
    how would I do...
    is there any possibility to cluster those reviews into two clusters? and consider as +ve and -ve reviews. if yes please describe the approach.
    if not..please.
    describe your approach.
    thanks.

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому +1

      Answered in email :)

    • @ANKITSINGH-fv1ow
      @ANKITSINGH-fv1ow 3 роки тому

      @@UnfoldDataScience can you share the with me also

    • @afeezlawal5167
      @afeezlawal5167 2 роки тому

      @@ANKITSINGH-fv1ow first apply unsupervised technic to the problem, like kmeans or hierarchical, we can then train our model on that.

  • @rohitkumar-nk6sd
    @rohitkumar-nk6sd 3 роки тому

    How to change is_response to numbers as 0 or 1?

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      Hi Rohit , If you want to change, you can do something like this
      df.loc[df["is_response"]== "positive", "is_response"] = 1

    • @rohitkumar-nk6sd
      @rohitkumar-nk6sd 3 роки тому

      @@UnfoldDataScience I did it through get_dummies() anyways thanks for replying 🙏

  • @jahanurbiswas7816
    @jahanurbiswas7816 3 роки тому

    Can we get the dataset and code ? It will be very helpful for us

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      drive.google.com/drive/folders/1XdPbyAc9iWml0fPPNX91Yq3BRwkZAG2M

    • @sahitkeshatti5463
      @sahitkeshatti5463 10 місяців тому

      its not there
      @@UnfoldDataScience

  • @anirbansarkar6306
    @anirbansarkar6306 3 роки тому

    Hi Aman,
    Can you please explain what all are getting removed because of the below line?
    text = re.sub('\[.*?\]', '', text)

    • @anirbansarkar6306
      @anirbansarkar6306 3 роки тому +1

      actually I am confused with the .*? part in the expression. What are they used for in this expression?

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому +1

      . * means anything. Bigger bracket means any of the existing thing inside the bigger bracket. For example [ABC] means if string has either A or B or C.

    • @anirbansarkar6306
      @anirbansarkar6306 3 роки тому

      ​@@UnfoldDataScience Thank you Aman for your reply. Okay so here the target is not to just remove the square brackets ('[]') but also to remove whatever is inside those brackets, is it so?
      Like for e.g. txt = ab[egg]ii, will become 'abii', we don't want 'abeggii'
      But still I have a small confusion, like here,
      text = re.sub('\[.*?\]', '', text)
      won't it be sufficient to add '.*' inside brackets. Why we are adding '?'
      too?
      What if we would have written
      text = re.sub('\[.*\]', '', text).
      Please help me to find out if I went wrong or missed out the real meaning of this statement somewhere.

  • @nikhilgahlaut3849
    @nikhilgahlaut3849 4 роки тому

    help me i want to use naive bayes algorithm and multinomialNB too!!!! how can i implement in it?

    • @UnfoldDataScience
      @UnfoldDataScience  4 роки тому +1

      Hi Nikhil, both of these can be applied using python package in a similar way. try from your side, if unable to do, let me know. Thanks. tc

    • @nikhilgahlaut3849
      @nikhilgahlaut3849 4 роки тому

      @@UnfoldDataScience SIR i've tried but it is always showing error

    • @nikhilgahlaut3849
      @nikhilgahlaut3849 4 роки тому

      please help

    • @divyanshsomvanshi4727
      @divyanshsomvanshi4727 4 роки тому

      @@nikhilgahlaut3849 did you succeed on implementing native Bayes?

  • @forhadhossain6087
    @forhadhossain6087 3 роки тому +1

    Can anyone give me this jupiter notebook file?

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      You can go to "about" Section in my UA-cam channel, scroll down , you will get the link

  • @theknightsoldier
    @theknightsoldier 2 роки тому

    Can we get link to download plz

  • @komalyadav9227
    @komalyadav9227 3 роки тому

    how to give path of downloaded dataset?

    • @UnfoldDataScience
      @UnfoldDataScience  3 роки тому

      drive.google.com/drive/folders/1XdPbyAc9iWml0fPPNX91Yq3BRwkZAG2M