R - Sentiment Analysis and Wordcloud with R from Twitter Data | Example using Apple Tweets

  • Published 1 Oct 2024

COMMENTS • 498

  • @vijaymore1239
    @vijaymore1239 7 years ago +18

    Too good Video....your explanation is awesome sir....Thank you so much

    • @bkrai
      @bkrai  3 years ago

      You are welcome!

  • @SB-gw1wq
    @SB-gw1wq 2 years ago +1

    Can you please help? After doing the text preprocessing I am trying to build the tdm, but I am getting the error below.
    dtm

  • @amaanraza2774
    @amaanraza2774 3 years ago +7

    By far one of the best teachers of Data Science, who believes in keeping it simple. Absolutely love them.

    • @bkrai
      @bkrai  3 years ago

      Thanks for comments!

  • @msmanimanikandan7
    @msmanimanikandan7 5 years ago +2

    Hi, your video is good and very useful for me. Can you please explain how to get a file from Twitter? I am new to R. If you don't mind, I need some basic step-by-step videos covering the process from getting the data to building a word cloud and doing sentiment analysis in R. I am waiting for your reply.
    For example, I need Twitter data on big data in agriculture. I want to get a data sheet like that from Twitter and run the word cloud and sentiment analysis in R. Can you suggest how?

    • @bkrai
      @bkrai  5 years ago

      You can try these playlists:
      ua-cam.com/play/PL34t5iLfZddt0tt5GdDy3ny6X5RQvwrp6.html
      ua-cam.com/play/PL34t5iLfZddtfgOcE6aKCEXWtCkolpgjV.html

  • @bars2251
    @bars2251 3 years ago +2

    Sir, I have a problem. wordcloud2 is not working on my PC.
    error message:
    Error in gsub(

    • @bkrai
      @bkrai  3 years ago +1

      On a PC, make sure you use "utf-8" in line 7 and not "utf-8-mac".

    • @bars2251
      @bars2251 3 years ago

      @bkrai I am sure. I use the gsub function to convert Turkish characters.
      Example:
      corpus = gsub("ÅŸ", "s", corpus)
      ş > s

    • @havacilikveyapayzeka
      @havacilikveyapayzeka 3 years ago

      @bars2251 Error in gsub("

  • @debinsunny5145
    @debinsunny5145 4 years ago +2

    Thank you for the video. How did you create the apple.csv file on your desktop that contains 1000 tweets?

    • @bkrai
      @bkrai  4 years ago +1

      I got it from Twitter:
      ua-cam.com/video/QETCjkQ3CBw/v-deo.html

    • @debinsunny5145
      @debinsunny5145 4 years ago +1

      @bkrai Thank you sir

    • @bkrai
      @bkrai  4 years ago

      You are welcome!

  • @sharafabankole3425
    @sharafabankole3425 4 years ago +2

    Thanks so much for this detailed explanation. Please I'm having an issue fixing the TermDocumentMatrix error. I get this error when I get to the tdm:
    Error in nchar(Terms(x), type = "chars") :
    invalid multibyte string, element 220
    How do I fix it?

    • @bkrai
      @bkrai  4 years ago

      Difficult to say much without looking at the code.

  • @tanmaygawade1068
    @tanmaygawade1068 3 years ago +1

    >>>corpus

    • @bkrai
      @bkrai  3 years ago +1

      If you are using a Windows computer, use "utf-8" instead of "utf-8-mac".

  • @arunshowri7829
    @arunshowri7829 4 years ago +1

    Hello Sir, I have imported all the mentioned libraries, but when I run the command: get_nrc_sentiment, I am getting the error as:
    Error: 'vec_as_subscript' is not an exported object from 'namespace:vctrs'
    Run `rlang::last_error()` to see where the error occurred.
    Can you please help

    • @bkrai
      @bkrai  4 years ago

      Check the structure of your data and make sure it is in the right format.

  • @debaleenachatterjee9542
    @debaleenachatterjee9542 4 years ago +1

    Sir, unable to perform sentiment analysis due to unavailability of package syuzhet and lubridate.... others like ggplot,scales are there... R is showing error displaying there is no packages named syuzhet and lubridate... what to do Sir.. plz help..

    • @bkrai
      @bkrai  4 years ago

      You need to install those packages first.
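
A minimal sketch of the check behind this reply, assuming the packages named in the question (the commenter's "ggplot" is taken to mean ggplot2). The helper name `is_installed` is made up for illustration, and the install call is left commented out because it needs a network connection and a CRAN mirror:

```r
# Packages mentioned in the question above.
needed <- c("syuzhet", "lubridate", "ggplot2", "scales")

# requireNamespace() returns FALSE (instead of stopping) when a package is absent.
is_installed <- function(pkg) requireNamespace(pkg, quietly = TRUE)

# Keep only the names of packages that are not installed yet.
missing_pkgs <- needed[!vapply(needed, is_installed, logical(1))]

if (length(missing_pkgs) > 0) {
  message("Not installed yet: ", paste(missing_pkgs, collapse = ", "))
  # install.packages(missing_pkgs)  # uncomment to install from CRAN
}

# Installing is a one-time step; attaching is needed in every new session:
# library(syuzhet)
# library(lubridate)
```

The "there is no package called ..." error goes away once the package is installed, while "could not find function" after a successful install usually means the `library()` line was not run.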

  • @dushyanthbommana54
    @dushyanthbommana54 3 years ago +1

    I am getting the warning message below. How can I ignore it?
    Warning message:
    In tm_map.SimpleCorpus(corpus, tolower) : transformation drops documents

    • @bkrai
      @bkrai  3 years ago

      You can check the matrix and see if you are really losing anything.

  • @Vishal20April
    @Vishal20April 3 years ago +1

    Many thanks for your video. Can you please tell us something more about the Datasets Apple & Apple 2. Did you export it from Twitter Analytics/Activity.

    • @bkrai
      @bkrai  3 years ago

      I used this:
      ua-cam.com/video/QETCjkQ3CBw/v-deo.html

  • @54hari
    @54hari 5 years ago +3

    Warning message:
    In tm_map.SimpleCorpus(corpus, tolower) : transformation drops documents..
    Message is shown

    • @bkrai
      @bkrai  5 years ago +3

      Warning messages in R are OK. What you should not get is an error message.

  • @sonalichakrabarty1618
    @sonalichakrabarty1618 2 years ago +1

    Sir while trying this example my TermDocumentMatrix is not working...system gets hanged when the function is called

    • @bkrai
      @bkrai  2 years ago

      It may happen if the data size is too big to be handled by your computer. Try reducing the data size. Or you can try Google Colab:
      ua-cam.com/video/XVfn6IpoUPU/v-deo.html

  • @preetidevi8735
    @preetidevi8735 2 years ago +1

    Sir, how can the CSV file be generated? You provided a CSV file, but how can I extract a file from Twitter or Facebook?

    • @bkrai
      @bkrai  2 years ago

      For getting Twitter data, see if this helps:
      ua-cam.com/video/bxJseJ-aZJk/v-deo.html

  • @ChandraMSc
    @ChandraMSc 3 years ago +1

    Hey, I really appreciate your work; it is easy to understand. I have a question: can this code be applied to other languages, such as Bahasa Indonesia?
    Thank you, if you are available to reply.

    • @bkrai
      @bkrai  3 years ago

      I'll look into it.

  • @puneetsingh7779
    @puneetsingh7779 5 years ago +1

    Hi Sir, regarding the way you replaced "stocks" with "stock": I faced this challenge with multiple words. Google appeared as goog, googl, etc., Microsoft was written two ways, as microsoft and msft, and Amazon as amzn. How can I automate replacing these kinds of words?

    • @bkrai
      @bkrai  5 years ago

      You can do it for just top 2 or 3 words. After that it doesn't really matter.
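
For anyone who does want to automate this, here is a rough sketch using only base R. The mapping, the sample tweets, and the function name `normalize_terms` are all made up for illustration; the video itself only replaces a single word:

```r
# Hypothetical mapping: canonical term -> regular expression matching its variants.
variants <- c(
  google    = "\\b(goog|googl)\\b",
  microsoft = "\\b(msft|microsft)\\b",
  amazon    = "\\bamzn\\b"
)

# Apply every replacement in turn to a character vector.
normalize_terms <- function(text, mapping) {
  for (canonical in names(mapping)) {
    text <- gsub(mapping[[canonical]], canonical, text, perl = TRUE)
  }
  text
}

tweets <- c("goog and googl are up", "msft beats amzn")
normalized <- normalize_terms(tweets, variants)
print(normalized)
```

The word-boundary anchors (`\\b`) keep a short variant such as "goog" from matching inside the already correct "google". Inside a tm pipeline the function would probably need to be wrapped with `content_transformer()` before being passed to `tm_map()`; that part is an assumption and is not run here.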

  • @sachinguptaitm
    @sachinguptaitm 6 years ago +3

    Thanks Bharatendra sir for sharing your knowledge.IT has been very helpful in my current project. We have below list of sentiments packages.
    #syuzhet
    #SentimentR
    #Rsentiment
    #Sentimentanalysis
    I am still validating all of above. Please let us know your thought about package recommendation for sentiment analysis.

    • @bkrai
      @bkrai  6 years ago +1

      syuzhet should work fine for sentiment analysis.

    • @sachinguptaitm
      @sachinguptaitm 6 years ago

      Thanks sir for your confirmation. yes Syuzhet is giving expected result for emotion and sentiments with some misclassification. I am using SentimentR to get avg sentiment by respective dimension and to get positive word list and negative word list.

  • @jabeens02
    @jabeens02 5 years ago +3

    It was a Great Lecture and very easy to understand. Thanks for this video.
    Can you please tell me how did you pull tweets after earnings.
    which part of the video or lines we need to follow to pull the tweets?
    I have followed this complete video and got the results. I want to extract Preprocessed csv file..please help me by your suggestions?
    Is this preprocessed file stored in R library of my computer?

    • @bkrai
      @bkrai  5 years ago

      For pulling tweets you can follow 1st video in this playlist:
      ua-cam.com/play/PL34t5iLfZddt0tt5GdDy3ny6X5RQvwrp6.html

  • @KevinGalbraith
    @KevinGalbraith 4 years ago +5

    The best explanation and walk-through of sentiment analysis I've seen so far. Well done, very helpful.

    • @bkrai
      @bkrai  4 years ago

      Thanks for the feedback!

  • @praveenshah3284
    @praveenshah3284 4 years ago +2

    What version of R have u used for the entire project as I have having difficulties with R 3.5 and R 4.0 for installing some packages like Corpus and iconv

    • @bkrai
      @bkrai  4 years ago +1

      It probably has more to do with the computer. I used a Mac, which is why I used "utf-8-mac". If you are using Windows, make sure to use "utf-8".

  • @beverlymenezes1357
    @beverlymenezes1357 5 years ago +2

    what do i do if i want to save the score in that csv file? or replace the reviews with the score in the file? Please Help.

    • @bkrai
      @bkrai  5 years ago +1

      Let's say your twitter data is in apple. Let's say you have all sentiment scores in SCORES. Then you can combine them using cbind(apple, SCORES). It will combine the columns.

    • @beverlymenezes1357
      @beverlymenezes1357 5 years ago +1

      Dr. Bharatendra Rai will try this. Thank you so much

    • @bkrai
      @bkrai  5 years ago +1

      welcome!
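
The `cbind()` suggestion above can be sketched as follows. The two small data frames are made-up stand-ins for the tweet data (`apple`) and the sentiment scores (called SCORES in the reply), and the output path is a temporary file chosen only so the example runs anywhere:

```r
# Hypothetical stand-ins for the tweet data and the sentiment scores
# (one row of scores per tweet, in the same order).
apple  <- data.frame(text = c("love the new iphone", "ugly delay"),
                     stringsAsFactors = FALSE)
scores <- data.frame(positive = c(1, 0), negative = c(0, 2))

# cbind() places the columns side by side, so the row counts must match.
stopifnot(nrow(apple) == nrow(scores))
apple_scored <- cbind(apple, scores)

# Save the combined table; replace the temp file with your own path.
out_file <- tempfile(fileext = ".csv")
write.csv(apple_scored, out_file, row.names = FALSE)
```

Setting `row.names = FALSE` avoids an extra unnamed index column when the file is read back in.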

  • @vinayanand2920
    @vinayanand2920 3 years ago +1

    Sir when I am going to remove stopwords , still stopwords are present. Plz guide me how to remove these stopwords

    • @bkrai
      @bkrai  3 years ago

      It should work fine. Review your code again.

  • @birasafabrice
    @birasafabrice 2 years ago +1

    what should users of window use in the place of utf-8-mac at 1:50? I have tried to reproduce your example but it is not working, sorry I am newbie to these things, thank you in advance!

    • @bkrai
      @bkrai  2 years ago

      For Windows, use "utf-8".

    • @birasafabrice
      @birasafabrice 2 years ago

      @bkrai thanks a lot for the prompt feedback!

  • @jg_reveles
    @jg_reveles 5 years ago +2

    Thanks Bharatendra, your videos are great. Do you know how I could specify another language (like Spanish) for the sentiment analysis?

  • @puneetsingh7779
    @puneetsingh7779 5 years ago +2

    Dear Bharatendra Rai sir you are doing great job with these videos, very informational. Please guide how to get data from twitter or other social media handles for a specific word. For example real estate market's particular project in Chennai.
    Thanks for your valuable inputs.

    • @bkrai
      @bkrai  5 years ago

      You can use a word or phrase, just as you would in a Google search.

    • @puneetsingh7779
      @puneetsingh7779 5 years ago

      @bkrai thanks, I will try and get back to you.
      My question is: if I want to know what Kolkata people are tweeting on one specific tweet, how can I drill down to the location and then to that #subject?

    • @puneetsingh7779
      @puneetsingh7779 5 years ago

      @bkrai thanks

  • @dravidandaa
    @dravidandaa 6 years ago +3

    you are simply great. i tried and it worked really well. Thank you so much for clear explanation

    • @bkrai
      @bkrai  6 years ago

      Thanks for your comments!

  • @niv2419
    @niv2419 7 years ago +2

    Hello sir,
    What does 'utf-8-mac' mean and what does it do? Could you please explain?
    Thank You!

    • @bkrai
      @bkrai  4 years ago

      Only seeing this now; you have probably figured it out already.

  • @thejuhulikal6290
    @thejuhulikal6290 4 years ago +2

    Whichever topic I take to study in R software, I will find your video teaching most interestingly and easily, thank you so much, sir

    • @bkrai
      @bkrai  4 years ago

      You are most welcome!

  • @FPrimeHD1618
    @FPrimeHD1618 6 years ago +2

    Fantastic video! Surprised I have not run into your channel before. I am analyzing customer survey results from different countries, and one problem that I am having is that some of the countries have very polite people (not a bad thing of course) and their survey results show positive, when in fact they are not happy with the result. Have you had this problem before, and if so how did you handle it?

    • @bkrai
      @bkrai  6 years ago +1

      I've not come across this, but seems an interesting problem.

  • @kaushikchanda9690
    @kaushikchanda9690 3 years ago +3

    Sir, What is the Algorithm used here?

    • @bkrai
      @bkrai  3 years ago

      It uses the NRC lexicon.

  • @nageshgoud721
    @nageshgoud721 6 years ago +2

    Thanks for your knowledge Share. Your explanation helps me a lot to achieve my goal. However, when I ran the scripts
    corpus

    • @bkrai
      @bkrai  6 years ago

      Check if you ran library(tm) line or not.

    • @ashishsingh6329
      @ashishsingh6329 6 years ago

      Sir, i ran Library (tm) also on window but getting Error in iconv(apple$text, to = "utf-win") :
      unsupported conversion from '' to 'utf-win' in codepage 1252. pl help me .

    • @ashishsingh6329
      @ashishsingh6329 6 years ago

      even corpus function is also not working

    • @nachiketdesai3833
      @nachiketdesai3833 6 years ago

      utf8text

  • @pels7432
    @pels7432 5 years ago +3

    Absolutely awesome video. Learnt more from your video than I expected. Clear and sensational! Thank you.

    • @bkrai
      @bkrai  5 years ago

      Thanks for comments!

  • @akj2784
    @akj2784 6 years ago +2

    Awesome video. I see the video was uploaded around 1 year back. Is there any new library after that which can further simplify the coding ? Also will there be further simplification of the code if we use Microsoft Cognitive Services (Text Analytics API) .

    • @bkrai
      @bkrai  6 years ago +1

      If you are doing sentiment analysis, the code has just one or two lines. When using other tools such as MS cognitive services, note that the results are likely to be different.

  • @ajayvaidya7
    @ajayvaidya7 3 years ago +1

    When I write this code corpus

    • @bkrai
      @bkrai  3 years ago

      If you are using a Windows computer, use "utf-8".

    • @ajayvaidya7
      @ajayvaidya7 3 years ago +1

      @bkrai - Oh okay. Understood. Thank you.

    • @bkrai
      @bkrai  3 years ago

      Welcome!

  • @lipanghosh2252
    @lipanghosh2252 4 years ago +1

    Hi Sir in TermdocumentMatrix ---Cleanset is not coming ,do we need to install any separate package for it?

    • @bkrai
      @bkrai  4 years ago

      Make sure that in TermdocumentMatrix the 'D' is uppercase, as in 'TermDocumentMatrix'.

    • @lipanghosh2252
      @lipanghosh2252 4 years ago

      @bkrai done sir

  • @vinayanand2920
    @vinayanand2920 3 years ago +1

    Sir please make a video on topic modelling of Twitter data on R🙏

    • @bkrai
      @bkrai  3 years ago

      Thanks for the suggestion!

  • @emregumus9640
    @emregumus9640 4 years ago +1

    Thank you for the video, but the lettercloud function is not working. R does not report any error for lettercloud, but no graph appears. Why? Please help me.

    • @bkrai
      @bkrai  4 years ago +1

      I also noticed it. Looks like this feature doesn't exist anymore.

    • @emregumus9640
      @emregumus9640 4 years ago

      @bkrai hi again. I finally fixed the problem. First run the "lettercloud" code; an empty page appears in the corner. Select "open in the browser" at the top. There will again be an empty page; finally press F5 to refresh the browser page, and you will see the lettercloud. :)

    • @frederico3725
      @frederico3725 3 years ago

      Congrats on the work @bkrai. Regarding lettercloud, after I run the code, it only responds with one word, but it works fine for wordcloud2 to make stars or circles. Do you know what the problem is?

  • @nureyna629
    @nureyna629 6 years ago +4

    This video is really helpful, thank you very much! But how can we evaluate the performance of the sentiment analysis you have just made? Is there any metric to use? Please help...

    • @bkrai
      @bkrai  6 years ago +2

      One way could be to score tweets manually and then compare them with the results from the automated sentiment analysis. The mismatch % could then work as a metric to assess performance.

    • @nureyna629
      @nureyna629 6 years ago +1

      Thank you very much.
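
The mismatch % idea from the reply above can be sketched in a few lines of base R. The labels below are invented for illustration; in practice `manual` would come from hand-scoring a sample of tweets and `automated` from the lexicon-based scores reduced to the same label set:

```r
# Hypothetical labels for six tweets: hand-scored vs. automated.
manual    <- c("pos", "neg", "neg", "pos", "neu", "neg")
automated <- c("pos", "neg", "pos", "pos", "neg", "neg")

# Both vectors must describe the same tweets in the same order.
stopifnot(length(manual) == length(automated))

# Share of tweets where the two labels disagree, as a percentage.
mismatch_pct <- 100 * mean(manual != automated)

# A cross-tabulation shows which kinds of disagreement occur.
confusion <- table(manual = manual, automated = automated)
print(confusion)
```

A lower mismatch % means closer agreement with the manual labels; the cross-tabulation helps spot systematic errors, such as neutral tweets being scored as negative.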

  • @flamboyantperson5936
    @flamboyantperson5936 6 years ago +4

    This is simply awesome and so much interesting. Loved it. Thank you so much Sir.

    • @bkrai
      @bkrai  6 years ago

      Thanks!

  • @rahulvijay8129
    @rahulvijay8129 2 years ago +1

    Could you please help me out...
    I was unable to get the letter cloud.
    All steps followed as mentioned in the video.
    The video was very helpful in all the other aspects.

    • @bkrai
      @bkrai  2 years ago

      I too noticed that lettercloud is no longer functional. I hope they update the package soon.

    • @rahulvijay8129
      @rahulvijay8129 2 years ago +1

      Wow!
      I didn't expect such quick response..
      Thank you so much for the clarification..🙏🏾

    • @bkrai
      @bkrai  2 years ago

      You are welcome!

  • @sriraksha2114
    @sriraksha2114 5 years ago +2

    Thank you soo much for the video but I just wanted to know which algorithm is being used here

  • @ayushraj7267
    @ayushraj7267 3 years ago

    Hello sir.. thoroughly enjoyed the video, and was trying to replicate the same in windows, but was getting some errors as in fourth line there were some accented characters which i could not get rid of -
    "RT @SylvaCap: Things might get ugly for $aapl with the iphone delay. With $aapl down that means almost all of the FANG stocks were down pos… "
    the letters after pos in the end.. (i am thinking this might be due to some encoding issue, I am using utf-8 as suggested by you in one of the comments below),
    i tried using your code as well, still the result were same.. Can you help me on how to move forward? Thanks

  • @user-wj3dh
    @user-wj3dh 4 years ago +1

    Sir can you provide the same code for windows users since I'm running the code on windows machine. Because some commands are different in Mac laptop.

    • @bkrai
      @bkrai  4 years ago

      Did you try this: ua-cam.com/video/BPjgwdqHM8g/v-deo.html

    • @user-wj3dh
      @user-wj3dh 4 years ago +1

      @bkrai thank you so much Sir.

    • @bkrai
      @bkrai  4 years ago

      You are welcome!

  • @jiaxinli3351
    @jiaxinli3351 6 years ago +2

    Thank you for your video! It is very helpful and your explanation is so clear! Thanks again.

    • @bkrai
      @bkrai  6 years ago

      Thanks for comments!

  • @TheZltd
    @TheZltd 2 years ago +1

    Thank you Dr. Rai, very informative video kind sir

    • @bkrai
      @bkrai  2 years ago

      You are welcome!

  • @ajayvaidya7
    @ajayvaidya7 3 years ago

    I am getting this error Error in nchar(Terms(x), type = "chars") : invalid multibyte string, element 22, when I write this code for tdm. Can you please help me with code error?

  • @sharjeelarain6897
    @sharjeelarain6897 4 years ago +1

    This video is very helpful and could explain z test on r

    • @bkrai
      @bkrai  4 years ago

      I've used z-test in this:
      ua-cam.com/video/oxRy2DMrOF4/v-deo.html

  • @swatigupta1956
    @swatigupta1956 6 years ago +2

    This was really helpful, but I was wondering how to normalize words. Here you did it for only one word; what if I need to do it for many words?

    • @bkrai
      @bkrai  6 years ago

      For 'normalization', which line of the code in the video are you referring to?

  • @orueta81ify
    @orueta81ify 5 years ago +1

    > tdm tdm
    Non-/sparse entries: 612739/3617891553
    Sparsity : 100%
    Error in nchar(Terms(x), type = "chars") :
    invalid multibyte string, element 13087
    > tdm

    • @bkrai
      @bkrai  5 years ago

      Probably 27 GB is too big for your computer.

  • @Kareviews100
    @Kareviews100 5 years ago +2

    thank you sir! amazing video. You explanation and pace of videos are on point!. thanks again.

    • @bkrai
      @bkrai  5 years ago

      Thanks for comments!

  • @elpiopro
    @elpiopro 2 years ago +1

    You're a legend! thank you for this video.

    • @bkrai
      @bkrai  2 years ago

      You're welcome!

  • @eminanokic7093
    @eminanokic7093 2 years ago

    Hi! Thanks a lot for your video! I have a question. How do I remove stopwords for German? Is it the same line of code with putting german instead of english?
    Thanks

  • @muralidhara2063
    @muralidhara2063 6 years ago +2

    Hey bharat, Thank you very much for such a nice explanation.
    Need your help.
    am not able to see the output of letter cloud as shown.
    Please suggest.
    Regards,
    Murali

    • @bkrai
      @bkrai  6 years ago

      Difficult to say much without looking at the code. Probably the size is too big and it doesn't fit in the available area.

    • @muralidhara2063
      @muralidhara2063 6 years ago

      Thank you for your reply.
      Actually, am using the same code for lettercloud but the output is not same as yours.
      code: "letterCloud(w,word = "apple",wordSize = 1)"
      The output is apple with black letters.
      Please share your email id so that I can share the output

  • @tingblubblub6106
    @tingblubblub6106 5 years ago +2

    Very good explanation! Excellent job! Btw I have a question about word cloud: does the position of the words mean any relationship?

    • @bkrai
      @bkrai  5 years ago +1

      The position of the words is random.

  • @marshalmashiat
    @marshalmashiat 3 years ago +1

    at 7:03 Error in nchar(Terms(x), type = "chars") :
    invalid multibyte string, element 19-----Cant find solution anywhere

    • @baruchschwartz819
      @baruchschwartz819 3 years ago +2

      The problem is that you don't have a Mac, so you never converted your Apple text into UTF-8.
      Use:
      apple$text

    • @marshalmashiat
      @marshalmashiat 3 years ago +1

      @baruchschwartz819 thank you so much sir. help appreciated

    • @baruchschwartz819
      @baruchschwartz819 3 years ago +1

      @marshalmashiat cheers

    • @bkrai
      @bkrai  3 years ago +1

      Thanks!
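
A minimal sketch of the kind of conversion this thread points to, assuming a character vector standing in for `apple$text`. The sample strings are made up, and `sub = ""` (dropping bytes that cannot be converted) is one possible choice rather than what the video does:

```r
# Hypothetical stand-in for apple$text. The second element contains a raw
# byte (0xE9) that is not valid UTF-8, the kind of input that can trigger
# "invalid multibyte string" errors later on.
tweets <- c("Things might get ugly for $aapl", "caf\xe9 open")

# "utf-8-mac" is the name used on a Mac in the video; "UTF-8" is the
# portable name. sub = "" drops unconvertible bytes instead of returning NA.
clean <- iconv(tweets, from = "UTF-8", to = "UTF-8", sub = "")

# Check that every element is now valid UTF-8.
all_valid <- all(validUTF8(clean))
print(all_valid)
```

If `iconv()` reports an "unsupported conversion", `iconvlist()` shows which encoding names the local installation accepts.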

  • @JohnWick-mk4ve
    @JohnWick-mk4ve 4 years ago +2

    Excellent Video sir, very elaborate. a quick question.. what would be the windows equivalent of the line tweets

    • @bkrai
      @bkrai  4 years ago +4

      tweets

    • @Ambesh.m
      @Ambesh.m 6 months ago

      @bkrai Sir, this shows "unsupported conversion from ' ' to 'utf-8' in codepage 65001". Please help with this.

  • @janapureddiradha7412
    @janapureddiradha7412 3 years ago

    The names on the x-axis and the counts on the y-axis didn't appear when I ran the barplot. It only shows the 📊 itself. Please help me out.

  • @thambi83
    @thambi83 5 years ago +2

    Excellent one Sir.... Please keep sharing such videos ....Your way of explaining things is Awesome...

    • @bkrai
      @bkrai  5 years ago

      Thanks for comments and feedback!

  • @albertooraa5012
    @albertooraa5012 4 years ago +1

    Hi! I have a question
    While trying to plot the wordcloud I get this message:
    hertha could not be fit on page. It will not be plotted.
    6: In wordcloud(words = names(w), freq = w, max.words = 100, min.freq = 5, :
    Any way I can fix it?

    • @bkrai
      @bkrai  4 years ago

      You can control how big or small a word looks using the scale argument, as shown in the video.

  • @Mr.Conyer
    @Mr.Conyer 7 years ago +1

    For the sentiment bar chart comparison, it would be easier to compare if the bars were based on % of the total row count instead of raw counts.
    Would you do rowsum(x) / sum(x)?
    Or is it colsum(x) / sum(x)?

    • @bkrai
      @bkrai  7 years ago

      For converting y-axis to % you can use:
      barplot(100*colSums(s)/sum(s),
      las = 2,
      col = rainbow(10),
      ylab = 'Percentage',
      main = '% Sentiment Scores for Apple Tweets')

  • @netmarketer77
    @netmarketer77 5 years ago +1

    Mr.Baharat, After getting text (tweets) and clean them, How can we rewrite them as a .csv file to get the sentiment analysis of the new data with the clean text. What you did here is that you analysed the raw data with unclean text? Thanks

  • @anu2udu
    @anu2udu 6 years ago +2

    Dear Bharatendra ..awesome video thanks for efforts to share knowledge . superb

    • @bkrai
      @bkrai  6 years ago

      Thanks for comments!

  • @jasonyao3762
    @jasonyao3762 2 years ago

    Hello Professor, first of all, thank you for sharing. I have a question about how to handle a corpus in TXT format.

  • @vishnunath1524
    @vishnunath1524 6 years ago +4

    Thank you sir for a simple tutorial on sentiment analysis and word cloud. Quite helpful for my marketing research class.
    Regards.

    • @bkrai
      @bkrai  6 years ago +1

      Thanks for comments!

  • @yellankinaveen2496
    @yellankinaveen2496 2 years ago +1

    lettercloud is not working in my RStudio. Can you tell me why it is showing only the spelling of "tax"? I can't see a result like yours.

    • @bkrai
      @bkrai  2 years ago

      Recently I too tried and it doesn't show up. Looks like they have discontinued this feature.

    • @yellankinaveen2496
      @yellankinaveen2496 2 years ago +1

      @bkrai kk sir Thank you for replying

    • @bkrai
      @bkrai  2 years ago

      You are welcome!

  • @amanakshansh1021
    @amanakshansh1021 5 years ago +2

    You have explained everything very clearly and it is very helpful for beginners like me. Thank You

    • @bkrai
      @bkrai  5 years ago +1

      Thanks for comments!

    • @amanakshansh1021
      @amanakshansh1021 5 years ago

      Sir, I have one doubt. Why have you used to = "utf-8-mac" in line 7?
      Is it the same every time, in all problems?

    • @beverlymenezes1357
      @beverlymenezes1357 5 years ago

      @amanakshansh1021 no, you can just write utf8 instead.

  • @rahulsingh40g
    @rahulsingh40g 3 years ago +1

    No explanations for the functions used. Downvoted.

    • @bkrai
      @bkrai  3 years ago

      Thanks for the feedback!

  • @markviya2194
    @markviya2194 1 year ago +1

    how can we calculate percentage sentiment of all tweets?

    • @bkrai
      @bkrai  1 year ago

      Check this link, I used % here:
      ua-cam.com/video/kkMBMxwxTEg/v-deo.html

  • @joseluisbeltramone599
    @joseluisbeltramone599 4 years ago +2

    Excellent. Very well explained. Thanks a lot, Dr.

    • @bkrai
      @bkrai  4 years ago

      Most welcome!

  • @birasafabrice
    @birasafabrice 2 years ago +2

    this is an amazing tutorial, thank you Dr.!

    • @bkrai
      @bkrai  2 years ago

      You're very welcome!

  • @lalithkathula7572
    @lalithkathula7572 4 years ago +1

    Hello sir, I have a doubt. For the first file, apple.csv, you did all the preprocessing (cleaning of the data), but the second time, when you performed the actual sentiment analysis, you did not do any preprocessing. So are you doing the sentiment analysis on the second file without any cleaning?

    • @bkrai
      @bkrai  4 years ago

      Preprocessing helps when you are developing a wordcloud, as you don't want to see things that are not relevant. However, for sentiment analysis it may not matter whether common words like 'the' or 'is' are there or not.

  • @xxXXCarbon6XXxx
    @xxXXCarbon6XXxx 5 years ago +1

    Word clouds should be banned; next to the Pie chart they are one of the worst ways of displaying data.

    • @bkrai
      @bkrai  5 years ago

      😃

  • @WahranRai
    @WahranRai 2 years ago +1

    Nothing to say: pure quality

    • @bkrai
      @bkrai  2 years ago

      Thanks!

  • @netmarketer77
    @netmarketer77 6 years ago +2

    It is really amazing. Can we know how to get those two files from Twitter? Is there an R package that enables us to do so? Or how? Thanks.

    • @bkrai
      @bkrai  6 years ago +1

      You can get Twitter data using steps in this link:
      ua-cam.com/video/QETCjkQ3CBw/v-deo.html

    • @netmarketer77
      @netmarketer77 6 years ago +1

      Honestly, while you were typing this reply I was already watching the link you provided. Much appreciated, Bharat. Thanks.

  • @apurva.m25
    @apurva.m25 4 years ago +1

    Which algorithm is used for this analysis?

    • @bkrai
      @bkrai  4 years ago

      For the sentiment analysis, the NRC lexicon is used.

  • @abhiabhinav2176
    @abhiabhinav2176 4 years ago +1

    Sir, may I know how you got "apple2.csv"?
    That is, how did you extract the data (apple2.csv) after earnings?

    • @bkrai
      @bkrai  4 years ago

      Here is the link to get tweets in csv file: ua-cam.com/video/QETCjkQ3CBw/v-deo.html

  • @bpa261
    @bpa261 4 years ago +1

    If the text is in a language other than English, how can I handle it? Is there any video related to NLP topics such as bigrams, Wu-Palmer similarity, stemming, or lemmatization?

    • @bkrai
      @bkrai  4 years ago

      I've added this to my list.

  • @danielit5845
    @danielit5845 7 years ago +2

    Many thanks, Rai, but please explain to me in simple words how the "iconv" function works.
    I didn't understand that function. Great work!

    • @bkrai
      @bkrai  6 years ago

      It's needed to convert the text data to the right encoding when working on a Mac.

    • @siva-xj2kj
      @siva-xj2kj 6 years ago +1

      What about Windows?

  • @sanjaseljan1405
    @sanjaseljan1405 5 years ago +1

    Really excellent video!!
    Where can we find dictionaries with keywords annotated with sentiments (e.g. ugly as you mentioned in video)?
    Also, how can we add sentiment dictionary for new language? Any video on these topics?
    Thank you very much ... :)

  • @harishnagpal21
    @harishnagpal21 5 years ago +2

    Very well explained Bharatendra. I am learning lots of interesting things in R from you.

    • @bkrai
      @bkrai  5 years ago

      Thanks for comments!

  • @bhaveshjoon9291
    @bhaveshjoon9291 3 years ago +1

    The code shows an error, i.e. could not find get_nrc_sentiment, even though I have installed the syuzhet package. Can you tell me what the problem is?

    • @bkrai
      @bkrai  3 years ago

      Make sure to run the library line after installing the package.

  • @ericrichard4940
    @ericrichard4940 4 years ago +1

    What if there is an interesting word in the wordcloud? Is it possible to find which tweet it came from, basically to localize it?

    • @bkrai
      @bkrai  4 years ago +1

      Such a word is likely to be from several tweets. You can know which tweets it came from using the Term Document Matrix.
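
A rough sketch of that lookup, using a small hand-built matrix as a stand-in for `as.matrix(tdm)` so that it runs without the tm package. The terms, counts, and tweets are invented for illustration:

```r
# Hypothetical stand-in for as.matrix(tdm): rows are terms, columns are tweets.
tdm_matrix <- matrix(
  c(1, 0, 2,
    0, 1, 0,
    1, 1, 0),
  nrow = 3, byrow = TRUE,
  dimnames = list(Terms = c("earnings", "iphone", "stock"),
                  Docs  = c("1", "2", "3"))
)

# The tweets the columns refer to, in the same order.
tweets <- c("earnings beat, stock up",
            "iphone delay hits stock",
            "earnings, earnings")

# Columns with a non-zero count for the word are the tweets containing it.
word    <- "earnings"
doc_ids <- which(tdm_matrix[word, ] > 0)
matching_tweets <- tweets[doc_ids]
print(matching_tweets)
```

The same indexing should apply to the real matrix, as long as the column order of the term document matrix still matches the row order of the original data.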

  • @netmarketer77
    @netmarketer77 6 years ago +1

    Mr. Bharat, when I try to pull new data about Apple from Twitter now, R returns tweets dated 2018. My question is: how can I pull tweets from, for example, the second quarter of 2016? Is there any way to specify an exact date range when pulling tweets? Regards.

    • @bkrai
      @bkrai  6 years ago +1

      Twitter has some restrictions on how much and how often you can pull data. It only allows you to go back a few weeks from the current date.

  • @priya6053
    @priya6053 3 years ago +1

    Where did you get the dataset from? What is its source?

    • @bkrai
      @bkrai  3 years ago +1

      Here is the link to get data from Twitter:
      ua-cam.com/video/QETCjkQ3CBw/v-deo.html

  • @arunvijay8514
    @arunvijay8514 4 years ago +1

    Can you please make a video on text2vec

    • @bkrai
      @bkrai  4 years ago +1

      I've added it to my list.

  • @jasbirkaur-ou8en
    @jasbirkaur-ou8en 6 years ago +2

    Awesome video.. thank you for the great explanation!

    • @bkrai
      @bkrai  6 years ago

      Thanks for comments!

  • @MalharTakle
    @MalharTakle 5 years ago +1

    Hello,
    Is it possible to use the above method for sentiment analysis of tweets in languages other than English?

    • @bkrai
      @bkrai  4 years ago

      I've added it to my list.

  • @saurabhiim
    @saurabhiim 7 years ago +1

    Thank you very much. Sir, we are facing many issues with the openNLP package. We request you to kindly help us get some insight on that as well.

    • @bkrai
      @bkrai  7 years ago

      I've not used openNLP.

    • @muralidhara2063
      @muralidhara2063 6 years ago

      Can I know the issues you are facing? Thanks in advance..

  • @sandiprijal9521
    @sandiprijal9521 2 years ago

    Hello sir,
    I am having this issue while using the code for Twitter and have no idea how to resolve it.
    > setup_twitter_oauth(api_key,api_secret,access_token,access_secret)
    [1] "Using direct authentication"
    Error in check_twitter_oauth() : OAuth authentication error:
    This most likely means that you have incorrectly called setup_twitter_oauth()'
    > setup_twitter_oauth(api_key,api_secret,access_token=NULL,access_secret=NULL)
    [1] "Using browser based authentication"
    Error in init_oauth1.0(self$endpoint, self$app, permission = self$params$permission, :
    Forbidden (HTTP 403).
    Your guidance will be of great help.

    • @bkrai
      @bkrai  2 years ago

      For getting Twitter data, try this:
      ua-cam.com/video/bxJseJ-aZJk/v-deo.html

  • @sidrahijaz9826
    @sidrahijaz9826 5 years ago +1

    Hello sir, thanks for such a knowledgeable video. I want to know which year the data used in this video is from, and may we use this data for research purposes? Please reply. Thanks again.

    • @bkrai
      @bkrai  5 years ago

      It's 2017.

  • @harish00784
    @harish00784 4 years ago +1

    Very informative, thanks a lot. Subscribed the moment after watching this video.

    • @bkrai
      @bkrai  4 years ago

      Thanks and welcome!

  • @lakshmiprayaga1651
    @lakshmiprayaga1651 6 years ago +2

    Awesome videos, examples were very simple and nice, useful materials

    • @bkrai
      @bkrai  6 years ago

      Thanks for comments!

  • @abrarahmedkhan7203
    @abrarahmedkhan7203 1 year ago +1

    Can I get the CSV files for the Twitter data?

    • @bkrai
      @bkrai  1 year ago

      Use the link below the video.

  • @wingswt8536
    @wingswt8536 4 years ago +1

    Hello, I have a question: how can I view an Excel file with Chinese characters? After I run str() on my data, it only shows something like this: U+84EE. I have tried this corpus

  • @netmarketer77
    @netmarketer77 6 years ago +1

    Hi Bharat, when looking at the data file, we can see that most of the columns are N/A. How can we get longitude and latitude, for example? What do replyToSN and replyToSID mean?
    You mentioned that the two Apple datasets were taken from Twitter before and after the quarterly earnings report. Could you please let us know which quarter that is? I mean, was the data taken after the first, second, third, or fourth quarter?

    • @bkrai
      @bkrai  6 years ago

      Longitude and latitude are only available for Twitter users who agree to share their location. For privacy reasons many people prefer not to share it, which is why the majority of tweets do not have this info.
      replyToSN - screen name of the user being replied to
      replyToSID - ID of the status (tweet) being replied to
      apple2.csv has data for the 2nd quarter of 2017.
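
A small sketch of how the missing location values could be checked and filtered in base R. The data frame below is made up for illustration; only the column names `replyToSN`, `longitude`, and `latitude` follow the ones discussed in this thread, and the real file has more columns.

```r
# Hypothetical stand-in for the tweet data; values are invented
tweets <- data.frame(
  text      = c("first tweet", "second tweet", "third tweet"),
  replyToSN = c(NA, "apple", NA),
  longitude = c(NA, -122.03, NA),
  latitude  = c(NA, 37.33, NA),
  stringsAsFactors = FALSE
)

# Share of missing values in each column
na_share <- colMeans(is.na(tweets))
print(round(na_share, 2))

# Keep only the tweets that carry both coordinates
geo <- tweets[complete.cases(tweets[, c("longitude", "latitude")]), ]
print(geo)
```

Checking the share of missing values first shows how few tweets would remain before any location-based analysis is attempted.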

  • @DnyaneshwarPanchaldsp
    @DnyaneshwarPanchaldsp 5 years ago +2

    Very useful, thanks 👌💐💐💐💐

    • @bkrai
      @bkrai  5 years ago

      Thanks for comments!

  • @kaapiglass
    @kaapiglass 4 years ago +1

    Hello sir,
    Don't we have to preprocess the text before running get_nrc_sentiment? You are passing the comments directly into that command, but in the case of the word cloud you are removing stopwords, URLs, etc. Could you please explain why you are doing that?

    • @bkrai
      @bkrai  4 years ago +1

      For sentiment analysis, words with no emotions will automatically be ignored. In the word cloud, we do not want words that are not meaningful to appear.
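
A toy illustration of why lexicon-based scoring is not affected by stopwords, written in base R. The four-word `lexicon` and the `score_emotions` helper are hypothetical and much simpler than the real NRC lexicon behind get_nrc_sentiment; the sketch only shows the general idea that words absent from the lexicon contribute nothing to the counts.

```r
# Hypothetical mini-lexicon; the real NRC lexicon is far larger
lexicon <- data.frame(
  word    = c("love", "great", "hate", "terrible"),
  emotion = c("joy", "joy", "anger", "anger"),
  stringsAsFactors = FALSE
)

# Count emotion hits; words not found in the lexicon are skipped
score_emotions <- function(text) {
  words <- strsplit(tolower(text), "[^a-z]+")[[1]]
  hits  <- lexicon$emotion[match(words, lexicon$word)]  # NA when no match
  table(factor(hits, levels = unique(lexicon$emotion)))
}

raw     <- "I love the new phone and the camera is great"
cleaned <- "love new phone camera great"

print(score_emotions(raw))
print(score_emotions(cleaned))
```

Both calls give the same counts, because "the", "and", and "is" never match a lexicon entry. A word cloud, by contrast, plots raw word frequencies, so the same stopwords would dominate it unless they are removed first.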

    • @kaapiglass
      @kaapiglass 4 years ago

      @@bkrai ok thanks

  • @Gamevault007
    @Gamevault007 3 years ago

    I have a query: can we do theme tagging for social media data or consumer data?

  • @google_india
    @google_india 6 years ago +2

    Awesome it is! Subscribed to your channel :)

    • @bkrai
      @bkrai  4 years ago

      Thanks!