I honestly wish there was a "double thumbs up"-button to your videos! You are saving a lot of students a lot of headaches ;) Please keep these very informative videos going!
hi :) when I perform the terms (lda_model) for my topic analysis, I get this error message: "Error $ operator not defined for this S4 class". Can anyone help?
Hi Kasper, Thank you for your videos. Very informative. Is it possible to use a word document (doc or docx)? or some preparation is important prior to analysis?
There are packages for reading .doc and .docx into r. The easiest solution here would be to use the readtext package (github.com/quanteda/readtext). You do might have to do some preparation, depending on how your texts are structured, but the readtext documentation deals with the most common use cases.
@@kasperwelbers thanks a lot Kasper. I am wondering if there is any particular approach to analyze a collection of entire (full) research papers (not just abstracts)? Either stored in a private link or PDF files. What would be your recommendations to analyze entire research papers?
Hi @@RaJa-qy3py, I'm not an expert on this, but someone asked a similar question and I ran into the fulltext package (books.ropensci.org/fulltext/ ) which might be just what you need.
@@kasperwelbers Thank you for your suggestion and Sorry for the late reply. I have been out for field work. I have the full texts either pdfs or word file. How to analyse the entire pdf/word file? Is that possible? Something like global biodiversity reports (long PDFs) from FAO, IUCN, WWF etc.,
@@RaJa-qy3py If you already have the files, there are basically two steps. The first is to load the data into R. For this you probably want to use the readtext package (github.com/quanteda/readtext). This let's you point to a folder of pdf/word files, and all files will be imported and structured as a data.frame. If you just want to analyze "the entire pdf/word file", this might be all you need to get the data in a format that quanteda (or another text analysis package) can handle.
I honestly wish there was a "double thumbs up"-button to your videos!
You are saving a lot of students a lot of headaches ;) Please keep these very informative videos going!
Hi Kasper, I really can't thank you enough for these videos, without them I don't think could have managed writing the methods for my PhD!!!
What a legend! You have no idea how much your videos have helped me. Thanks for making it clear and easy to understand:)
hi :) when I perform the terms (lda_model) for my topic analysis, I get this error message: "Error $ operator not defined for this S4 class". Can anyone help?
Hi Kasper, Thank you for your videos. Very informative.
Is it possible to use a word document (doc or docx)? or some preparation is important prior to analysis?
There are packages for reading .doc and .docx into r. The easiest solution here would be to use the readtext package (github.com/quanteda/readtext). You do might have to do some preparation, depending on how your texts are structured, but the readtext documentation deals with the most common use cases.
@@kasperwelbers thanks a lot Kasper. I am wondering if there is any particular approach to analyze a collection of entire (full) research papers (not just abstracts)? Either stored in a private link or PDF files. What would be your recommendations to analyze entire research papers?
Hi @@RaJa-qy3py, I'm not an expert on this, but someone asked a similar question and I ran into the fulltext package (books.ropensci.org/fulltext/ ) which might be just what you need.
@@kasperwelbers
Thank you for your suggestion and Sorry for the late reply. I have been out for field work.
I have the full texts either pdfs or word file. How to analyse the entire pdf/word file? Is that possible? Something like global biodiversity reports (long PDFs) from FAO, IUCN, WWF etc.,
@@RaJa-qy3py If you already have the files, there are basically two steps. The first is to load the data into R. For this you probably want to use the readtext package (github.com/quanteda/readtext). This let's you point to a folder of pdf/word files, and all files will be imported and structured as a data.frame. If you just want to analyze "the entire pdf/word file", this might be all you need to get the data in a format that quanteda (or another text analysis package) can handle.
great thanks
Brabo