Yes, very neat! I took a picture with a my phone of a page from an old book under less than perfect conditions and it was remarkable accurate. Few typos here and there but no more than an ordinary human would make before doing a spellcheck. Thanks
You could run a simple accuracy test by taking a text file (word or latex) and convert it into a pdf. Next you could scan it and compare the original with the scanned version.
Nicely done! One thing missing is the discussion on the installation of Tesseract. Have you done it elsewhere? (Great video --- and I really like the Arabic example!!!)
Just like to say very cool... keep it up ... Is there anything like tesseract for objects? Is there any way to have a doo it your self object recognition with R?
Yes, very neat! I took a picture with a my phone of a page from an old book under less than perfect conditions and it was remarkable accurate. Few typos here and there but no more than an ordinary human would make before doing a spellcheck.
Thanks
You could run a simple accuracy test by taking a text file (word or latex) and convert it into a pdf. Next you could scan it and compare the original with the scanned version.
fantastic Samer
Great tutorial!
Nicely done! One thing missing is the discussion on the installation of Tesseract. Have you done it elsewhere? (Great video --- and I really like the Arabic example!!!)
very nice explained as always, would you mind to do one on NLP for R?
That's a good one! Thank you.
Can you make a video on how to extract tables from a pdf containing tables
Just like to say very cool... keep it up ... Is there anything like tesseract for objects? Is there any way to have a doo it your self object recognition with R?
I know there's something like that in Python. Not sure about it being in R.
Hi @
Samer Hijjazi could you please make a video on how to bypass the captha using R
that sounds delightfull... but capcha is now for trains and boats
Might be a difficult one to be honest. I know Python can do that.
Hi I have a question, is there a way to save into .docx or .pdf?
You can save the text in a variable then add that variable to a word doc object using the officer package. Not so sure about PDF.