Introduction to Optical Character Recognition OCR Using R (Tesseract)

  • Published Dec 4, 2024

COMMENTS • 15

  • @haraldurkarlsson1147 2 years ago +2

    Yes, very neat! I took a picture with my phone of a page from an old book under less-than-perfect conditions, and the result was remarkably accurate. A few typos here and there, but no more than an ordinary human would make before running a spellcheck.
    Thanks

  • @haraldurkarlsson1147 2 years ago +3

    You could run a simple accuracy test by taking a text file (Word or LaTeX) and converting it into a PDF. Next, you could scan it and compare the original with the scanned version.
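
One way to put a number on the comparison described above is a character error rate: the edit distance between the original text and the OCR output, divided by the length of the original. The following is only a sketch under stated assumptions. The two strings are made-up placeholders, and in practice `ocr_text` would come from something like `tesseract::ocr()` run on the scanned page. Only base R is used for the comparison itself.

```r
# Character error rate (CER): edit distance divided by the reference length.
# `reference` and `ocr_text` are hypothetical placeholders; in practice
# `ocr_text` might come from tesseract::ocr("scan.png").
reference <- "The quick brown fox jumps over the lazy dog."
ocr_text  <- "The quick brovvn fox jumps ovcr the lazy dog."

# Collapse runs of whitespace so line breaks in the OCR output
# are not counted as recognition errors.
normalize <- function(x) trimws(gsub("\\s+", " ", x))

ref <- normalize(reference)
hyp <- normalize(ocr_text)

# utils::adist() computes the Levenshtein (edit) distance between strings.
edits <- as.integer(utils::adist(ref, hyp))
cer <- edits / nchar(ref)
accuracy <- 1 - cer

cat(sprintf("Edits: %d, CER: %.3f, accuracy: %.1f%%\n",
            edits, cer, 100 * accuracy))
```

A character-level measure is a deliberate simplification: it treats every insertion, deletion, and substitution equally, so a word-level comparison may be more meaningful for some documents.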

  • @mubangansofu7469 1 year ago

    Fantastic, Samer!

  • @ImpulsiveMigea 2 years ago

    Great tutorial!

  • @alrumich 2 years ago

    Nicely done! One thing missing is a discussion of how to install Tesseract. Have you covered that elsewhere? (Great video, and I really like the Arabic example!)

  • @siamaksiamak5583 2 years ago +1

    Very nicely explained, as always. Would you mind doing one on NLP in R?

  • @rohitsatyam2935 5 months ago

    Can you make a video on how to extract tables from a PDF?

  • @ciroweinstein8627 2 years ago

    Just wanted to say: very cool, keep it up! Is there anything like Tesseract for objects? Is there any way to build do-it-yourself object recognition in R?

    • @SamerHijjazi 1 year ago +1

      I know there's something like that in Python. Not sure about it being in R.

  • @prashanthiremath1259 2 years ago +1

    Hi @Samer Hijjazi, could you please make a video on how to bypass CAPTCHA using R?

    • @ciroweinstein8627 2 years ago

      That sounds delightful... but CAPTCHA is now for trains and boats.

    • @SamerHijjazi 1 year ago

      Might be a difficult one, to be honest. I know Python can do that.

  • @95btkh 2 years ago

    Hi, I have a question: is there a way to save the output to .docx or .pdf?

    • @SamerHijjazi 1 year ago

      You can save the text in a variable, then add that variable to a Word document object using the officer package. I'm not so sure about PDF.
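
A minimal sketch of the officer approach mentioned in this reply, assuming the officer package is installed. The `ocr_text` string and the output file name are hypothetical placeholders. In practice the text might come from `tesseract::ocr()`. Each line of the recognized text is added as its own paragraph.

```r
library(officer)

# Placeholder for the OCR result; in practice this might be
# tesseract::ocr("page.png").
ocr_text <- "First line of recognized text.\nSecond line of recognized text."

# Split the text on newlines so each line becomes its own paragraph.
lines <- strsplit(ocr_text, "\n", fixed = TRUE)[[1]]

# read_docx() with no arguments starts from officer's default empty template.
doc <- read_docx()
for (line in lines) {
  doc <- body_add_par(doc, line, style = "Normal")
}

# Hypothetical output location; print() writes the .docx file to disk.
out_path <- file.path(tempdir(), "ocr_output.docx")
print(doc, target = out_path)
```

Writing to `tempdir()` keeps the example from cluttering the working directory. Point `out_path` at a real folder to keep the file.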