Live Discussion - How To Read Text From Images Using Pytesseract

Поділитися
Вставка
  • Опубліковано 4 гру 2024

КОМЕНТАРІ • 101

  • @shivaprasad9787
    @shivaprasad9787 3 роки тому

    You have actually played a safe game in the video without resolving the extraction issues

  • @pushpendudas3372
    @pushpendudas3372 4 роки тому +2

    It was nice. Please keep doing session so that our learning curve doesn't stop.

  • @amitjajoo9510
    @amitjajoo9510 4 роки тому +3

    I like this type of session sir thank you for such a great session

  • @harsharora3535
    @harsharora3535 4 роки тому +3

    Great sir 👍 , before this video i can't imagine that python do this type of extraction also.

  • @vcjayan8206
    @vcjayan8206 2 роки тому +1

    Thank you Krish for the video. Really interesting and useful..!!

  • @suryav3959
    @suryav3959 4 роки тому +2

    This type of classes are really nice.
    Please do upload advanced topic in tesseracr in future videos.
    Thank you so much for this, one of my most waited video

  • @vaibhavghadage8127
    @vaibhavghadage8127 2 роки тому +4

    This is very helpful session for me ... Can you please make a video on how to convert Image to CSV ... If possible.
    Thank you so much 👍🤝

  • @tarunkumar9528
    @tarunkumar9528 3 роки тому +5

    Hi sir,
    I am currently working on a project Text Extraction from CPG(Consumer packaged goods) Product Images. Can we use Pytesseract to do the same?

  • @shashankshashi6361
    @shashankshashi6361 4 роки тому +6

    Sir, how extract data from PDF and separate the names and phone numbers and save it in Excel file

  • @venkateshnallapaneni8932
    @venkateshnallapaneni8932 4 роки тому

    Really you are helping me alot
    Thank you very much

  • @sajithakk4790
    @sajithakk4790 4 роки тому +1

    Sir please take a class about how to save the model created using cnn for future use using hdf5

  • @pattuchiitu8978
    @pattuchiitu8978 3 роки тому

    Thank you So much!, its really helpful

  • @anoopyadav4081
    @anoopyadav4081 3 роки тому +1

    I'm doing the same steps still it shows test for simple images, but does not show for other complex images like invoices, traffic signal sign etc. What may be the reason, pls guide.

  • @harshavardhanachyuta2055
    @harshavardhanachyuta2055 2 роки тому

    Please make a video on post processing of the text that is extracted from ocr ?? It is very important because of the design changes hardcording like contain string does not work. So we need to use nlp I guess.

  • @karishmagermanwala764
    @karishmagermanwala764 4 роки тому +2

    Do we have any library which can extract text from structured documents like passport, adhar card ,pancard ?

  • @adityanjsg99
    @adityanjsg99 Рік тому

    Sir, please make video on custom training and fine tuning! Please!

  • @CodePursuit
    @CodePursuit Рік тому

    @krish Naik sir could you please tell some way to extract address from a large text corpus? How can tesseract help to extract address from docs?

  • @shubairabbas5480
    @shubairabbas5480 4 роки тому +2

    Sir i am working with teseract and opencv for making an ML based Application on Invoice system. The project is basically on the systwm where you automate the data and convert into excel by training 10 bill of invoice.
    Please help sir

    • @manjubadiger2902
      @manjubadiger2902 4 роки тому

      Hi you can contact me regarding OCR on invoice projects

    • @shubairabbas5480
      @shubairabbas5480 4 роки тому

      @@manjubadiger2902 Thanks. But mera project band kar diya gaya ha. Maine bahat help mangi krish sir se, aur bhi loggo se LinkedIn par. But no one help. For this reason i lost my job. Tabse maine inn sab youtuber ki video dekhna band kar diya. Thanks for your favour.

    • @SportsKiCharcha
      @SportsKiCharcha 4 роки тому

      @@shubairabbas5480 Could you clarify abt the project..and why was it closed?

    • @rubs63
      @rubs63 3 роки тому

      @@manjubadiger2902 hey buddy.. I need some help, how to extract tables along with other datas from any scanned document??

  • @qzwwzt
    @qzwwzt 11 місяців тому

    Hi Sir, My challenge is reading texts inside images with wavy lines. The Image was created with a cell phone and just inserted as an image in a PDF file. Any special lib to do this? Pytesseract did work very well. It didn't capture well the wavy lines

  • @anilsharma32g
    @anilsharma32g Рік тому

    Dear Sir, I am your Subscriber
    I want to create a tool that finds text errors in the image.
    For Example:
    I forgot to write CONTACT US, BUY NOW, CONTACT NUMBER, SPELLING MISTAKE, etc... in my social media post.
    that the tool finds error and suggests what are missing or what is incorrect in social media post.
    🙏 Please guide me and suggest what course I need to buy or what I need to learn to create this tool
    Thank you

  • @Fast_4ward
    @Fast_4ward 11 місяців тому

    Hi @Krish, I want to extract text from the yolov8 predicted results which are scanned documents and predicted result images also have bounding boxes with their classes defined as, header, footer, subheading and paragraph. I want to extract text with respect to the class name and the confidence score.

  • @tahsinasif2612
    @tahsinasif2612 4 роки тому

    Hi Krish thanks a lot for your videos..I also want to know create container in aws

  • @aryashailesh12
    @aryashailesh12 4 роки тому

    Live or recorded Both ways are good, sir

  • @heenagirdher6443
    @heenagirdher6443 2 роки тому

    Helo sir. Could you please make a video on segmentation of handwritten text image to characters. 🙏

  • @ankitdhanore1355
    @ankitdhanore1355 2 роки тому

    thank you so much sir...

  • @InspireXAI
    @InspireXAI 4 роки тому

    Nice topic , krish

  • @Rahul_Singh_Rajput_04
    @Rahul_Singh_Rajput_04 2 роки тому

    Thankyou so much sir

  • @SuheylBey
    @SuheylBey 2 роки тому

    Hello thank you for the video. Is there a way to get the image preprocessed by the tesseract algorithm? When running tesseract in cmd I can get it by setting tessedit_write_images = 1, but in python I couldn't find a way to get preprocessed image.

  • @Dailythingsx
    @Dailythingsx 3 роки тому

    you are awesome .. Nice video.

  • @krishanansamy3479
    @krishanansamy3479 3 роки тому

    You make one environment to install all installation. Or make every time create new environment and install.plz clear me.

  • @SICK527
    @SICK527 3 роки тому

    sir ,how can we do it on multiple images and the extracted text should be created as .txt file as like in notepad

  • @rohit_mondal__
    @rohit_mondal__ 3 роки тому

    sir i tried this pyteserract on number plate detection...and its not showing great results...can you please make one video on number plate detection also ?

  • @shubhamrajput3148
    @shubhamrajput3148 3 роки тому

    Sir can you have lecture on OCR USING DEEP LEARNING

  • @RAJATSINGH-hk6xi
    @RAJATSINGH-hk6xi 3 роки тому

    Thanks a lot sir ..

  • @ChetanParmar
    @ChetanParmar 4 роки тому +1

    Can this also read invoices or bank statements? I think should be able to help my wife who is a CA

    • @krishnaik06
      @krishnaik06  4 роки тому +2

      Yes I have shown the example

    • @ChetanParmar
      @ChetanParmar 4 роки тому +2

      Oh sorry did I miss it I am was getting my food.

    • @ChetanParmar
      @ChetanParmar 4 роки тому +1

      How to send this data to excel files?

    • @ChetanParmar
      @ChetanParmar 4 роки тому +1

      What is the name of the writing pad

  • @punks0325
    @punks0325 3 роки тому

    Sir how can we train or retrain the model for new symbol ....
    So that it can detect the symbol ....

  • @ragha2846vs
    @ragha2846vs 4 роки тому

    This is amazing. Thanks. Can we extract tabular info from image as tables? how?

    • @ppakpahan
      @ppakpahan 3 роки тому

      I want to know how to do this as well....

  • @swatibohidar1678
    @swatibohidar1678 4 роки тому

    I am getting error:
    ImportError: cannot import name 'image_to_string' from 'pytesseract' (c:\python37\lib\site-packages\pytesseract\__init__.py)
    Just after importing tesseract and giving the path.
    Please help!!

  • @akshatpant9758
    @akshatpant9758 4 роки тому

    hello krish,
    i try to upload the same images like you are uploading i.e traffic image and invoice..i choose the exact same image from google but on running, image is displayed but no text is getting printed and for the case i take screen shot of Wikipedia text its is working absolutely fine..what could be the problem??

  • @ChetanParmar
    @ChetanParmar 4 роки тому +1

    What is the name of your writing pad

  • @krishnirwan4493
    @krishnirwan4493 2 роки тому

    I am unable to join ur membership can u guide to join the membership

  • @AnuRadha-fg8ry
    @AnuRadha-fg8ry 4 роки тому

    Sir..can we extract arabic and english text in pytesseract?if so,can you discuss in tomorrows session or put a video reg the same sir..

  • @Justtchannell
    @Justtchannell 2 роки тому +1

    Bro, can u try
    Image_to_boxes

  • @kushaggrkashyap9831
    @kushaggrkashyap9831 2 роки тому

    If we draw a circle over a text and take a snap of it then How will we extract that only content which is inside the circle.?

    • @awerqga
      @awerqga Рік тому

      Did you find answer for this?

  • @AnuRadha-fg8ry
    @AnuRadha-fg8ry 4 роки тому

    sir..can u put a new video for text extraction in azure for arabc and eng ID cards

  • @MaxVerstappen-cj4jw
    @MaxVerstappen-cj4jw 9 місяців тому

    hey im trying to build a pdf chat bot but i want to install ocr in it so that it recognizes image text too , can someone guide me plz

  • @dswithreddy7384
    @dswithreddy7384 4 роки тому

    Krish can you make this on Real time video

  • @adityasoni1639
    @adityasoni1639 10 місяців тому

    How to know, what's the accuracy of my ocr model ?

  • @meghnajain7889
    @meghnajain7889 3 роки тому

    How can I generate character level confidence score using tesseract??

  • @xyz-wl9uj
    @xyz-wl9uj Рік тому

    sir plz tell how to implement for multiple images

  • @koustavdutta5317
    @koustavdutta5317 4 роки тому

    Sir Debit Card is not working for getting membership ( Rs. 59 ) of your channel. Please help sir.

  • @lakshikaseksaria845
    @lakshikaseksaria845 Рік тому

    Hey! can u create a model for extracting pan number from pan card

  • @surajmota6415
    @surajmota6415 4 роки тому

    Sir please build handwritten Oct recognise using CNN...

    • @shubhamrajput3148
      @shubhamrajput3148 3 роки тому

      Sir have you found any solution for your queary ,as I also need OCR using deep learning tutorial

  • @Gamezone-kq5sx
    @Gamezone-kq5sx 2 роки тому

    You saved me

  • @harjeetsangwan7707
    @harjeetsangwan7707 Рік тому

    sir if video would be recorded then it would have be more helpfull rather than livestreaming

  • @milindwaykole8187
    @milindwaykole8187 4 роки тому +1

    Please let me know how we can install it in Linux

    • @manishankus573
      @manishankus573 3 роки тому

      Hey, follow this ua-cam.com/video/-fIlUcp69xo/v-deo.html.

  • @mashaelabdullah6758
    @mashaelabdullah6758 3 роки тому

    Hi, I am looking for medical prescriptions dataset where I read the handwritten text using OCR, anyone can share with me this dataset?

  • @atharvachouhan474
    @atharvachouhan474 2 роки тому

    Can it read Doctor's Handwriting?

  • @MuhammadZainE-
    @MuhammadZainE- 3 роки тому

    Great

  • @lokap9662
    @lokap9662 4 роки тому +1

    Yes yes

  • @lokaranjan4353
    @lokaranjan4353 2 роки тому

    can we use pytesseract to read kannada text

  • @MaheshSharma-ud1cj
    @MaheshSharma-ud1cj 4 роки тому +2

    Getting error Exec format error tesseract-ocr-w64-v5.exe

    • @MaheshSharma-ud1cj
      @MaheshSharma-ud1cj 4 роки тому +2

      Running code in colab

    • @venkateshnallapaneni8932
      @venkateshnallapaneni8932 4 роки тому

      I think this doesn't work on colab because we need to install tessarct exe file on our local system to use it. So use this on your local desktop jupyter notebook.

  • @tejasarondekar375
    @tejasarondekar375 3 роки тому

    what if the language is hindi or sanscrit will it work

  • @akshaymitra
    @akshaymitra 3 роки тому

    This is not working in tabular data in scanned images

  • @nsnilesh604
    @nsnilesh604 3 роки тому

    please help us with captcha images reading

  • @geshusinha6413
    @geshusinha6413 2 роки тому

    I want to just read particular part from images after classification

    • @geshusinha6413
      @geshusinha6413 2 роки тому

      like only read names from all aadhar cards photos

  • @AshokYadav-is7wx
    @AshokYadav-is7wx 2 роки тому

    How to extract hindi text in tessract.

  • @anannayaaaaa
    @anannayaaaaa Рік тому

    What about other languages

  • @InspireXAI
    @InspireXAI 4 роки тому

    Yes

  • @EpicEchoes-payal
    @EpicEchoes-payal 4 роки тому

    It show me module not found sir

  • @Joel95ify
    @Joel95ify 4 роки тому +2

    Tesseract only works when the image background and texts are clear. I tried to use tesseract on lcd panels and it gave bad results.

    • @adis6867
      @adis6867 4 роки тому

      There is one otherway to make it, first change the original image to binary image which will basically separate the text and non- text part. And then further feed it into tesseract. It will get improved

    • @aditinagar6688
      @aditinagar6688 4 роки тому

      @@adis6867 Can you elaborate the steps for it? It would be quite helpful.

  • @EpicEchoes-payal
    @EpicEchoes-payal 4 роки тому

    When I execute import pytesseract....

  • @ideygmail
    @ideygmail 4 роки тому +2

    Sir, I have a linux box. What are the steps for me? I have installed tesseract-ocr and pytesseract both the packages

    • @manishankus573
      @manishankus573 3 роки тому

      Hey, follow this ua-cam.com/video/-fIlUcp69xo/v-deo.html.

  • @KasimKhan-lk4sw
    @KasimKhan-lk4sw 2 роки тому

    what about ubuntu path

  • @ccreddy87009
    @ccreddy87009 2 роки тому

    hi krish

  • @lokap9662
    @lokap9662 4 роки тому +1

    Hi