You have actually played a safe game in the video without resolving the extraction issues
It was nice. Please keep doing sessions so that our learning curve doesn't stop.
I like this type of session, sir. Thank you for such a great session.
Great, sir 👍. Before this video I couldn't imagine that Python could do this type of extraction too.
Thank you Krish for the video. Really interesting and useful..!!
These types of classes are really nice.
Please do upload advanced topics on Tesseract in future videos.
Thank you so much for this, one of my most awaited videos.
This is a very helpful session for me... Can you please make a video on how to convert an image to CSV, if possible?
Thank you so much 👍🤝
Hi sir,
I am currently working on a project, text extraction from CPG (consumer packaged goods) product images. Can we use pytesseract for this?
Sir, how can I extract data from a PDF, separate the names and phone numbers, and save them in an Excel file?
Did you find a way, dude?
Have you got the information?
You are really helping me a lot.
Thank you very much
Sir, please take a class on how to save a model created using a CNN for future use in HDF5 format.
Thank you so much! It's really helpful.
I'm following the same steps, and it still shows text for simple images but not for more complex images like invoices, traffic signs, etc. What may be the reason? Please guide.
Please make a video on post-processing of the text extracted by OCR. It is very important because, when the document design changes, hardcoding such as string-contains checks does not work. So we need to use NLP, I guess.
Is there any library that can extract text from structured documents like passports, Aadhaar cards, or PAN cards?
Use the OpenCV library.
Sir, please make a video on custom training and fine-tuning! Please!
@krish Naik sir, could you please suggest some way to extract addresses from a large text corpus? How can Tesseract help to extract addresses from documents?
Sir, I am working with Tesseract and OpenCV to build an ML-based application for an invoice system. The project is basically a system that automates data extraction and converts the data into Excel after training on 10 invoice bills.
Please help, sir.
Hi, you can contact me regarding OCR on invoice projects.
@@manjubadiger2902 Thanks. But my project has been shut down. I asked Krish sir for help many times, and other people on LinkedIn too. But no one helped. For this reason I lost my job. Since then I have stopped watching videos from all these YouTubers. Thanks for your favour.
@@shubairabbas5480 Could you clarify about the project, and why was it closed?
@@manjubadiger2902 Hey buddy, I need some help: how do I extract tables along with other data from any scanned document?
Hi Sir, my challenge is reading text inside images with wavy lines. The image was taken with a cell phone and just inserted as an image in a PDF file. Is there any special library to do this? Pytesseract did not work very well; it didn't capture the wavy lines well.
Dear Sir, I am your subscriber.
I want to create a tool that finds text errors in an image.
For example:
I forgot to write CONTACT US, BUY NOW, or a CONTACT NUMBER, or made a SPELLING MISTAKE, etc., in my social media post.
The tool should find the errors and suggest what is missing or incorrect in the post.
🙏 Please guide me and suggest what course I need to buy or what I need to learn to create this tool.
Thank you
Hi @Krish, I want to extract text from YOLOv8 prediction results on scanned documents. The predicted result images have bounding boxes with classes defined as header, footer, subheading, and paragraph. I want to extract the text for each class name along with its confidence score.
Hi Krish, thanks a lot for your videos. I also want to know how to create a container in AWS.
Live or recorded, both ways are good, sir.
Hello sir. Could you please make a video on segmenting a handwritten text image into characters? 🙏
Thank you so much, sir...
Nice topic, Krish.
Thank you so much, sir.
Hello thank you for the video. Is there a way to get the image preprocessed by the tesseract algorithm? When running tesseract in cmd I can get it by setting tessedit_write_images = 1, but in python I couldn't find a way to get preprocessed image.
You are awesome. Nice video.
Do you create one environment for all installations, or create a new environment each time and install there? Please clarify.
Sir, how can we do this for multiple images, with the extracted text saved as a .txt file, as in Notepad?
Sir, I tried pytesseract for number plate detection and it's not showing great results. Can you please make a video on number plate detection too?
Sir, can you do a lecture on OCR using deep learning?
Thanks a lot sir ..
Can this also read invoices or bank statements? I think it should be able to help my wife, who is a CA.
Yes, I have shown the example.
Oh sorry, did I miss it? I was getting my food.
How to send this data to excel files?
What is the name of the writing pad?
Sir, how can we train or retrain the model for a new symbol,
so that it can detect the symbol?
This is amazing. Thanks. Can we extract tabular info from an image as tables? How?
I want to know how to do this as well....
I am getting an error:
ImportError: cannot import name 'image_to_string' from 'pytesseract' (c:\python37\lib\site-packages\pytesseract\__init__.py)
It happens just after importing tesseract and giving the path.
Please help!!
Hello Krish,
I tried the same images you used, i.e. the traffic image and the invoice. I chose the exact same images from Google, but on running, the image is displayed and no text is printed. When I take a screenshot of Wikipedia text, it works absolutely fine. What could be the problem?
What is the name of your writing pad?
I am unable to join your membership. Can you guide me on how to join?
Sir, can we extract Arabic and English text with pytesseract? If so, can you discuss it in tomorrow's session or post a video on it?
Have you tried that?
Bro, can you try image_to_boxes?
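For reference, a minimal sketch of working with the image_to_boxes output suggested above. It assumes the plain-text box format that Tesseract emits, one character per line as "char left bottom right top page", with coordinates measured from the bottom-left corner of the image. CharBox, parse_boxes, and to_top_left are hypothetical helper names, and the sample string is made up for illustration.

```python
from typing import List, NamedTuple, Tuple


class CharBox(NamedTuple):
    """One recognized character and its box (bottom-left origin). Hypothetical helper type."""
    char: str
    left: int
    bottom: int
    right: int
    top: int


def parse_boxes(box_text: str) -> List[CharBox]:
    """Parse box-format text into CharBox records, skipping malformed lines."""
    boxes = []
    for line in box_text.splitlines():
        parts = line.split()
        if len(parts) < 5:
            continue  # not a "char left bottom right top [page]" line
        left, bottom, right, top = (int(v) for v in parts[1:5])
        boxes.append(CharBox(parts[0], left, bottom, right, top))
    return boxes


def to_top_left(box: CharBox, image_height: int) -> Tuple[int, int, int, int]:
    """Convert to (x1, y1, x2, y2) with a top-left origin, as most drawing APIs expect."""
    return (box.left, image_height - box.top, box.right, image_height - box.bottom)


# Made-up sample in the box format. With Pillow and pytesseract installed,
# the text would instead come from something like:
#   pytesseract.image_to_boxes(Image.open("sample.png"))  # "sample.png" is hypothetical
sample = "H 10 20 30 60 0\ni 32 20 40 55 0"
parsed = parse_boxes(sample)
```

The flip in to_top_left matters when drawing the boxes with OpenCV or Pillow, since both place the origin at the top-left corner.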
If we draw a circle around some text and take a photo of it, how can we extract only the content inside the circle?
Did you find an answer for this?
Sir, can you post a new video on text extraction in Azure for Arabic and English ID cards?
Hey, I'm trying to build a PDF chatbot, but I want to add OCR to it so that it recognizes text in images too. Can someone guide me, please?
Krish, can you do this on real-time video?
How do I know the accuracy of my OCR model?
How can I generate character-level confidence scores using Tesseract?
Sir, please tell us how to implement this for multiple images.
Sir, my debit card is not working for getting the membership (Rs. 59) of your channel. Please help, sir.
Hey! Can you create a model for extracting the PAN number from a PAN card?
Sir, please build handwritten OCR recognition using a CNN...
Sir, have you found any solution for your query? I also need a tutorial on OCR using deep learning.
You saved me
Sir, if the video were recorded, it would be more helpful than livestreaming.
Please let me know how we can install it on Linux.
Hey, follow this ua-cam.com/video/-fIlUcp69xo/v-deo.html.
Hi, I am looking for a medical prescriptions dataset so that I can read the handwritten text using OCR. Can anyone share this dataset with me?
Can it read Doctor's Handwriting?
Great
Yes yes
Can we use pytesseract to read Kannada text?
Getting the error: Exec format error tesseract-ocr-w64-v5.exe
I am running the code in Colab.
I think this doesn't work on Colab because we need to install the Tesseract exe file on our local system to use it. So use this in a Jupyter notebook on your local desktop.
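One hedged reading of the error above: an "Exec format error" usually means a Windows .exe was launched on a non-Windows system, and Colab notebooks run on Linux. A possible alternative, assuming the Linux build has been installed first (for example with "!apt-get install -y tesseract-ocr" and "!pip install pytesseract" in a notebook cell), is to locate the binary on PATH and point pytesseract at it. The helper names below are hypothetical.

```python
import shutil
from typing import Optional, Sequence


def find_tesseract(candidates: Sequence[str] = ("tesseract",)) -> Optional[str]:
    """Return the full path of the first candidate executable found on PATH, else None."""
    for name in candidates:
        path = shutil.which(name)
        if path:
            return path
    return None


def configure_pytesseract() -> str:
    """Point pytesseract at the system binary. Assumes pytesseract is installed."""
    path = find_tesseract()
    if path is None:
        raise RuntimeError("tesseract binary not found on PATH; install it first")
    import pytesseract  # imported lazily so find_tesseract works without it

    # tesseract_cmd is the setting pytesseract reads to locate the executable.
    pytesseract.pytesseract.tesseract_cmd = path
    return path
```

On Linux the binary is normally found on PATH after installation, so no Windows-style path to an .exe file is needed.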
What if the language is Hindi or Sanskrit? Will it work?
This is not working on tabular data in scanned images.
Please help us with reading CAPTCHA images.
I just want to read a particular part of the images after classification,
like reading only the names from all Aadhaar card photos.
How do I extract Hindi text in Tesseract?
What about other languages?
Yes
It shows me "module not found", sir.
Tesseract only works when the image background and text are clear. I tried to use Tesseract on LCD panels and it gave bad results.
There is one other way to do it: first convert the original image to a binary image, which will basically separate the text from the non-text part, and then feed it into Tesseract. The results will improve.
@@adis6867 Can you elaborate on the steps for it? It would be quite helpful.
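The binarization idea above can be sketched roughly as follows. This is only one way to do it: it assumes Otsu's method as the thresholding rule (the comment does not say which rule was meant) and assumes Pillow and pytesseract are installed for the final OCR step. All function names are hypothetical.

```python
def histogram(pixels):
    """Count how often each 8-bit grayscale value (0-255) occurs."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    return hist


def otsu_threshold(hist):
    """Pick the threshold that maximizes between-class variance (Otsu's method)."""
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels at or below the candidate threshold
    sum_bg = 0  # sum of their intensities
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue
        w_fg = total - w_bg
        if w_fg == 0:
            break
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t


def binarize(rows):
    """Map a grayscale image (list of rows) to pure black (0) and white (255)."""
    t = otsu_threshold(histogram(p for row in rows for p in row))
    return [[255 if p > t else 0 for p in row] for row in rows]


def ocr_binarized(path):
    """Binarize an image file, then run Tesseract on it. Needs Pillow and pytesseract."""
    from PIL import Image
    import pytesseract

    gray = Image.open(path).convert("L")        # 8-bit grayscale
    t = otsu_threshold(gray.histogram())        # 256-bin histogram for mode "L"
    bw = gray.point(lambda p: 255 if p > t else 0)
    return pytesseract.image_to_string(bw)
```

A global threshold like this tends to help on evenly lit scans; photos with shadows or glare may need a different rule, so treat it as a starting point.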
When I execute import pytesseract....
Sir, I have a Linux box. What are the steps for me? I have installed both packages, tesseract-ocr and pytesseract.
Hey, follow this ua-cam.com/video/-fIlUcp69xo/v-deo.html.
What about the Ubuntu path?
hi krish
Hi
Hi