Just watched some of your previous tf videos, and man was I blown away by how you didn't already have at least a million subs. Keep up the awesome work!
Hi nick, I love your videos because of how detailed you are. Since I discovered your channel I have never been the same because I have learned a lot from your teaching method and technical stuff on AI and deep learning. Kudos to you for your effort to impart knowledge.
I love the video thanks Nick! Something that might be useful for Mac users is adding the following option at the beginning of step 2: ocr_model = PaddleOCR(lang=‘en’, use_gpu=False) Tested it on MBP Intel and worked like a charm with impressive accuracy :-)
@@NicholasRenotte Hello sir your channel has pretty amazing content .I was planning on building an app that helps in detecting fake medicines using computer vision techniques .Could u please make a video wherein it analyse the input images and detects the difference and highlight it or any related tutorial .Thanks a lot
Hi Nick! TypeError Traceback (most recent call last) in ----> 1 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path) /usr/local/lib/python3.7/dist-packages/paddleocr/tools/infer/utility.py in draw_ocr(image, boxes, txts, scores, drop_score, font_path) 380 box_num = len(boxes) 381 for i in range(box_num): --> 382 if scores is not None and (scores[i] < drop_score or 383 math.isnan(scores[i])): 384 continue TypeError: '
looks like you're comparing a tuple to a float which isn't allowed. seems like scores[i] or drop_score is not the type of data that you think it is. Try using the type() func or print out the data.
by the way, i think this video is created a weeks ago because there has been certain changes in the stable release and code would have to be modified accordingly
Very good one. Have you tried it for watermark removal? I am testing paddle vs easyocr vs tesseract vs craft... Maximum detection success rate after training with synthetic data is 45%
Hello Nicholas, thank you so much for your awesome videos! they are really very helpful! keep it up . I have one question if we can use paddle ocr for handwritten text too? Thank you
hello, Awesome video!, i have a small doubt, if im trying to extract data from a table is there any way to extract data as as rows instead of selecting each line as a raw which it does by default? Thank you!
superb video, but I have a question, is there any possible way to extract texts from only specific parts of an image, for example. I have an image of the license card, I need to extract only the name and address. Any help you can provide would be greatly appreciated. Thank you.
You can isolate that part of the frame and pass that through to the OCR model. This is a little trickier though particularly if that region is going to be in a different location each time.
Hey, I saw your TensorFlow object detection API video. I have a query to ask. I want to know how to visualize the custom trained model & how we can edit these layers for creating a new model.
Not possible to dive into the underlying layers with the TFOD API afaik. Can fine tune if needed but added explicit layers using keras for example isn't possible .
@NicholasRenotte Is there any option like paddle OCR can we create custom OCR model and application from scratch, what are the steps has to be done from your thoughts.
Thanks for posting. It's not directly related but perhaps you/others can share ideas on how to solve something I've been struggling with: in a supermarket, how to identify what price belongs to what product? Once the above is clear, the OCR solution would fit perfectly.
Kinda tricky, first up, this would be a HUGE model. One approach could be to classify the products with an image could classify the products.....actually wait, do you need to map the physical photos of the products to prices?
So if i have font file but language is old forgotten one from 500s, can I still train OCR with that font which I have for that language? Purpose is to digitilize books from museum
I know this video is old and would be surprised to see a reply but the format is a little different now. To get the text, it is now 4 slots. To get the text, it would be [0][line number][1][0] and I cant use the for loop when the var is not in the first slot. Suggestions?
In instantiating model and detect section, when i setup the model with ocr = PaddleOCR(leng='eng') , I am getting an error that is: AttributeError: module 'paddle.fluid.core_avx' has no attribute 'is_compiled_with_rocm' How can solve this, any help?
Hey nick, i've run into another error i can't seem to install paddleocr. the error "failed building wheel for python-Levenshtein" keeps appearing i've tried everything i can find related to paddleocr on the internet and can't seem to find a way to fix the error
@@NicholasRenotte hey nick thanks for replying, apparently it's smtng to do with my computer? Cause when i send the code to my friend who is using a different model year and brand computer it can run perfectly. Anyways, thanks for the help 😄
Good Afternoon Mr Nicholas Renotte, my kernel keeps dying when i run 'from paddleocr import PaddleOCR...' even though i manage to install the packages required... do you know how i might resolve this issue or what the potential problem is? THANK YOU!!
Hi Nick, very well explained. But I get error 'Can not import paddle core while this file exists: /usr/local/lib/python3.10/dist-packages/paddle/fluid/libpaddle.so' Please help
RuntimeError: (PreconditionNotMet) The third-party dynamic library (cudnn64_8.dll) that Paddle depends on is not configured correctly. (error code is 126)
Could not find a version that satisfies the requirement paddlepaddle-gpu==2.0.0 (from versions: none) I'm getting above error can anyone suggest a solution
FileNotFoundError: Could not find module 'C:\Users\budho\anaconda3\Library\bin\geos_c.dll' (or one of its dependencies). Try using the full path with constructor syntax. while importing paddleocr!!!!!!!!!
Hi Nick! Thank you very much for a super great tutorial. I would truly love if you can share how one can train the paddle ocr with some customized images, which would help improve on text extraction for different use cases. Once again thank you very much.
please help with this issue TypeError Traceback (most recent call last) Cell In[141], line 6 3 plt.figure(figsize=(15,15)) 5 # draw annotations on image ----> 6 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path) 8 # show the image using matplotlib 9 plt.imshow(annotated) File ~\AppData\Roaming\Python\Python311\site-packages\paddleocr\tools\infer\utility.py:387, in draw_ocr(image, boxes, txts, scores, drop_score, font_path) 385 box_num = len(boxes) 386 for i in range(box_num): --> 387 if scores is not None and (scores[i] < drop_score or 388 math.isnan(scores[i])): 389 continue 390 box = np.reshape(np.array(boxes[i]), [-1, 1, 2]).astype(np.int64)
Just watched some of your previous tf videos, and man was I blown away by how you didn't already have at least a million subs. Keep up the awesome work!
This guy deserves more subs! One of the best content creator in the ML space.
This model is crazy good with accuracy! Love your tutorial as always
Hi nick, I love your videos because of how detailed you are. Since I discovered your channel I have never been the same because I have learned a lot from your teaching method and technical stuff on AI and deep learning. Kudos to you for your effort to impart knowledge.
Agree
Agree 👌👍👌
Thanks so much @Salabson! Means a ton to me 🙏
Always LOVED YOU CONTENT FROM DAY 1 SINCE I GOT HERE! Simple and Intelligent content which any KID can understand the most complex things! good job!
Your channel is the only one that I've click the bell button. Your vids are 💎
🙏❤
Still works, very easy, thanks Nick!
Thanks for your positivity in all of your videos 😊
Gotta be! Life is too short, just tryna have some fun along the way!
I love the video thanks Nick! Something that might be useful for Mac users is adding the following option at the beginning of step 2:
ocr_model = PaddleOCR(lang=‘en’, use_gpu=False)
Tested it on MBP Intel and worked like a charm with impressive accuracy :-)
Ha, that's awesome. Thanks so much for sharing my guy!!
Accidently stumbled upon this channel .Thanks a lot ,This was exactly what I was looking for !!
Glad you stumbled over here Janice, welcome to the team!
@@NicholasRenotte Hello sir your channel has pretty amazing content .I was planning on building an app that helps in detecting fake medicines using computer vision techniques .Could u please make a video wherein it analyse the input images and detects the difference and highlight it or any related tutorial .Thanks a lot
thanks for always posting videos, love them
Stoked man, so pumped you're enjoying them!
Like always, 😎😎😎. Good job Nicholas.
Thanks a mil!
Amazing content, you are the best.
Hi Nick!
TypeError Traceback (most recent call last)
in
----> 1 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path)
/usr/local/lib/python3.7/dist-packages/paddleocr/tools/infer/utility.py in draw_ocr(image, boxes, txts, scores, drop_score, font_path)
380 box_num = len(boxes)
381 for i in range(box_num):
--> 382 if scores is not None and (scores[i] < drop_score or
383 math.isnan(scores[i])):
384 continue
TypeError: '
this is kinda late, but has anyone gotten a solution for this? chatgpt is not helping at all
looks like you're comparing a tuple to a float which isn't allowed. seems like scores[i] or drop_score is not the type of data that you think it is. Try using the type() func or print out the data.
by the way, i think this video is created a weeks ago because there has been certain changes in the stable release and code would have to be modified accordingly
Nice and simple! 👏👏
Thanks a billion @Gustavo!
Great Presentation and work. did you try to train it by using your own datasets ?
Hiii @Nicholas is it possible for live camera detection???
😊 😊 😊 😊 😊 😊 😊 thank you so much
Very good one. Have you tried it for watermark removal? I am testing paddle vs easyocr vs tesseract vs craft... Maximum detection success rate after training with synthetic data is 45%
Amazing video! Is it possible to implement it in a mobile app through React Native? Thanks a lot Nick!!!
good and clear.
Just here to let you know how awesome you are ❤️
Love your videos!
Thanks a bunch @tejraj!!
So awesome!!
Hello Nicholas, thank you so much for your awesome videos! they are really very helpful! keep it up . I have one question if we can use paddle ocr for handwritten text too? Thank you
Nope, would suggest using a handwritten dl model for that!
bro I LOVE YOU
Loved the video man this is an awsome video, Is there anyway we can specifially get only drug name from that text.
Yep, should be able to extract it from the results using some indexing.
Hi Nicolas. Your video was awesome and it was a great help in implementing our ocr model. If possible, Please do a tutorial for paddle lite in mobile
Thank you :)
hello, Awesome video!, i have a small doubt, if im trying to extract data from a table is there any way to extract data as as rows instead of selecting each line as a raw which it does by default? Thank you!
superb video, but I have a question, is there any possible way to extract texts from only specific parts of an image, for example. I have an image of the license card, I need to extract only the name and address. Any help you can provide would be greatly appreciated. Thank you.
Same question.
You can isolate that part of the frame and pass that through to the OCR model. This is a little trickier though particularly if that region is going to be in a different location each time.
@@NicholasRenotte yeah, figured out a way using object detection and crop the roi then pasting roi's separetly. Anyway Thank you, You are the best
Loving it Nicholas........can you please share the video link or upload for, how we can detect the specific drug label.
Thanks in Adv..
Hi nick. who is jan lokon, lokoun at 2:26 ?
Hey, I saw your TensorFlow object detection API video. I have a query to ask.
I want to know how to visualize the custom trained model & how we can edit these layers for creating a new model.
Not possible to dive into the underlying layers with the TFOD API afaik. Can fine tune if needed but added explicit layers using keras for example isn't possible .
You are awesome!!!!
Hello sir.....kindly do a video on how to recognise handwritten text characters.....it would be great help
@NicholasRenotte
Is there any option like paddle OCR can we create custom OCR model and application from scratch, what are the steps has to be done from your thoughts.
Amazing content as always ❤️! Is this licensed model? How good is paddleOCR when compared to Tesseract?
It's very good especially when it comes to preprocessing (virtually none needed). Looks like it's open source under Apache!
you are the best
noooo, you are!
Thanks for posting.
It's not directly related but perhaps you/others can share ideas on how to solve something I've been struggling with: in a supermarket, how to identify what price belongs to what product? Once the above is clear, the OCR solution would fit perfectly.
Kinda tricky, first up, this would be a HUGE model. One approach could be to classify the products with an image could classify the products.....actually wait, do you need to map the physical photos of the products to prices?
@@NicholasRenotte yup. We need from an image to get the products and their corresponding prices
@@juanricardorivera3591 you could build an image classifier to classify the product then once classified do a lookup to a prices table!
thank you for very greate video. but in paddle paddle do we need preprocess our image first ?
first of all thanks so much u have been a great help, i clicked on the link below to install but it was all chinese
Awsome content again! I'm wondering if you have created a tutorial for Emotion Recognition? Thanks!
Yup! Check out the body language decoder vid on the channel!
Hey, Nicholas Thanks for the content. Can we run this on the windows vscode
Good job . In anpr system in pi4 who the best easyocr or paddleocr
I'm using Paddle these days, works really well. Haven't done a proper comparison though @Hatem
Can u make a video on recognizing words in handwritten sentence?
far better than tesseract and easy ocr i guess , im using it for prediciting the food can serial number and paddle is really great
amazing video .. can you make tutorials about custom mask rcnn for object segmentation and thanks
So if i have font file but language is old forgotten one from 500s, can I still train OCR with that font which I have for that language? Purpose is to digitilize books from museum
i want to extract hand written text from image,
I know this video is old and would be surprised to see a reply but the format is a little different now. To get the text, it is now 4 slots. To get the text, it would be [0][line number][1][0] and I cant use the for loop when the var is not in the first slot. Suggestions?
also I am very new to coding
I'm getting an error installing paddleOCR. I have a MacBook with M1 chip. It would be great if you could help me with that! Thank you
Hi, I have same error, did you happen to find the fix?
Fix for M1 issues with PyMyPDF:
ERROR: Failed building wheel for PyMuPDF
# brew install swig
# pip3 install paddleocr --upgrade PyMuPDF==1.21.1
thank you , it worked
Which one is the best? EasyOCR or PaddleOCR..!
In instantiating model and detect section, when i setup the model with ocr = PaddleOCR(leng='eng') , I am getting an error that is:
AttributeError: module 'paddle.fluid.core_avx' has no attribute 'is_compiled_with_rocm'
How can solve this, any help?
i have the same problem
Seems to me that your pc might not have GPU integrated. While installing paddle-ocr do not provide the gpu argument
@@aviparnabiswas3707 hello you can change it with this:
# Setup model
ocr_model = PaddleOCR(lang='en', use_gpu=False)
Hi Nick, I'm having a little issue...when I try to call PaddleOCR in my python script I keep getting a WinError 126, any advice?
Got a longer error for me?
@@NicholasRenotte yes same i am also getting when i was trying to write the script in vscode
Hey nick, i've run into another error i can't seem to install paddleocr. the error "failed building wheel for python-Levenshtein" keeps appearing i've tried everything i can find related to paddleocr on the internet and can't seem to find a way to fix the error
Hey Nick, I am also facing same error.
Got Visual C installed? Take a look at this; stackoverflow.com/questions/37676623/cant-install-levenshtein-distance-package-on-windows-python-3-5
@@NicholasRenotte hey nick thanks for replying, apparently it's smtng to do with my computer? Cause when i send the code to my friend who is using a different model year and brand computer it can run perfectly. Anyways, thanks for the help 😄
@@ridewithkay hmmm, does it run on Colab? Might be a faster solution!
Good Afternoon Mr Nicholas Renotte, my kernel keeps dying when i run 'from paddleocr import PaddleOCR...' even though i manage to install the packages required... do you know how i might resolve this issue or what the potential problem is? THANK YOU!!
Hi nick, I want to ask you if you try to deploy paddleocr on mobile application ?
iam unable to extract only text ,but able to extract info about coordinates and text together .what should i do extraxt only text
Hi Nick, very well explained. But I get error 'Can not import paddle core while this file exists: /usr/local/lib/python3.10/dist-packages/paddle/fluid/libpaddle.so' Please help
Hi
Is it possible to use this model to extract : number of rows and columns, elements of matrix ?
How did you handle the PHI?
HI nick,, can you pls ping how to retrain that Paddleocr module.. any video or documents.. Thanks in advance
github.com/PaddlePaddle/PaddleOCR/blob/release/2.2/doc/doc_en/recognition_en.md
RuntimeError: (PreconditionNotMet) The third-party dynamic library (cudnn64_8.dll) that Paddle depends on is not configured correctly. (error code is 126)
Instead of image can I use scanned pdf
Thanks for the tutorial ! Could you tell me please how to use a custom dictionary with python ?
"positional argument follows keyword argument"
can you add links to documentation for OCR?? I've got problems with arguments if I set gpu as False
pypi.org/project/paddleocr/
👏👏
pdf to json extract how
Could not find a version that satisfies the requirement paddlepaddle-gpu==2.0.0 (from versions: none)
I'm getting above error can anyone suggest a solution
❤
Great video, Thanks a lot!!!!!!!!!
can you please make a video on how to install and use paddle ocr on raspberry pi???
the for is not working for me
Wow!
FileNotFoundError: Could not find module 'C:\Users\budho\anaconda3\Library\bin\geos_c.dll' (or one of its dependencies). Try using the full path with constructor syntax.
while importing paddleocr!!!!!!!!!
Hmmm, not sure if this will help but does this work? github.com/Toblerity/Shapely/issues/1032
Hi Nick! Thank you very much for a super great tutorial. I would truly love if you can share how one can train the paddle ocr with some customized images, which would help improve on text extraction for different use cases. Once again thank you very much.
Did you give this a go?
please help with this issue
TypeError Traceback (most recent call last)
Cell In[141], line 6
3 plt.figure(figsize=(15,15))
5 # draw annotations on image
----> 6 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path)
8 # show the image using matplotlib
9 plt.imshow(annotated)
File ~\AppData\Roaming\Python\Python311\site-packages\paddleocr\tools\infer\utility.py:387, in draw_ocr(image, boxes, txts, scores, drop_score, font_path)
385 box_num = len(boxes)
386 for i in range(box_num):
--> 387 if scores is not None and (scores[i] < drop_score or
388 math.isnan(scores[i])):
389 continue
390 box = np.reshape(np.array(boxes[i]), [-1, 1, 2]).astype(np.int64)
bro make QR on band to solve
how would you do real time ocr detection via webcam?
Check this out: ua-cam.com/video/0-4p_QgrdbE/v-deo.html