Rip out Drug Labels using Deep Learning with PaddleOCR & Python

Nicholas Renotte

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 8 гру 2024

КОМЕНТАРІ • 125

@danielbozinovski8419 3 роки тому ⁺⁵
Just watched some of your previous tf videos, and man was I blown away by how you didn't already have at least a million subs. Keep up the awesome work!
@dan3370 3 роки тому ⁺⁶
This guy deserves more subs! One of the best content creator in the ML space.
@csoham96 2 роки тому ⁺¹
This model is crazy good with accuracy! Love your tutorial as always
@mathewsjoby6455 3 роки тому ⁺²
Always LOVED YOU CONTENT FROM DAY 1 SINCE I GOT HERE! Simple and Intelligent content which any KID can understand the most complex things! good job!
@brunospfc8511 Рік тому ⁺¹
Still works, very easy, thanks Nick!
@hilariouzclips1235 3 роки тому ⁺³
Your channel is the only one that I've click the bell button. Your vids are 💎
@NicholasRenotte 3 роки тому
🙏❤
@salabsonisa1650 3 роки тому ⁺³
Hi nick, I love your videos because of how detailed you are. Since I discovered your channel I have never been the same because I have learned a lot from your teaching method and technical stuff on AI and deep learning. Kudos to you for your effort to impart knowledge.
@flashkachannel2756 3 роки тому ⁺²
Agree
@alexandregagne4151 3 роки тому ⁺²
Agree 👌👍👌
@NicholasRenotte 3 роки тому
Thanks so much @Salabson! Means a ton to me 🙏
@pseudounknow5559 3 роки тому ⁺¹
Thanks for your positivity in all of your videos 😊
@NicholasRenotte 3 роки тому
Gotta be! Life is too short, just tryna have some fun along the way!
@leoletchoum 3 роки тому ⁺³
I love the video thanks Nick! Something that might be useful for Mac users is adding the following option at the beginning of step 2:
ocr_model = PaddleOCR(lang=‘en’, use_gpu=False)
Tested it on MBP Intel and worked like a charm with impressive accuracy :-)
@NicholasRenotte 3 роки тому
Ha, that's awesome. Thanks so much for sharing my guy!!
@janicejose5809 3 роки тому ⁺¹
Accidently stumbled upon this channel .Thanks a lot ,This was exactly what I was looking for !!
@NicholasRenotte 3 роки тому
Glad you stumbled over here Janice, welcome to the team!
@janicejose5809 3 роки тому
@@NicholasRenotte Hello sir your channel has pretty amazing content .I was planning on building an app that helps in detecting fake medicines using computer vision techniques .Could u please make a video wherein it analyse the input images and detects the difference and highlight it or any related tutorial .Thanks a lot
@VivekSingh-in6rq 3 роки тому ⁺¹
thanks for always posting videos, love them
@NicholasRenotte 3 роки тому
Stoked man, so pumped you're enjoying them!
@samahashabbir6584 2 роки тому ⁺⁴
Hi Nick!
TypeError Traceback (most recent call last)
in
----> 1 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path)
/usr/local/lib/python3.7/dist-packages/paddleocr/tools/infer/utility.py in draw_ocr(image, boxes, txts, scores, drop_score, font_path)
380 box_num = len(boxes)
381 for i in range(box_num):
--> 382 if scores is not None and (scores[i] < drop_score or
383 math.isnan(scores[i])):
384 continue
TypeError: '
@KD-qo7kx 9 місяців тому
this is kinda late, but has anyone gotten a solution for this? chatgpt is not helping at all
@ball-e Місяць тому
looks like you're comparing a tuple to a float which isn't allowed. seems like scores[i] or drop_score is not the type of data that you think it is. Try using the type() func or print out the data.
@alexandregagne4151 3 роки тому ⁺²
Like always, 😎😎😎. Good job Nicholas.
@NicholasRenotte 3 роки тому
Thanks a mil!
@serboutyassine1733 3 роки тому ⁺¹
Amazing content, you are the best.
@VivekSingh-in6rq 3 роки тому ⁺¹
by the way, i think this video is created a weeks ago because there has been certain changes in the stable release and code would have to be modified accordingly
@PD-iu2sr 2 роки тому ⁺²
Hiii @Nicholas is it possible for live camera detection???
@donovanvanderlinde3478 9 місяців тому ⁺¹
Fix for M1 issues with PyMyPDF:
ERROR: Failed building wheel for PyMuPDF
# brew install swig
# pip3 install paddleocr --upgrade PyMuPDF==1.21.1
@medjawherzgolli9507 9 місяців тому
thank you , it worked
@brianfc1906 3 роки тому ⁺¹
Just here to let you know how awesome you are ❤️
@shareb1t 3 місяці тому
So if i have font file but language is old forgotten one from 500s, can I still train OCR with that font which I have for that language? Purpose is to digitilize books from museum
@gustavojuantorena 3 роки тому ⁺¹
Nice and simple! 👏👏
@NicholasRenotte 3 роки тому
Thanks a billion @Gustavo!
@rachanareddy6115 2 роки тому ⁺¹
I'm getting an error installing paddleOCR. I have a MacBook with M1 chip. It would be great if you could help me with that! Thank you
@rikeshpatel309 2 роки тому
Hi, I have same error, did you happen to find the fix?
@SasiKumar-zz5ps 8 місяців тому
@NicholasRenotte
Is there any option like paddle OCR can we create custom OCR model and application from scratch, what are the steps has to be done from your thoughts.
@saharshsinha5349 2 роки тому ⁺¹
In instantiating model and detect section, when i setup the model with ocr = PaddleOCR(leng='eng') , I am getting an error that is:
AttributeError: module 'paddle.fluid.core_avx' has no attribute 'is_compiled_with_rocm'
How can solve this, any help?
@lailigita4251 2 роки тому
i have the same problem
@aviparnabiswas3707 2 роки тому
Seems to me that your pc might not have GPU integrated. While installing paddle-ocr do not provide the gpu argument
@myriamagrebi2114 Рік тому
@@aviparnabiswas3707 hello you can change it with this:
# Setup model
ocr_model = PaddleOCR(lang='en', use_gpu=False)
@romeshrajoria4267 3 роки тому ⁺¹
Hey, I saw your TensorFlow object detection API video. I have a query to ask.
I want to know how to visualize the custom trained model & how we can edit these layers for creating a new model.
@NicholasRenotte 3 роки тому
Not possible to dive into the underlying layers with the TFOD API afaik. Can fine tune if needed but added explicit layers using keras for example isn't possible .
@dinugakasun5218 3 роки тому ⁺¹
superb video, but I have a question, is there any possible way to extract texts from only specific parts of an image, for example. I have an image of the license card, I need to extract only the name and address. Any help you can provide would be greatly appreciated. Thank you.
@gisellepena7927 3 роки тому ⁺¹
Same question.
@NicholasRenotte 3 роки тому ⁺¹
You can isolate that part of the frame and pass that through to the OCR model. This is a little trickier though particularly if that region is going to be in a different location each time.
@dinugakasun5218 3 роки тому
@@NicholasRenotte yeah, figured out a way using object detection and crop the roi then pasting roi's separetly. Anyway Thank you, You are the best
@ameerazam3269 3 роки тому ⁺¹
😊 😊 😊 😊 😊 😊 😊 thank you so much
@svetoslavchilingirov3858 Місяць тому
Very good one. Have you tried it for watermark removal? I am testing paddle vs easyocr vs tesseract vs craft... Maximum detection success rate after training with synthetic data is 45%
@leezx337 Рік тому
Good Afternoon Mr Nicholas Renotte, my kernel keeps dying when i run 'from paddleocr import PaddleOCR...' even though i manage to install the packages required... do you know how i might resolve this issue or what the potential problem is? THANK YOU!!
@sheetaljoshi6740 Рік тому
RuntimeError: (PreconditionNotMet) The third-party dynamic library (cudnn64_8.dll) that Paddle depends on is not configured correctly. (error code is 126)
@bankcrawpackchannel6936 3 роки тому ⁺¹
So awesome!!
@TEAM-sn1lw 8 місяців тому
iam unable to extract only text ,but able to extract info about coordinates and text together .what should i do extraxt only text
@vasamsettisharmilasairagha5863 2 роки тому
Could not find a version that satisfies the requirement paddlepaddle-gpu==2.0.0 (from versions: none)
I'm getting above error can anyone suggest a solution
@ChetnaLongani Рік тому
Hi Nick, very well explained. But I get error 'Can not import paddle core while this file exists: /usr/local/lib/python3.10/dist-packages/paddle/fluid/libpaddle.so' Please help
@BurnLai Рік тому
good and clear.
@cissemy Рік тому
Hi
Is it possible to use this model to extract : number of rows and columns, elements of matrix ?
@ridewithkay 3 роки тому ⁺¹
Hey nick, i've run into another error i can't seem to install paddleocr. the error "failed building wheel for python-Levenshtein" keeps appearing i've tried everything i can find related to paddleocr on the internet and can't seem to find a way to fix the error
@mrityunjaysingh607 3 роки тому
Hey Nick, I am also facing same error.
@NicholasRenotte 3 роки тому
Got Visual C installed? Take a look at this; stackoverflow.com/questions/37676623/cant-install-levenshtein-distance-package-on-windows-python-3-5
@ridewithkay 3 роки тому ⁺¹
@@NicholasRenotte hey nick thanks for replying, apparently it's smtng to do with my computer? Cause when i send the code to my friend who is using a different model year and brand computer it can run perfectly. Anyways, thanks for the help 😄
@NicholasRenotte 3 роки тому
@@ridewithkay hmmm, does it run on Colab? Might be a faster solution!
@pandat99 3 роки тому ⁺¹
Hi Nick, I'm having a little issue...when I try to call PaddleOCR in my python script I keep getting a WinError 126, any advice?
@NicholasRenotte 3 роки тому
Got a longer error for me?
@prateekeshsabat4631 3 роки тому
@@NicholasRenotte yes same i am also getting when i was trying to write the script in vscode
@PurushothamReddy-ff6vp 7 місяців тому
hello, Awesome video!, i have a small doubt, if im trying to extract data from a table is there any way to extract data as as rows instead of selecting each line as a raw which it does by default? Thank you!
@TejrajParab 3 роки тому ⁺¹
Love your videos!
@NicholasRenotte 3 роки тому
Thanks a bunch @tejraj!!
@TamilSelvan-jl5nx 3 роки тому ⁺¹
HI nick,, can you pls ping how to retrain that Paddleocr module.. any video or documents.. Thanks in advance
@NicholasRenotte 3 роки тому
github.com/PaddlePaddle/PaddleOCR/blob/release/2.2/doc/doc_en/recognition_en.md
@arifshaik9201 2 роки тому
Loving it Nicholas........can you please share the video link or upload for, how we can detect the specific drug label.
Thanks in Adv..
@hatemhenchir 2 роки тому ⁺¹
Good job . In anpr system in pi4 who the best easyocr or paddleocr
@NicholasRenotte 2 роки тому
I'm using Paddle these days, works really well. Haven't done a proper comparison though @Hatem
@ramkrishnachoudhary633 2 роки тому ⁺¹
Hello sir.....kindly do a video on how to recognise handwritten text characters.....it would be great help
@yohannesabrha6823 Рік тому
Great Presentation and work. did you try to train it by using your own datasets ?
@kuronokei388 8 місяців тому
first of all thanks so much u have been a great help, i clicked on the link below to install but it was all chinese
@RAZZKIRAN 3 роки тому ⁺¹
i want to extract hand written text from image,
@TaoZen888 2 роки тому
Hi nick. who is jan lokon, lokoun at 2:26 ?
@lebaolochuynh4222 3 роки тому
thank you for very greate video. but in paddle paddle do we need preprocess our image first ?
@ChirawatNg Рік тому
Thank you :)
@tylerkim4799 7 місяців тому
I know this video is old and would be surprised to see a reply but the format is a little different now. To get the text, it is now 4 slots. To get the text, it would be [0][line number][1][0] and I cant use the for loop when the var is not in the first slot. Suggestions?
@tylerkim4799 7 місяців тому
also I am very new to coding
@morancium Рік тому
bro I LOVE YOU
@senthilkumarnadarajan2247 2 роки тому
Hi Nicolas. Your video was awesome and it was a great help in implementing our ocr model. If possible, Please do a tutorial for paddle lite in mobile
@irugal8964 3 роки тому ⁺¹
you are the best
@NicholasRenotte 3 роки тому
noooo, you are!
@johnurena2863 Рік тому
You are awesome!!!!
@СветаСерикова-е2ю 3 роки тому
"positional argument follows keyword argument"
can you add links to documentation for OCR?? I've got problems with arguments if I set gpu as False
@NicholasRenotte 3 роки тому
pypi.org/project/paddleocr/
@prateekeshsabat4631 3 роки тому
Hey, Nicholas Thanks for the content. Can we run this on the windows vscode
@juanricardorivera3591 3 роки тому ⁺¹
Thanks for posting.
It's not directly related but perhaps you/others can share ideas on how to solve something I've been struggling with: in a supermarket, how to identify what price belongs to what product? Once the above is clear, the OCR solution would fit perfectly.
@NicholasRenotte 3 роки тому
Kinda tricky, first up, this would be a HUGE model. One approach could be to classify the products with an image could classify the products.....actually wait, do you need to map the physical photos of the products to prices?
@juanricardorivera3591 3 роки тому
@@NicholasRenotte yup. We need from an image to get the products and their corresponding prices
@NicholasRenotte 3 роки тому
@@juanricardorivera3591 you could build an image classifier to classify the product then once classified do a lookup to a prices table!
@stevengiallourakis5816 Рік тому
How did you handle the PHI?
@krtk9675 3 роки тому ⁺¹
Loved the video man this is an awsome video, Is there anyway we can specifially get only drug name from that text.
@NicholasRenotte 3 роки тому ⁺¹
Yep, should be able to extract it from the results using some indexing.
@thelazydeveloper 3 роки тому ⁺¹
amazing video .. can you make tutorials about custom mask rcnn for object segmentation and thanks
@dalinsixtus6752 10 місяців тому
far better than tesseract and easy ocr i guess , im using it for prediciting the food can serial number and paddle is really great
@sebastianromero5686 2 роки тому ⁺¹
Amazing video! Is it possible to implement it in a mobile app through React Native? Thanks a lot Nick!!!
@kiranquidwai1128 Рік тому
Can u make a video on recognizing words in handwritten sentence?
@ayamahmoud5700 Рік тому
Hi nick, I want to ask you if you try to deploy paddleocr on mobile application ?
@saurabhbudhori1776 3 роки тому
FileNotFoundError: Could not find module 'C:\Users\budho\anaconda3\Library\bin\geos_c.dll' (or one of its dependencies). Try using the full path with constructor syntax.
while importing paddleocr!!!!!!!!!
@NicholasRenotte 3 роки тому
Hmmm, not sure if this will help but does this work? github.com/Toblerity/Shapely/issues/1032
@__-_-__.. 3 роки тому ⁺¹
Awsome content again! I'm wondering if you have created a tutorial for Emotion Recognition? Thanks!
@NicholasRenotte 3 роки тому
Yup! Check out the body language decoder vid on the channel!
@shraddhapawar5921 3 роки тому ⁺¹
Hello Nicholas, thank you so much for your awesome videos! they are really very helpful! keep it up . I have one question if we can use paddle ocr for handwritten text too? Thank you
@NicholasRenotte 3 роки тому
Nope, would suggest using a handwritten dl model for that!
@profapps 8 місяців тому
please help with this issue
TypeError Traceback (most recent call last)
Cell In[141], line 6
3 plt.figure(figsize=(15,15))
5 # draw annotations on image
----> 6 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path)
8 # show the image using matplotlib
9 plt.imshow(annotated)
File ~\AppData\Roaming\Python\Python311\site-packages\paddleocr\tools\infer\utility.py:387, in draw_ocr(image, boxes, txts, scores, drop_score, font_path)
385 box_num = len(boxes)
386 for i in range(box_num):
--> 387 if scores is not None and (scores[i] < drop_score or
388 math.isnan(scores[i])):
389 continue
390 box = np.reshape(np.array(boxes[i]), [-1, 1, 2]).astype(np.int64)
@anishaudayakumar1778 3 роки тому ⁺¹
Amazing content as always ❤️! Is this licensed model? How good is paddleOCR when compared to Tesseract?
@NicholasRenotte 3 роки тому ⁺¹
It's very good especially when it comes to preprocessing (virtually none needed). Looks like it's open source under Apache!
@suryatejamenta6555 2 роки тому
Which one is the best? EasyOCR or PaddleOCR..!
@bambitousttous4311 3 роки тому
Thanks for the tutorial ! Could you tell me please how to use a custom dictionary with python ?
@jayadevi2341 2 роки тому
Instead of image can I use scanned pdf
@tjtj1122 6 місяців тому
pdf to json extract how
@meghanarprakash7862 Рік тому
Great video, Thanks a lot!!!!!!!!!
can you please make a video on how to install and use paddle ocr on raspberry pi???
@ashleysami1640 3 роки тому ⁺¹
👏👏
@sathyanarayanan4196 2 роки тому
the for is not working for me
@wachangastephen2859 2 роки тому ⁺¹
Hi Nick! Thank you very much for a super great tutorial. I would truly love if you can share how one can train the paddle ocr with some customized images, which would help improve on text extraction for different use cases. Once again thank you very much.
@alexsollis5504 2 роки тому
Did you give this a go?
@souramrakesh8900 3 роки тому ⁺¹
❤
@nosuchthing4789 2 роки тому
Wow!
@sidmagger Рік тому
bro make QR on band to solve
@fr3847 3 роки тому ⁺¹
how would you do real time ocr detection via webcam?
@NicholasRenotte 3 роки тому
Check this out: ua-cam.com/video/0-4p_QgrdbE/v-deo.html