Rip out Drug Labels using Deep Learning with PaddleOCR & Python

  • Published 4 Dec 2024

COMMENTS • 125

  • @danielbozinovski8419 3 years ago +5

    Just watched some of your previous tf videos, and man was I blown away by how you didn't already have at least a million subs. Keep up the awesome work!

  • @dan3370 3 years ago +6

    This guy deserves more subs! One of the best content creators in the ML space.

  • @csoham96 2 years ago +1

    This model is crazy good with accuracy! Love your tutorial as always

  • @salabsonisa1650 3 years ago +3

    Hi nick, I love your videos because of how detailed you are. Since I discovered your channel I have never been the same because I have learned a lot from your teaching method and technical stuff on AI and deep learning. Kudos to you for your effort to impart knowledge.

  • @mathewsjoby6455 3 years ago +2

    Always LOVED YOUR CONTENT FROM DAY 1 SINCE I GOT HERE! Simple and intelligent content with which any KID can understand the most complex things! Good job!

  • @hilariouzclips1235 3 years ago +3

    Your channel is the only one where I've clicked the bell button. Your vids are 💎

  • @brunospfc8511 1 year ago +1

    Still works, very easy, thanks Nick!

  • @pseudounknow5559 3 years ago +1

    Thanks for your positivity in all of your videos 😊

    • @NicholasRenotte 3 years ago

      Gotta be! Life is too short, just tryna have some fun along the way!

  • @leoletchoum 3 years ago +3

    I love the video thanks Nick! Something that might be useful for Mac users is adding the following option at the beginning of step 2:
    ocr_model = PaddleOCR(lang='en', use_gpu=False)
    Tested it on MBP Intel and worked like a charm with impressive accuracy :-)

    • @NicholasRenotte 3 years ago

      Ha, that's awesome. Thanks so much for sharing my guy!!

  • @janicejose5809 3 years ago +1

    Accidentally stumbled upon this channel. Thanks a lot, this was exactly what I was looking for!!

    • @NicholasRenotte 3 years ago

      Glad you stumbled over here Janice, welcome to the team!

    • @janicejose5809 3 years ago

      @NicholasRenotte Hello sir, your channel has pretty amazing content. I was planning on building an app that helps detect fake medicines using computer vision techniques. Could you please make a video where it analyses the input images, detects the differences and highlights them, or any related tutorial? Thanks a lot.

  • @VivekSingh-in6rq 3 years ago +1

    thanks for always posting videos, love them

  • @alexandregagne4151 3 years ago +2

    Like always, 😎😎😎. Good job Nicholas.

  • @serboutyassine1733 3 years ago +1

    Amazing content, you are the best.

  • @samahashabbir6584 2 years ago +4

    Hi Nick!
    TypeError Traceback (most recent call last)
    in
    ----> 1 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path)
    /usr/local/lib/python3.7/dist-packages/paddleocr/tools/infer/utility.py in draw_ocr(image, boxes, txts, scores, drop_score, font_path)
        380     box_num = len(boxes)
        381     for i in range(box_num):
    --> 382         if scores is not None and (scores[i] < drop_score or
        383                 math.isnan(scores[i])):
        384             continue
    TypeError: '

    • @KD-qo7kx 9 months ago

      this is kinda late, but has anyone gotten a solution for this? chatgpt is not helping at all

    • @ball-e 1 month ago

      looks like you're comparing a tuple to a float which isn't allowed. seems like scores[i] or drop_score is not the type of data that you think it is. Try using the type() func or print out the data.
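
A minimal sketch of how one might check the result shape before calling draw_ocr, so that every score is a plain float. It assumes (this is not confirmed in the video) that some PaddleOCR releases return the list of lines directly while others wrap it in one extra list per image, as another commenter in this thread reports. The helpers `is_line` and `unpack_result` and the sample data are hypothetical and not part of PaddleOCR.

```python
def is_line(item):
    """True if item looks like one OCR line: [box, (text, score)]."""
    return (
        isinstance(item, (list, tuple))
        and len(item) == 2
        and isinstance(item[1], (list, tuple))
        and len(item[1]) == 2
        and isinstance(item[1][0], str)
    )


def unpack_result(result):
    """Split an OCR result into parallel lists of boxes, texts and scores."""
    if not result:
        return [], [], []
    # Flat shape (assumed older behaviour): result is the list of lines.
    # Nested shape (assumed newer behaviour): result[0] is the list of lines.
    lines = result if is_line(result[0]) else (result[0] or [])
    boxes = [line[0] for line in lines]
    texts = [line[1][0] for line in lines]
    scores = [line[1][1] for line in lines]
    return boxes, texts, scores


# Made-up sample data in the two assumed shapes.
box = [[10, 10], [120, 10], [120, 40], [10, 40]]
flat = [[box, ("IBUPROFEN", 0.98)], [box, ("200 mg", 0.95)]]
nested = [flat]

boxes, texts, scores = unpack_result(nested)
print(texts)
print(all(isinstance(score, float) for score in scores))
```

With the lists unpacked this way, `scores[i] < drop_score` compares a float with a float rather than a tuple with a float.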

  • @VivekSingh-in6rq 3 years ago +1

    By the way, I think this video was created a few weeks ago, because there have been certain changes in the stable release and the code would have to be modified accordingly.

  • @gustavojuantorena 3 years ago +1

    Nice and simple! 👏👏

  • @yohannesabrha6823 1 year ago

    Great Presentation and work. did you try to train it by using your own datasets ?

  • @PD-iu2sr 2 years ago +2

    Hiii @Nicholas is it possible for live camera detection???

  • @ameerazam3269 3 years ago +1

    😊 😊 😊 😊 😊 😊 😊 thank you so much

  • @svetoslavchilingirov3858 1 month ago

    Very good one. Have you tried it for watermark removal? I am testing paddle vs easyocr vs tesseract vs craft... Maximum detection success rate after training with synthetic data is 45%

  • @sebastianromero5686 2 years ago +1

    Amazing video! Is it possible to implement it in a mobile app through React Native? Thanks a lot Nick!!!

  • @BurnLai 1 year ago

    good and clear.

  • @brianfc1906 3 years ago +1

    Just here to let you know how awesome you are ❤️

  • @TejrajParab 3 years ago +1

    Love your videos!

  • @bankcrawpackchannel6936 3 years ago +1

    So awesome!!

  • @shraddhapawar5921 3 years ago +1

    Hello Nicholas, thank you so much for your awesome videos! They are really very helpful, keep it up. I have one question: can we use PaddleOCR for handwritten text too? Thank you

    • @NicholasRenotte 3 years ago

      Nope, would suggest using a handwritten dl model for that!

  • @morancium 1 year ago

    bro I LOVE YOU

  • @krtk9675 3 years ago +1

    Loved the video man, this is an awesome video. Is there any way we can specifically get only the drug name from that text?

    • @NicholasRenotte 3 years ago +1

      Yep, should be able to extract it from the results using some indexing.
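
A rough sketch of what "some indexing" could look like. The first option simply takes the first detected line. The second is a heuristic that is purely an assumption here (not something shown in the video): the drug name is often printed in the largest type, so pick the line with the tallest bounding box. The function names and sample data are made up for illustration.

```python
def box_height(box):
    """Height of a 4-point box given as [[x, y], [x, y], [x, y], [x, y]]."""
    ys = [point[1] for point in box]
    return max(ys) - min(ys)


def pick_largest_text(lines):
    """Return the text of the line with the tallest box, or None if there are no lines."""
    if not lines:
        return None
    tallest = max(lines, key=lambda line: box_height(line[0]))
    return tallest[1][0]


# Made-up OCR lines in the [box, (text, score)] layout.
lines = [
    [[[20, 10], [300, 10], [300, 70], [20, 70]], ("PARACETAMOL", 0.99)],
    [[[20, 80], [200, 80], [200, 100], [20, 100]], ("500 mg tablets", 0.97)],
    [[[20, 110], [260, 110], [260, 125], [20, 125]], ("Take one tablet daily", 0.93)],
]

first_line_text = lines[0][1][0]       # plain indexing: first line, text slot
drug_name = pick_largest_text(lines)   # heuristic: tallest box
print(first_line_text, drug_name)
```

On real labels the heuristic can easily pick a brand logo or a warning banner instead, so it would need checking against the actual images.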

  • @senthilkumarnadarajan2247 2 years ago

    Hi Nicolas. Your video was awesome and it was a great help in implementing our ocr model. If possible, Please do a tutorial for paddle lite in mobile

  • @ChirawatNg 1 year ago

    Thank you :)

  • @PurushothamReddy-ff6vp 7 months ago

    Hello, awesome video! I have a small doubt: if I'm trying to extract data from a table, is there any way to extract the data as rows, instead of treating each line as its own row, which it does by default? Thank you!
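
One possible approach, sketched under assumptions: since each detected line comes with a bounding box, lines whose vertical centres are close together can be grouped into the same table row and then ordered left to right. The `group_into_rows` helper, the `y_tolerance` value and the sample data below are hypothetical, and this is not a built-in PaddleOCR feature.

```python
def group_into_rows(lines, y_tolerance=10):
    """Group [box, (text, score)] lines into rows of text, ordered left to right."""

    def y_center(line):
        ys = [point[1] for point in line[0]]
        return (min(ys) + max(ys)) / 2

    def x_left(line):
        return min(point[0] for point in line[0])

    rows = []
    for line in sorted(lines, key=y_center):
        # Same row if the vertical centre is close to the row's first cell.
        if rows and abs(y_center(line) - rows[-1]["y"]) <= y_tolerance:
            rows[-1]["cells"].append(line)
        else:
            rows.append({"y": y_center(line), "cells": [line]})
    return [
        [cell[1][0] for cell in sorted(row["cells"], key=x_left)]
        for row in rows
    ]


def make_line(x, y, text):
    """Build a made-up OCR line with an 80x20 box at (x, y)."""
    return [[[x, y], [x + 80, y], [x + 80, y + 20], [x, y + 20]], (text, 0.9)]


lines = [
    make_line(200, 52, "10 mg"),
    make_line(10, 50, "Drug A"),
    make_line(10, 90, "Drug B"),
    make_line(200, 91, "20 mg"),
]
table = group_into_rows(lines)
print(table)
```

The tolerance would need tuning to the row height of the actual table, and skewed scans would need deskewing first.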

  • @dinugakasun5218 3 years ago +1

    superb video, but I have a question, is there any possible way to extract texts from only specific parts of an image, for example. I have an image of the license card, I need to extract only the name and address. Any help you can provide would be greatly appreciated. Thank you.

    • @gisellepena7927 3 years ago +1

      Same question.

    • @NicholasRenotte 3 years ago +1

      You can isolate that part of the frame and pass that through to the OCR model. This is a little trickier though particularly if that region is going to be in a different location each time.

    • @dinugakasun5218 3 years ago

      @NicholasRenotte Yeah, figured out a way using object detection to crop the ROI, then pasting the ROIs separately. Anyway, thank you, you are the best.

  • @arifshaik9201 2 years ago

    Loving it Nicholas........can you please share the video link or upload for, how we can detect the specific drug label.
    Thanks in Adv..

  • @TaoZen888 2 years ago

    Hi nick. who is jan lokon, lokoun at 2:26 ?

  • @romeshrajoria4267 3 years ago +1

    Hey, I saw your TensorFlow object detection API video. I have a query to ask.
    I want to know how to visualize the custom trained model & how we can edit these layers for creating a new model.

    • @NicholasRenotte 3 years ago

      Not possible to dive into the underlying layers with the TFOD API afaik. You can fine-tune if needed, but adding explicit layers using Keras, for example, isn't possible.

  • @johnurena2863 1 year ago

    You are awesome!!!!

  • @ramkrishnachoudhary633 2 years ago +1

    Hello sir.....kindly do a video on how to recognise handwritten text characters.....it would be great help

  • @SasiKumar-zz5ps 8 months ago

    @NicholasRenotte
    Is there any option, like PaddleOCR, with which we can create a custom OCR model and application from scratch? What steps would need to be done, in your view?

  • @anishaudayakumar1778 3 years ago +1

    Amazing content as always ❤️! Is this a licensed model? How good is PaddleOCR compared to Tesseract?

    • @NicholasRenotte 3 years ago +1

      It's very good especially when it comes to preprocessing (virtually none needed). Looks like it's open source under Apache!

  • @irugal8964 3 years ago +1

    you are the best

  • @juanricardorivera3591 3 years ago +1

    Thanks for posting.
    It's not directly related but perhaps you/others can share ideas on how to solve something I've been struggling with: in a supermarket, how to identify what price belongs to what product? Once the above is clear, the OCR solution would fit perfectly.

    • @NicholasRenotte 3 years ago

      Kinda tricky. First up, this would be a HUGE model. One approach could be to classify the products with an image classifier... actually wait, do you need to map the physical photos of the products to prices?

    • @juanricardorivera3591 3 years ago

      @NicholasRenotte Yup. From an image, we need to get the products and their corresponding prices.

    • @NicholasRenotte 3 years ago

      @juanricardorivera3591 you could build an image classifier to classify the product, then once classified do a lookup to a prices table!

  • @lebaolochuynh4222 3 years ago

    Thank you for the great video. But in PaddlePaddle, do we need to preprocess our image first?

  • @kuronokei388 8 months ago

    first of all thanks so much u have been a great help, i clicked on the link below to install but it was all chinese

  • @__-_-__.. 3 years ago +1

    Awesome content again! I'm wondering if you have created a tutorial for Emotion Recognition? Thanks!

    • @NicholasRenotte 3 years ago

      Yup! Check out the body language decoder vid on the channel!

  • @prateekeshsabat4631 3 years ago

    Hey Nicholas, thanks for the content. Can we run this in VS Code on Windows?

  • @hatemhenchir 2 years ago +1

    Good job. For an ANPR system on a Pi 4, which is best: EasyOCR or PaddleOCR?

    • @NicholasRenotte 2 years ago

      I'm using Paddle these days, works really well. Haven't done a proper comparison though @Hatem

  • @kiranquidwai1128 1 year ago

    Can u make a video on recognizing words in handwritten sentence?

  • @dalinsixtus6752 10 months ago

    Far better than Tesseract and EasyOCR, I guess. I'm using it for predicting food can serial numbers and Paddle is really great.

  • @thelazydeveloper 3 years ago +1

    amazing video .. can you make tutorials about custom mask rcnn for object segmentation and thanks

  • @shareb1t 3 months ago

    So if I have a font file, but the language is an old forgotten one from the 500s, can I still train OCR with the font I have for that language? The purpose is to digitize books from a museum.

  • @RAZZKIRAN 3 years ago +1

    i want to extract hand written text from image,

  • @tylerkim4799 7 months ago

    I know this video is old and would be surprised to see a reply, but the format is a little different now. To get the text, there are now 4 slots: it would be [0][line number][1][0], and I can't use the for loop when the var is not in the first slot. Suggestions?

    • @tylerkim4799 7 months ago

      also I am very new to coding
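
A small sketch of one way to loop with that layout: index the first slot once, then iterate over what is inside it. The `result` below is made-up sample data shaped like the [0][line number][1][0] structure described above. It does not call PaddleOCR, so the real output may differ between versions.

```python
# Made-up result in the shape described above: result[0][line][1][0] is the text.
result = [[
    [[[10, 10], [120, 10], [120, 40], [10, 40]], ("ASPIRIN", 0.99)],
    [[[10, 50], [90, 50], [90, 70], [10, 70]], ("100 mg", 0.96)],
]]

texts = []
for line in result[0]:            # result[0] is the list of lines for the first image
    box, (text, score) = line     # line[0] is the box, line[1] is (text, score)
    texts.append(text)

# The same thing written with explicit indexes, matching [0][line number][1][0].
texts_by_index = [result[0][i][1][0] for i in range(len(result[0]))]

print(texts)
```

The loop variable does not have to be the first index: fixing `[0]` before the loop and putting the line number in the loop gives the same four-slot access.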

  • @rachanareddy6115 2 years ago +1

    I'm getting an error installing paddleOCR. I have a MacBook with M1 chip. It would be great if you could help me with that! Thank you

    • @rikeshpatel309 2 years ago

      Hi, I have same error, did you happen to find the fix?

  • @donovanvanderlinde3478 9 months ago +1

    Fix for M1 issues with PyMuPDF:
    ERROR: Failed building wheel for PyMuPDF
    # brew install swig
    # pip3 install paddleocr --upgrade PyMuPDF==1.21.1

  • @suryatejamenta6555 2 years ago

    Which one is the best? EasyOCR or PaddleOCR..!

  • @saharshsinha5349 2 years ago +1

    In the "instantiating model and detect" section, when I set up the model with ocr = PaddleOCR(leng='eng'), I get this error:
    AttributeError: module 'paddle.fluid.core_avx' has no attribute 'is_compiled_with_rocm'
    How can I solve this, any help?

    • @lailigita4251 2 years ago

      i have the same problem

    • @aviparnabiswas3707 2 years ago

      Seems to me that your pc might not have GPU integrated. While installing paddle-ocr do not provide the gpu argument

    • @myriamagrebi2114 1 year ago

      @aviparnabiswas3707 Hello, you can change it to this:
      # Setup model
      ocr_model = PaddleOCR(lang='en', use_gpu=False)

  • @pandat99 3 years ago +1

    Hi Nick, I'm having a little issue...when I try to call PaddleOCR in my python script I keep getting a WinError 126, any advice?

    • @NicholasRenotte 3 years ago

      Got a longer error for me?

    • @prateekeshsabat4631 3 years ago

      @NicholasRenotte Yes, I am getting the same error when trying to write the script in VS Code.

  • @ridewithkay 3 years ago +1

    Hey Nick, I've run into another error: I can't seem to install paddleocr. The error "failed building wheel for python-Levenshtein" keeps appearing. I've tried everything I can find related to paddleocr on the internet and can't seem to find a way to fix the error.

    • @mrityunjaysingh607 3 years ago

      Hey Nick, I am also facing same error.

    • @NicholasRenotte 3 years ago

      Got Visual C installed? Take a look at this; stackoverflow.com/questions/37676623/cant-install-levenshtein-distance-package-on-windows-python-3-5

    • @ridewithkay 3 years ago +1

      @NicholasRenotte Hey Nick, thanks for replying. Apparently it's something to do with my computer? Because when I send the code to my friend, who is using a computer of a different model year and brand, it runs perfectly. Anyways, thanks for the help 😄

    • @NicholasRenotte 3 years ago

      @ridewithkay hmmm, does it run on Colab? Might be a faster solution!

  • @leezx337 1 year ago

    Good Afternoon Mr Nicholas Renotte, my kernel keeps dying when i run 'from paddleocr import PaddleOCR...' even though i manage to install the packages required... do you know how i might resolve this issue or what the potential problem is? THANK YOU!!

  • @ayamahmoud5700 1 year ago

    Hi Nick, I want to ask whether you have tried deploying PaddleOCR in a mobile application?

  • @TEAM-sn1lw 8 months ago

    I am unable to extract only the text, but I am able to extract the coordinates and text together. What should I do to extract only the text?

  • @ChetnaLongani 1 year ago

    Hi Nick, very well explained. But I get error 'Can not import paddle core while this file exists: /usr/local/lib/python3.10/dist-packages/paddle/fluid/libpaddle.so' Please help

  • @cissemy 1 year ago

    Hi
    Is it possible to use this model to extract : number of rows and columns, elements of matrix ?

  • @stevengiallourakis5816 1 year ago

    How did you handle the PHI?

  • @TamilSelvan-jl5nx 3 years ago +1

    HI nick,, can you pls ping how to retrain that Paddleocr module.. any video or documents.. Thanks in advance

    • @NicholasRenotte 3 years ago

      github.com/PaddlePaddle/PaddleOCR/blob/release/2.2/doc/doc_en/recognition_en.md

  • @sheetaljoshi6740 1 year ago

    RuntimeError: (PreconditionNotMet) The third-party dynamic library (cudnn64_8.dll) that Paddle depends on is not configured correctly. (error code is 126)

  • @jayadevi2341 2 years ago

    Instead of an image, can I use a scanned PDF?

  • @bambitousttous4311 3 years ago

    Thanks for the tutorial ! Could you tell me please how to use a custom dictionary with python ?

  • @СветаСерикова-е2ю 3 years ago

    "positional argument follows keyword argument"
    can you add links to documentation for OCR?? I've got problems with arguments if I set gpu as False
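
That message is a general Python syntax rule rather than something specific to OCR: once an argument is passed by name, every argument after it must also be passed by name. A minimal sketch, using a hypothetical `make_model` function as a stand-in (it is not the PaddleOCR constructor):

```python
def make_model(lang, use_gpu=True):
    """Hypothetical stand-in for a constructor taking a language and a GPU flag."""
    return {"lang": lang, "use_gpu": use_gpu}


# A bare value after a keyword argument is rejected before the code even runs.
bad_call = "make_model(lang='en', False)"
try:
    compile(bad_call, "<example>", "eval")
    error_message = None
except SyntaxError as err:
    error_message = err.msg
print(error_message)

# Naming the second argument (or passing the first one positionally) is accepted.
model = make_model(lang="en", use_gpu=False)
also_ok = make_model("en", use_gpu=False)
print(model == also_ok)
```

So if the GPU flag is written as a bare `False` after `lang='en'`, naming it (as in `use_gpu=False`, the form other commenters here use) avoids the error.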

  • @ashleysami1640 3 years ago +1

    👏👏

  • @tjtj1122 6 months ago

    pdf to json extract how

  • @vasamsettisharmilasairagha5863 2 years ago

    Could not find a version that satisfies the requirement paddlepaddle-gpu==2.0.0 (from versions: none)
    I'm getting above error can anyone suggest a solution

  • @souramrakesh8900 3 years ago +1

  • @meghanarprakash7862 1 year ago

    Great video, Thanks a lot!!!!!!!!!
    can you please make a video on how to install and use paddle ocr on raspberry pi???

  • @sathyanarayanan4196 2 years ago

    The for loop is not working for me.

  • @nosuchthing4789 2 years ago

    Wow!

  • @saurabhbudhori1776 3 years ago

    FileNotFoundError: Could not find module 'C:\Users\budho\anaconda3\Library\bin\geos_c.dll' (or one of its dependencies). Try using the full path with constructor syntax.
    while importing paddleocr!!!!!!!!!

    • @NicholasRenotte 3 years ago

      Hmmm, not sure if this will help but does this work? github.com/Toblerity/Shapely/issues/1032

  • @wachangastephen2859 2 years ago +1

    Hi Nick! Thank you very much for a super great tutorial. I would truly love if you can share how one can train the paddle ocr with some customized images, which would help improve on text extraction for different use cases. Once again thank you very much.

  • @profapps 8 months ago

    please help with this issue
    TypeError Traceback (most recent call last)
    Cell In[141], line 6
          3 plt.figure(figsize=(15,15))
          5 # draw annotations on image
    ----> 6 annotated = draw_ocr(img, boxes, texts, scores, font_path=font_path)
          8 # show the image using matplotlib
          9 plt.imshow(annotated)
    File ~\AppData\Roaming\Python\Python311\site-packages\paddleocr\tools\infer\utility.py:387, in draw_ocr(image, boxes, txts, scores, drop_score, font_path)
        385     box_num = len(boxes)
        386     for i in range(box_num):
    --> 387         if scores is not None and (scores[i] < drop_score or
        388                 math.isnan(scores[i])):
        389             continue
        390         box = np.reshape(np.array(boxes[i]), [-1, 1, 2]).astype(np.int64)

  • @sidmagger 1 year ago

    bro make QR on band to solve

  • @fr3847 3 years ago +1

    how would you do real time ocr detection via webcam?

    • @NicholasRenotte 3 years ago

      Check this out: ua-cam.com/video/0-4p_QgrdbE/v-deo.html