Feature Pyramid Network | Neck | Essentials of Object Detection

Поділитися
Вставка
  • Опубліковано 18 гру 2024

КОМЕНТАРІ • 65

  • @paedrufernando2351
    @paedrufernando2351 Рік тому +9

    Keep the pearls of wisdom dropping sir..Privilage to learn from you miles across...

  • @lostpenguin3682
    @lostpenguin3682 Рік тому +3

    very helpful! I really like that you're explaining it with an example with concrete numbers!

  • @AkhileshShukla-d5x
    @AkhileshShukla-d5x Рік тому +1

    Sir, I have a lot of to say after finding your video on UA-cam but just ❤ , respect and thank you. 🙏🙏

  • @ianhowe8881
    @ianhowe8881 3 місяці тому

    Incredible explanatory skills!

  • @brunodias3524
    @brunodias3524 Рік тому +1

    I am so happy I found this video. Really good content!

  • @TeamDman
    @TeamDman Рік тому +2

    Thank you for sharing your knowledge!

  • @NehadHirmiz
    @NehadHirmiz Рік тому +2

    Excellent tutorial. Thank you very much.

  • @vipingautam9501
    @vipingautam9501 Рік тому +2

    This is excellent! I just love it.

  • @AdnanMunirkhokhar
    @AdnanMunirkhokhar Рік тому +1

    amazing explanation Dr.

  • @applestarpie
    @applestarpie Рік тому +1

    I like your videos, which are easy and fun to learn. Thanks a lot!

  • @abhishekdhiman5719
    @abhishekdhiman5719 5 місяців тому

    Thanks for sharing the knowledge

  • @rampavanmedipelli6152
    @rampavanmedipelli6152 Рік тому +1

    Thank you... excellent clarity... please try to make a tutorial on anchor free detectors like FCOS..

  • @ranjithtevnan2909
    @ranjithtevnan2909 5 місяців тому

    I have 2 questions. How are the 1X1 and 3X3 CNN used trained to obtain the weight parameters? Also shouldn't 3X3 with stride 1 change the dimension, though it keeps the number of channels the same the size of the output feature would have changed and reduced by 2

  • @science_electronique
    @science_electronique 9 місяців тому +1

    is useful to add channel and spatial attention in conv layers to improve

  • @kylehuang9035
    @kylehuang9035 Рік тому +2

    Could you give a tutorial of diffusing model to your VAE series? Its related and would like to see your explanation!

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      Though I understand the theory it’s just that I have never implemented/used them myself. I prefer to share those concepts that I have implemented myself and applied on some real world problem.
      But not saying no :) maybe one day. Thanks for the ask though.

  • @rampavan4094
    @rampavan4094 Рік тому +1

    Could you give a tutorial on the vision transformer model for object detection?

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      in some time. have been preoccupied with some stuff but would try my best

  • @yogeshwarshendye4857
    @yogeshwarshendye4857 8 місяців тому

    If done with UNet, it won't require upsampling as we concatenate the layers right?

  • @dmgeo
    @dmgeo 4 місяці тому

    How is this different from U-net? I think they're pretty similar if you think that in the U-net you're going down in the encoder, up in the decoder and sideways with the skip connections. It's like an upside-down U-net

  • @user-uf3md5ub5j
    @user-uf3md5ub5j Рік тому +1

    Thanks a lot! would be the following videos soon?

  • @krishnachaitanya7374
    @krishnachaitanya7374 Рік тому +1

    This is quite informative and helpful. Can you please create a video on prediction heads in fpn as in how to assign a predicted bbox to a particular feature map. That would be quite helpful.

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      Yes, thinking to make some videos about different label assignment techniques.
      Now about your question - the right terminology or phrasing of your request would be how to assign an anchor box to a particular feature map.

  • @vincentpelletier1246
    @vincentpelletier1246 9 місяців тому

    I don't know if I got this wrong but if I take a 1x64x26x26 feature through a convolution that has a K=3 and S=1, I will definitely not end up with a 1x64x26x26, but with a 1x64x24x24. To achieve the desired shape would require a P=1.
    If I'm not correct, would someone please explain how the dimensions would work in this case?

  • @DIAHAYUNINGTYASWATI
    @DIAHAYUNINGTYASWATI Рік тому

    Do you know how to combine AFPN with the YOLO v8 algorithm? If you know, please tell me. Thanks

  • @LongLeNgoc-qq5qn
    @LongLeNgoc-qq5qn Рік тому

    what about height and width are odd number (415), sir? In that case, the size after conv and after upsample is miss match. How to fix that, please!

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      Resize the image to 416 or any other size (e.g. 640) before feeding it to the network.

  • @cheeziobodini
    @cheeziobodini Рік тому

    Instead of doing the upsampling via pytorch module and being angry about it, would it be any more useful to train an additional layer to do the upsampling instead? I'm thinking of a layer analogous to the decoder layer in an autoencoder.

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      No need to be angry at it :) … yes you could do that. As a matter of fact the additional layers after upsampling is to reduce it effects. The cost would be number of parameters. So it is always a trade off.

    • @cheeziobodini
      @cheeziobodini Рік тому +1

      @@KapilSachdeva Thank you! informative video btw

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      🙏

  • @lordfarquad-by1dq
    @lordfarquad-by1dq Рік тому +1

    thank you for the content , next video soon?

    • @KapilSachdeva
      @KapilSachdeva  Рік тому +1

      🙏 … yes. Most likely tomorrow. Thanks for keeping me accountable.

    • @lordfarquad-by1dq
      @lordfarquad-by1dq Рік тому +1

      @@KapilSachdeva thank you again for the content, looking forward for more of these videos

    • @KapilSachdeva
      @KapilSachdeva  Рік тому +1

      Still working on the next video; not yet happy with it hence not published yet.

  • @harshith_takkala
    @harshith_takkala Рік тому +1

    thankyou sir !

  • @manueljohnson1354
    @manueljohnson1354 6 місяців тому

    Excellent

  • @farooqdsp
    @farooqdsp Рік тому

    new video when ?

  • @III.Jennifer
    @III.Jennifer 3 місяці тому

    209 Lisandro Ridge

  • @nayab.quteer
    @nayab.quteer Рік тому

    Can you make the video in Urdu language

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      There are urdu subtitles and may be that will be of some help!

  • @TameraSweet-n3t
    @TameraSweet-n3t 2 місяці тому

    Haley Corner

  • @TeddyFlanagan-q8l
    @TeddyFlanagan-q8l 2 місяці тому

    Clement Landing

  • @DorisCorey-j7i
    @DorisCorey-j7i 2 місяці тому

    Hernandez Betty Lewis Kenneth Gonzalez Christopher

  • @FinancialYaweli
    @FinancialYaweli 4 місяці тому

    Walker William Moore Patricia Perez Anthony

  • @MichelleMoore-l2c
    @MichelleMoore-l2c 3 місяці тому

    Pagac Road

  • @StudentsThough
    @StudentsThough 3 місяці тому

    Garcia Larry Lewis Charles Hernandez Carol

  • @RobertWhite-m3p
    @RobertWhite-m3p 2 місяці тому

    Franco Neck

  • @GoldYvonne-r9o
    @GoldYvonne-r9o 3 місяці тому

    Hernandez Michael Taylor Donald Walker Richard

  • @SgheGejsj
    @SgheGejsj 3 місяці тому

    Wilson Jose Lewis Matthew Smith Matthew

  • @LoisStewart-t6g
    @LoisStewart-t6g 3 місяці тому

    Thompson Cynthia Martin Frank Brown Jason

  • @EraRyba
    @EraRyba 3 місяці тому

    8831 Osvaldo Heights