Detection Head | Essentials of Object Detection

Поділитися
Вставка
  • Опубліковано 15 жов 2024
  • This tutorial shows you how to make the detection head(s) that takes features from the backbone or the neck.
    Link to the Google Colab notebook:
    colab.research...

КОМЕНТАРІ • 22

  • @deeptirawat6091
    @deeptirawat6091 3 місяці тому

    At 11:02, instead of "the first rows of the first four channels will be for box coordinates", do you intend to say "the first cells of the first four channels will be for box coordinates"?

  • @AdityaPrakash-nk9gc
    @AdityaPrakash-nk9gc 4 місяці тому

    At 5:01 could you please explain why is it [1,5] and not [5,1]? Shouldn't the coordinates be in (x,y) format?

    • @KapilSachdeva
      @KapilSachdeva  4 місяці тому

      No the coordinates are in [y,x] … nothing specific about it as such, just a convention used in all object detection models.

  • @frazuppi4897
    @frazuppi4897 Рік тому +2

    this channel is amazing!!!!

    • @KapilSachdeva
      @KapilSachdeva  Рік тому +1

      🙏 not sure if there is anything for you to learn from my channel but sincerely appreciate your kind words.

  • @husseinjlailaty5852
    @husseinjlailaty5852 Рік тому +1

    Very nice lecture sir. Thank you!
    Isn't the cell position [1][5] at the 19th cell (not the 15th cell) ?

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      Yes it will be the 19th cell, do I say it 15th cell in the tutorial?

    • @husseinjlailaty5852
      @husseinjlailaty5852 Рік тому

      @@KapilSachdeva Yes, no worries. Please keep on doing your magnificent work.

  • @kshamanthkumar6042
    @kshamanthkumar6042 Рік тому +1

    Awesome 🤩, thank you so much sir.

  • @gqgqrghqrhgq
    @gqgqrghqrhgq Рік тому

    Hi ! Thank you for the great tutorial. I understand why we use the detection head and how it works. But I dont get, how we would combine the 3 outputs of the 3 heads. How would we know, which output/head (highleve, lowlevel, midlevel) is responsible for which ground truth box ? So that we can calculate the loss. Or is there a way to combine the output of the 3 heads to a single one ? Thank you

    • @KapilSachdeva
      @KapilSachdeva  Рік тому +1

      Assuming you are familiar with the notion of anchor boxes.
      The anchor boxes are assigned to different levels and during training you associate the ground truth box with an anchor box. This is how a particular level becomes responsible for predicting for the ground truth box.

    • @gqgqrghqrhgq
      @gqgqrghqrhgq Рік тому

      ​@@KapilSachdeva Thank you for the response and your awesome videos ! I think I get it now. I never found a good explantation for it. Do we consequently use bigger anchor boxes for the higher up levels and smaller ones for the lower level bounding boxes ? And therefore we know which ground truth box to assign to which layer using the IuO score ?

    • @KapilSachdeva
      @KapilSachdeva  Рік тому +1

      Yes

  • @harshith_takkala
    @harshith_takkala Рік тому +1

    clean explanation !

  • @КириллКлимушин
    @КириллКлимушин 11 місяців тому +1

    Thank you❤

  • @chaouidhuzgen6818
    @chaouidhuzgen6818 Рік тому +1

    hi, amzing explanations, bravo
    how can i contact you sir ?

    • @KapilSachdeva
      @KapilSachdeva  Рік тому

      🙏 if you have questions you can always ask them in comments.

  • @shabbirahammed4596
    @shabbirahammed4596 Рік тому +1

    nice...