RT DETR - realtime object detection with transformers

Поділитися
Вставка
  • Опубліковано 17 гру 2024

КОМЕНТАРІ • 8

  • @taido4883
    @taido4883 22 дні тому

    Wow. Thank you Mak!!!
    I was learning about this model and having no idea where to start. This video must have saved me weeks of reading materials and going through source codes!

  • @RussWest-n8c
    @RussWest-n8c 2 місяці тому

    Amazing work, I really love the DETR video series, your explanations are so clear and easy to understand!
    Is there any chance these latest videos will be also available in text format on your github page, like in the case of DAB-DETR and Deformable DETR?

    • @makgaiduk
      @makgaiduk  2 місяці тому

      Certainly! Will get started on that

    • @makgaiduk
      @makgaiduk  2 місяці тому

      There we go: github.com/adensur/blog/tree/main/computer_vision_zero_to_hero/30_rt_detr
      WIll do other videos to, in due time

  • @codewithdev1375
    @codewithdev1375 2 місяці тому

    Amazing work, can you please do video on vrwkv its very recent vision encoder with linear attention. Thanks a lot for your videos.

    • @makgaiduk
      @makgaiduk  2 місяці тому

      Sure thing! Never heard of this one before, will be interesting

  • @rahulharsha2243
    @rahulharsha2243 2 місяці тому +1

    It’s surprising that transformers are not very good at small object detection as self attention should help the decoder to find them.

    • @makgaiduk
      @makgaiduk  2 місяці тому +1

      I think this is a performance problem. Looking at "fat" transformer-based models like CODETR (arxiv.org/pdf/2211.12860), they don't seem to have any problem with APsmall. Perhaps it's just that looking at HD feature maps is expensive for a small model like rt detr?