Great video!!!
thank you for making this video
I was praying to see this 2 years ago lol
I am glad you enjoyed it!
This is an actual explanation. Unlike most of the other channels that purport to "explain" these architectures.
thank you so much for this amazing video! looking forward to more of your content :D
Glad you enjoyed it! More to come!
Literally the only in-depth source other than the "Deformable Convolution Networks" paper. Helped me a lot with my bachelor's thesis!
Check out my next video: reading Deformable DETR source code ua-cam.com/video/3M9mS_3eiaw/v-deo.html
Great explanation!!
Could I request videos covering the object tracking problem, and more specifically models like MOTR?
Certainly! I was hoping to climb up to current state of the art in object detection, and then expand towards more advanced problems like object tracking
@@makgaiduk Great!! Looking forward to it
Was really helpful :) keep it up
Glad it helped!
@@makgaiduk by any chance, do you plan to go over SOTA segmentation models as well?
@@Taehyoung_Kim I have that in my plan. I plan to make videos about BERT, Co-DETR, Grounding DINO, and probably Mamba for Vision, and then start digging into segmentation models. It will also probably take some time to get up to speed on all the concepts before doing SOTA. I am making one video per week, so we are looking at something like 2-3 months at least
@@makgaiduk nice! I’ll stay tuned
In the deformable convolution, I still don't get how the "offset branch" calculates the offset map via a convolution kernel of the same size as the original one. How is its output rearranged to match the specific pixel offsets?
EDIT:
I think it is the following:
N refers to the number of kernel elements (e.g. 9 for a 3x3 kernel), and 2 to the x and y offsets. So channels 1 and 2 refer to the x and y offsets for the top-left position of the kernel.
Then, the spatial dimensions of the offset map correspond to the current position of the sliding kernel. Thus, the first 2 channels of the top-left value in the offset map determine the x and y offsets of the top-left kernel element when the kernel is at its first position during sliding
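To sanity-check the shape bookkeeping, here is a tiny NumPy sketch of that reading of the offset map. The channel ordering (y-offset then x-offset per kernel point) is an assumption for illustration; real implementations may interleave the 2N channels differently:

```python
import numpy as np

# Assumed setup: 3x3 kernel, so N = 9 sampling points and the offset
# branch outputs 2N = 18 channels over the same spatial grid as the
# main convolution's output.
H, W, N = 5, 5, 9
offset_map = np.zeros((2 * N, H, W))  # stand-in for the offset branch output

# Offsets for the top-left kernel element (point n = 0) when the kernel
# sits at its first sliding position, i.e. output pixel (0, 0):
n = 0
dy = offset_map[2 * n, 0, 0]      # channel 2n   -> y-offset of point n
dx = offset_map[2 * n + 1, 0, 0]  # channel 2n+1 -> x-offset of point n
```

So each spatial location of the offset map holds a full set of 2N per-point offsets for one sliding position of the kernel, which matches the interpretation above.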
Good question. I guess I should do a "deformable convolution" code read
@@makgaiduk would be cool, but maybe not that relevant anymore... I also think it's written in CUDA, as they did for deformable attention, because of the bilinear interpolation thingy
This was amazing, nice work - I really appreciate it. Please continue with the vids :)
Thank you for this insightful video! The explanations are clear and easy to follow. Love it!
Regarding the object detection task, especially for detecting stacked or cluttered items, would a DETR-based model be more suitable than YOLO?
By design and reported metrics, more advanced DETR-based models like DINO or Co-DETR should be better.
Depending on what sort of data you have, you might also take a look at multi-modal models like OpenAI's CLIP or Grounding DINO; they might get better accuracy without finetuning
@@makgaiduk Got it. Thank you for sharing❤
this is awesome !!
Great video!!
Glad it was useful! And thanks for commenting!