DETR: End-to-End Object Detection with Transformers (Paper Explained)

Swin Transformer

DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

Что будет если украсть в магазине шоколадку 🍫

“Don’t stop the chances.”

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

DETR - End to end object detection with transformers (ECCV2020)

Nicolas Carion

Переглядів 24 410

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 17 гру 2024

КОМЕНТАРІ • 25

@fire_nakamura 14 днів тому ⁺¹
I'm fascinated by you and your team members' craft, with tweaks on loss, ideas of encodings and sufficient amount of data, applications will be huge. I would love to learn and explore those possibilities, Isn’t there anyway to be a part of your team or contribute to any related projects?
@kvnptl4400 6 місяців тому
A very nice presentation with clear visualizations and easy-to-understand explanations! Great Work!!🌟🌟🌟🌟🌟
Smooth animations 👌
@QuintinMassey 2 роки тому ⁺³
Outstanding work. I’m also very interested in the, arguably more difficult, small object detection problem.
@syedabdul8509 3 роки тому ⁺⁷
Excellent Explanation.
But I want to know the most important thing in this video,
How did you create those cool animations like @1:58-@2:20 and @8:00-@8:05
@praveen9083 3 роки тому ⁺²
I'm expecting this answer too!
@nicollenunes4459 11 місяців тому
@@praveen9083 me 2!
@azharhussian4326 2 місяці тому
anyone has idea?
@MarioHari 4 роки тому ⁺²
Nice work!
A small correction to what you said: "Semantic segmentation labels each pixel in the whole image. It is not restricted to only pixels in the background".
@nicolascarion3111 4 роки тому ⁺⁵
You're right, my statement is imprecise. I meant that semantic annotations of foreground classes are not used in the panoptic task.
@MarioHari 4 роки тому
@@nicolascarion3111 merci infiniment :)
@ujjalkrdutta7854 2 роки тому
@@nicolascarion3111 Can we then say that: "Panoptic Segmentation= Instance Segmentation+Semantic Segmentation minus annotations of foreground classes" ?
@Ramakrishnan-bq9is 3 роки тому ⁺¹
Thanks for sharing!
Could you please explain what you mean by full differentiable and how other methods might not be fully differentiable?
@goldenshale 2 роки тому
This is an end to end neural network defined by functions which all have derivatives. In the R-CNN family of algorithms you have one procedure that produces a bunch of region proposals, then you crop out these regions and feed them to a classifier, and then you run another algorithm to prune out overlapping and low confidence predictions. Since there are multiple steps that have logical rather than mathematical implementations, you can't take derivatives all the way through to back propagate information through the whole system.
@morancium 26 днів тому
WoW thankyou for your contribution!
@Nino234mff 3 роки тому
Thank you for the great work and the presentation!
@kaceangelo132 3 роки тому
i realize it is quite off topic but do anyone know of a good website to watch new movies online ?
@bakercain265 3 роки тому
@Kace Angelo try Flixzone. Just google for it =)
@chandrahasp6697 Рік тому
Really good work!
@ujjalkrdutta7854 2 роки тому
Elegant explanation. liked it
@rohinim7707 4 роки тому ⁺¹
Amazing! What was the main motivation behind using a sequence model for an object detection?
@redjammie8342 4 роки тому
It is not a sequence model. It was successfully used for sequences, but it's not a sequence model by definition.
@ZobeirRaisi 4 роки тому ⁺¹
What this mean?: "since the transformer is a permutation
equivalent some extra care is required to retain
the 2d structure of the image."
@nicolascarion3111 4 роки тому ⁺⁷
The transformer isn't aware of the 2D structure of the image, because 1) we flatten it and 2) permuting the inputs of a transformer simply permutes its outputs (permutation equivariance). That's why we add 2D positional encodings. This is similar to what is done in NLP, to retain the order of the sentence.
@ZobeirRaisi 4 роки тому ⁺¹
@@nicolascarion3111 Thanks for your explanation. I have another question: Right now DETR because of rectangle bboxes of COCO-dataset produces rectangle-bboxes outputs, if we had polygon bboxes (8 points), which parts of the architecture must be modified to output a polygon shape bboxes?
@nicolascarion3111 4 роки тому ⁺⁴
@@ZobeirRaisi Well you need to modify the regression head as well as the loss and matching function (GiOU may not make sense anymore, so you'll likely have to stick to L1). For this kind of questions, it's best to open an issue on our github. Thanks!

Наступне

Автоматичне відтворення

DETR: End-to-End Object Detection with Transformers (Paper Explained)

DETR: End-to-End Object Detection with Transformers (Paper Explained)

Swin Transformer

Swin Transformer

DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

Что будет если украсть в магазине шоколадку 🍫

Что будет если украсть в магазине шоколадку 🍫

“Don’t stop the chances.”

“Don’t stop the chances.”

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

вернулись в ПРОШЛОЕ 🔃 | WICSUR #shorts

вернулись в ПРОШЛОЕ 🔃 | WICSUR #shorts

RT DETR - realtime object detection with transformers

RT DETR - realtime object detection with transformers

Vision Transformer in PyTorch

Vision Transformer in PyTorch

Attention in transformers, visually explained | DL6

Attention in transformers, visually explained | DL6

How to Train DETR Object Detection Transformer on Custom Dataset

How to Train DETR Object Detection Transformer on Custom Dataset

Real Time Detection Transformer (RT-DETR) | Episode 42

Real Time Detection Transformer (RT-DETR) | Episode 42

DINO: Self-Supervised Vision Transformers

DINO: Self-Supervised Vision Transformers

[CVPR 2024] RT-DETR, DETRs Beat YOLOs on Real-time Object Detection.

[CVPR 2024] RT-DETR, DETRs Beat YOLOs on Real-time Object Detection.

Object Detection introduction and an overview | Essentials of Object Detection

Object Detection introduction and an overview | Essentials of Object Detection

Object detection Using Detection Transformer (Detr) on custom dataset

Object detection Using Detection Transformer (Detr) on custom dataset

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

«Я жити не хочу»: винесли «з нуля» пораненого побратима #shorts

«Я жити не хочу»: винесли «з нуля» пораненого побратима #shorts

Нельзя смеяться | Смех с водой | 97 #shorts

Нельзя смеяться | Смех с водой | 97 #shorts

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Unexpected way to open the new Audi A6 e-tron Frunk 😮! #shorts

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Перший наступ КНДРівців

Перший наступ КНДРівців

«Шнурки не зрізайте, акуратненько»: медик про реакцію військових на поранення #shorts

«Шнурки не зрізайте, акуратненько»: медик про реакцію військових на поранення #shorts

Ердоган ЖОРСТКО поставив на МІСЦЕ Путіна! В Кремлі терміново ГОТУЮТЬСЯ закінчувати ВІЙНУ.

Ердоган ЖОРСТКО поставив на МІСЦЕ Путіна! В Кремлі терміново ГОТУЮТЬСЯ закінчувати ВІЙНУ.