DINOv2 Explained: Visual Model Insights & Comprehensive Code Guide

  • Published 17 Dec 2024

COMMENTS • 21

  • @adityapillai3091
    @adityapillai3091 3 months ago +1

    Really good explanation. Would love to see you make more videos. You're very clear, and the visual content you present is easily digestible.

    • @aiape6954
      @aiape6954  3 months ago

      Thank you! Started at a start-up and it has eaten my time lol

    • @adityapillai3091
      @adityapillai3091 3 months ago

      @aiape6954 Start-up grind ain't no joke fr

  • @Erosis
    @Erosis 1 year ago +4

    Awesome explanation!

  • @arseniypolyubin7076
    @arseniypolyubin7076 1 year ago +1

    Thanks a lot for this video!

  • @leeyuguang4424
    @leeyuguang4424 1 year ago +4

    How is it different from DINO itself? I wish there were more explanation.

  • @pratyushk2693
    @pratyushk2693 6 months ago

    Really easy to understand! Thanks!

  • @零鱼芃
    @零鱼芃 11 months ago +2

    Amazing work! I really want to know how to choose the cropping parameters for different datasets. Is it entirely based on experience?

    • @aiape6954
      @aiape6954  11 months ago +2

      The research does not explain any optimization strategy for tuning these parameters, so you have to assume it's some mixture of intuition and trial and error. I would be interested in applying an evolutionary algorithm to find the best parameter set and seeing whether you can push DINO performance further.
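
  For reference, the "cropping parameters" in this thread are the crop-scale ranges of the multi-crop augmentation. A minimal sketch, assuming torchvision and using roughly the DINO defaults rather than tuned values:

    # DINO-style multi-crop augmentation sketch (illustrative, not the exact
    # DINOv2 pipeline). The scale ranges are the parameters discussed above.
    from torchvision import transforms

    def make_multicrop(global_scale=(0.4, 1.0), local_scale=(0.05, 0.4),
                       n_local=8, global_size=224, local_size=96):
        global_crop = transforms.Compose([
            transforms.RandomResizedCrop(global_size, scale=global_scale),
            transforms.RandomHorizontalFlip(),
            transforms.ToTensor(),
        ])
        local_crop = transforms.Compose([
            transforms.RandomResizedCrop(local_size, scale=local_scale),
            transforms.RandomHorizontalFlip(),
            transforms.ToTensor(),
        ])

        def apply(img):
            # 2 global views plus several local views, as in DINO/DINOv2 training
            return ([global_crop(img), global_crop(img)]
                    + [local_crop(img) for _ in range(n_local)])

        return apply

  A parameter search (random, grid, or the evolutionary idea above) would sweep global_scale and local_scale and compare a downstream metric such as k-NN accuracy.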

  • @elahe4737
    @elahe4737 1 year ago +1

    Thank you so much. It was clear and interesting. I have a question, please: is it possible to modify the attention maps in this model?

    • @aiape6954
      @aiape6954  1 year ago

      Check out this repo! I use it all the time.
      github.com/ShirAmir/dino-vit-features/tree/main
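
  For inspecting the attention maps directly, a minimal sketch using the original DINO ViT-S/8 from torch.hub, which exposes a get_last_selfattention helper (the DINOv2 hub models do not ship the same helper, so DINO v1 is used here for illustration; the image path is a placeholder):

    # Visualize the last-layer CLS self-attention of DINO ViT-S/8.
    import torch
    from PIL import Image
    from torchvision import transforms

    model = torch.hub.load('facebookresearch/dino:main', 'dino_vits8').eval()

    prep = transforms.Compose([
        transforms.Resize((480, 480)),
        transforms.ToTensor(),
        transforms.Normalize((0.485, 0.456, 0.406), (0.229, 0.224, 0.225)),
    ])
    img = prep(Image.open('example.jpg').convert('RGB')).unsqueeze(0)  # (1, 3, 480, 480)

    with torch.no_grad():
        attn = model.get_last_selfattention(img)     # (1, heads, tokens, tokens)

    patch = 8
    h, w = img.shape[-2] // patch, img.shape[-1] // patch
    cls_attn = attn[0, :, 0, 1:].reshape(-1, h, w)   # CLS-to-patch attention per head
    print(cls_attn.shape)                            # (n_heads, 60, 60)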

  • @DevelopmentTeam-b8x
    @DevelopmentTeam-b8x 1 year ago +1

    Well explained!!

  • @VLM234
    @VLM234 10 months ago

    That's a great explanation. Are you planning to make a video on the Florence-2 model? I would love to see one for a livestock use case.

  • @Kofi-qu9zc
    @Kofi-qu9zc 6 months ago

    Hi, great video. A tangential question: I am trying to use the pretrained DINOv2 base model from Hugging Face on the Broad Institute's BBBC021 dataset of MCF7 breast cancer cells, and I'm finding that the CLS embeddings, when clustered, don't align with the labels (MoAs) in the dataset... Given your experience with DINO, do you think this is due to the cropping strategy used in the pretrained model, and would I have to retrain a bare-bones DINOv2 model on millions of microscopy images to get classification to work?
    Thanks for any help!
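
  For reference, a minimal sketch of the workflow described above, assuming the Hugging Face transformers API and the facebook/dinov2-base checkpoint; the image paths and cluster count are placeholders:

    # CLS embeddings from pretrained DINOv2 (Hugging Face), then k-means clustering.
    import torch
    from PIL import Image
    from sklearn.cluster import KMeans
    from transformers import AutoImageProcessor, AutoModel

    processor = AutoImageProcessor.from_pretrained('facebook/dinov2-base')
    model = AutoModel.from_pretrained('facebook/dinov2-base').eval()

    def cls_embeddings(paths):
        feats = []
        for p in paths:
            inputs = processor(images=Image.open(p).convert('RGB'), return_tensors='pt')
            with torch.no_grad():
                out = model(**inputs)
            feats.append(out.last_hidden_state[:, 0])    # CLS token embedding
        return torch.cat(feats).numpy()

    image_paths = ['cell_0.png', 'cell_1.png']           # placeholder microscopy images
    embeddings = cls_embeddings(image_paths)
    labels = KMeans(n_clusters=2, n_init=10).fit_predict(embeddings)  # n_clusters ~ number of MoAs
    print(labels)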

  • @mortezasjah6168
    @mortezasjah6168 6 months ago

    Thank you for wrapping up the code and explanation. Does your code support multi-node training, and is there any difference between your notebook and the DINOv2 code?

  • @vizlifestudios
    @vizlifestudios 3 months ago

    Thank you!

  • @WildWonders7-u9z
    @WildWonders7-u9z 9 months ago

    Hello, I have a paid project on DINO, iBOT, and DINOv2. Will you help?

  • @rickli3746
    @rickli3746 10 months ago

    I wonder if you think DINOv2 could be applied to CNNs?

    • @aiape6954
      @aiape6954  10 months ago +1

      My intuition is that it would work, but not as well as with transformers. Transformers are slow and computationally expensive, but they hold information in a way that CNNs cannot. You're probably better off distilling from a transformer down to a CNN.
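
  A rough sketch of that distillation idea: regress a CNN student's output features onto frozen DINOv2 CLS features. The choice of ResNet-50, the MSE loss, and the 384-dim head are assumptions for illustration:

    # Feature distillation from a frozen DINOv2 ViT-S/14 teacher into a CNN student.
    import torch
    import torch.nn as nn
    from torchvision.models import resnet50

    teacher = torch.hub.load('facebookresearch/dinov2', 'dinov2_vits14').eval()
    for p in teacher.parameters():
        p.requires_grad = False

    student = resnet50(weights=None)
    student.fc = nn.Linear(student.fc.in_features, 384)  # match ViT-S/14 embedding dim

    def distill_step(images, optimizer):
        # images: (B, 3, 224, 224); 224 is divisible by the 14-pixel patch size
        with torch.no_grad():
            target = teacher(images)                  # (B, 384) CLS features
        pred = student(images)                        # (B, 384)
        loss = nn.functional.mse_loss(pred, target)   # a cosine loss also works
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        return loss.item()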

  • @aesaerthherbo3783
    @aesaerthherbo3783 1 year ago +4

    Amazing explanation, but I think you are just explaining DINO instead of DINOv2.

    • @aiape6954
      @aiape6954  1 year ago +1

      Everything in this video applies to both. The process was optimized for DINOv2 but the structure remained the same.