[CVPR 2024] Brain Decodes Deep Nets

[CVPR'24 Highlight] Gaussian Splatting SLAM

[ICASSP XAI-SA 2024] Why does music source separation benefit from cacophony?

Запорізьку АЕС повернуть Україні / Режим припинення вогню

Кто Последний Уснёт - Получит 250.000 Рублей! (Хазяева, Сатир, Кокошка, Дилблин) Часть 1

ТЫ С ДРУГОМ В ДЕТСТВЕ ИГРАЕШЬ В ПРЯТКИ😂#shorts

[CVPR 2024] Long-Tailed Anomaly Detection with Learnable Class Names

Mitsubishi Electric Research Labs (MERL)

Переглядів 356

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 6 тра 2024
MERL Intern Chih-Hui Ho and MERL Researcher Kuan-Chuan Peng present their paper titled "Long-Tailed Anomaly Detection with Learnable Class Names" for the IEEE Computer Vision and Pattern Recognition (CVPR) conference, held in Seattle, WA on June 17-21, 2024. The paper was co-authored with Prof. Nuno Vasconcelos.
Paper: www.merl.com/publications/TR2...
Abstract: Anomaly detection (AD) aims to identify defective images and localize their defects (if any). Ideally, AD models should be able to detect defects over many image classes; without relying on hard-coded class names that can be uninformative or inconsistent across datasets; learn without anomaly supervision; and be robust to the long-tailed distributions of real-world applications. To address these challenges, we formulate the problem of long-tailed AD by introducing several datasets with different levels of class imbalance and metrics for performance evaluation. We then propose a novel method, LTAD, to detect defects from multiple and long-tailed classes, without relying on dataset class names. LTAD combines AD by reconstruction and semantic AD modules. AD by reconstruction is implemented with a transformer-based reconstruction module. Semantic AD is implemented with a binary classifier, which relies on learned pseudo class names and a pretrained foundation model. These modules are learned over two phases. Phase 1 learns the pseudo-class names and a variational autoencoder (VAE) for feature synthesis that augments the training data to combat long-tails. Phase 2 then learns the parameters of the reconstruction and classification modules of LTAD. Extensive experiments using the proposed long-tailed datasets show that LTAD substantially outperforms the state-of-the-art methods for most forms of dataset imbalance. The long-tailed dataset split is available at zenodo.org/records/10854201.
Наука та технологія

КОМЕНТАРІ •

Наступне

Автоматичне відтворення

[CVPR 2024] Brain Decodes Deep Nets

[CVPR 2024] Brain Decodes Deep Nets

[CVPR'24 Highlight] Gaussian Splatting SLAM

[CVPR'24 Highlight] Gaussian Splatting SLAM

[ICASSP XAI-SA 2024] Why does music source separation benefit from cacophony?

[ICASSP XAI-SA 2024] Why does music source separation benefit from cacophony?

Запорізьку АЕС повернуть Україні / Режим припинення вогню

Запорізьку АЕС повернуть Україні / Режим припинення вогню

Кто Последний Уснёт - Получит 250.000 Рублей! (Хазяева, Сатир, Кокошка, Дилблин) Часть 1

Кто Последний Уснёт - Получит 250.000 Рублей! (Хазяева, Сатир, Кокошка, Дилблин) Часть 1

ТЫ С ДРУГОМ В ДЕТСТВЕ ИГРАЕШЬ В ПРЯТКИ😂#shorts

ТЫ С ДРУГОМ В ДЕТСТВЕ ИГРАЕШЬ В ПРЯТКИ😂#shorts

Самое Романтичное Видео ❤️

Самое Романтичное Видео ❤️

Autoencoders | Deep Learning Animated

Autoencoders | Deep Learning Animated

[CVPR 2024] Action Detection via an Image Diffusion Process

[CVPR 2024] Action Detection via an Image Diffusion Process

Anomaly detection in time series with Python | Data Science with Marco

Anomaly detection in time series with Python | Data Science with Marco

Why Computer Vision Is a Hard Problem for AI

Why Computer Vision Is a Hard Problem for AI

Should You Use Open Source Large Language Models?

Should You Use Open Source Large Language Models?

Introduction to Anomaly Detection for Engineers

Introduction to Anomaly Detection for Engineers

A.I. Experiments: Visualizing High-Dimensional Space

A.I. Experiments: Visualizing High-Dimensional Space

[CVPR 2024] Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sa...

[CVPR 2024] Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sa...

ИГРОВОЙ ПК ЗА 10К КОТОРЫЙ ДЕЙСТВИТЕЛЬНО ТАЩИТ В 2024 ГОДУ / СБОРКА ПК ЗА 10000 РУБЛЕЙ by KOMPUKTER

ИГРОВОЙ ПК ЗА 10К КОТОРЫЙ ДЕЙСТВИТЕЛЬНО ТАЩИТ В 2024 ГОДУ / СБОРКА ПК ЗА 10000 РУБЛЕЙ by KOMPUKTER

100+ Linux Things you Need to Know

100+ Linux Things you Need to Know

⚡Контактная сварка медной ленты

⚡Контактная сварка медной ленты

Samsung Z Flip/Fold 6, Watch Ultra, Buds Pro and Ring Impressions!

Samsung Z Flip/Fold 6, Watch Ultra, Buds Pro and Ring Impressions!

ИГРОВОЙ ПК ЗА 10К КОТОРЫЙ ДЕЙСТВИТЕЛЬНО ТАЩИТ В 2024 ГОДУ / СБОРКА ПК ЗА 10000 РУБЛЕЙ by KOMPUKTER

ИГРОВОЙ ПК ЗА 10К КОТОРЫЙ ДЕЙСТВИТЕЛЬНО ТАЩИТ В 2024 ГОДУ / СБОРКА ПК ЗА 10000 РУБЛЕЙ by KOMPUKTER

Мой инст: denkiselef. Как забрать телефон через экран.

Мой инст: denkiselef. Как забрать телефон через экран.

Здесь упор в процессор

Здесь упор в процессор

Забудьте о RX 580 | Тест Nvidia P102, P106 и GTX 1650 Super

Забудьте о RX 580 | Тест Nvidia P102, P106 и GTX 1650 Super