What Can We Learn From Subtitled Sign Language Data? Gül Varol, Asst. Prof @ École des Ponts ParisTech

  • Published 3 Jun 2024
  • Gül Varol is a research faculty member on the IMAGINE team at École des Ponts ParisTech. Previously, she was a postdoctoral researcher in the Visual Geometry Group (VGG) at the University of Oxford. She obtained her PhD from the WILLOW team of Inria Paris and École Normale Supérieure. Her research focuses on human understanding in videos, specifically action recognition, body shape and motion analysis, and sign languages.
    Abstract
    In this talk, Prof. Gül Varol presents her group's work on automatic sign language analysis that leverages weakly aligned subtitles accompanying broadcast footage.
    In her own words,
    We first use subtitles to provide candidate keywords with which to search for and localise individual signs, following two different approaches: (i) using mouthing cues at the lip region and (ii) looking up videos from sign language dictionaries. We then attempt to train a direct video-to-text sign language translation Transformer on this unconstrained data. We observe that, while the translation performance is low, a sign localisation ability emerges from the attention mechanism (iii). These three approaches allow us to automatically annotate 1 million video-sign pairs, which we use to train strong sign recognition models for a vocabulary of over 1,000 signs. However, the subtitles remain noisy, especially their alignment with the interpreted signing video when they are derived from speech. Therefore, we have recently tackled the problem of automatic subtitle alignment, i.e., temporally localising a sequence of text within a long continuous sign language video (iv). I will summarise results from the papers listed below and conclude by discussing open problems.
    (i) lnkd.in/gBPySa8a
    (ii) lnkd.in/g-i2zT3N
    (iii) lnkd.in/gFNqrBnk
    (iv) lnkd.in/gF37qwjF
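
    To make approach (iii) more concrete, below is a minimal, hypothetical Python sketch of how cross-attention weights from a video-to-text translation Transformer could be turned into a sign localisation: the attention distribution of the decoder query for a subtitle keyword over the per-frame encoder outputs is read off, and the most-attended frames are taken as the candidate sign location. The tensors and names here are illustrative placeholders (random features standing in for a trained model), not the actual models or code from the papers above.

    import torch
    import torch.nn.functional as F

    T_FRAMES, D = 128, 256   # number of video frames and feature dimension
    torch.manual_seed(0)

    # Placeholders for real features: per-frame encoder outputs of a video
    # backbone, and the decoder hidden state at the step emitting the keyword.
    video_feats = torch.randn(T_FRAMES, D)   # (T, D) encoder memory
    keyword_query = torch.randn(D)           # (D,) query for e.g. the word "happy"

    def localise_from_attention(query, memory, window=8):
        # Scaled dot-product attention of one text query over all video frames.
        scores = memory @ query / D ** 0.5    # (T,) similarity per frame
        attn = F.softmax(scores, dim=0)       # attention distribution over frames
        centre = int(attn.argmax())           # most-attended frame index
        start = max(0, centre - window // 2)
        end = min(len(memory), centre + window // 2)
        return start, end, attn

    start, end, attn = localise_from_attention(keyword_query, video_feats)
    print(f"candidate sign localised to frames [{start}, {end})")

    In the work described above, the localisation ability emerges from a Transformer trained only on the translation objective; this sketch only illustrates how an attention distribution over frames can be converted into a temporal window once such a model is available.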