[ICASSP 2020] Streaming Automatic Speech Recognition with the Transformer Model

Mitsubishi Electric Research Labs (MERL)

5,776 views

Published 26 Apr 2020
MERL researcher Niko Moritz presents his paper titled "Streaming Automatic Speech Recognition with the Transformer Model" for the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), held virtually May 4-8, 2020. The paper was co-authored with MERL researchers Takaaki Hori and Jonathan Le Roux.
Paper: ieeexplore.iee..., www.merl.com/p...
Abstract: Encoder-decoder based sequence-to-sequence models have demonstrated state-of-the-art results in end-to-end automatic speech recognition (ASR). Recently, the transformer architecture, which uses self-attention to model temporal context information, has been shown to achieve significantly lower word error rates (WERs) compared to recurrent neural network (RNN) based system architectures. Despite its success, the practical usage is limited to offline ASR tasks, since encoder-decoder architectures typically require an entire speech utterance as input. In this work, we propose a transformer based end-to-end ASR system for streaming ASR, where an output must be generated shortly after each spoken word. To achieve this, we apply time-restricted self-attention for the encoder and triggered attention for the encoder-decoder attention mechanism. Our proposed streaming transformer architecture achieves 2.8% and 7.2% WER for the "clean" and "other" test data of LibriSpeech, which to our knowledge is the best published streaming end-to-end ASR result for this task.
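The time-restricted self-attention mentioned in the abstract can be pictured as a band-shaped mask: each encoder frame may attend only to a limited number of past and future frames, so the encoder never needs the whole utterance. The following is a minimal illustrative sketch in plain Python, not the authors' implementation. The function names `time_restricted_mask` and `masked_softmax` and the parameters `left_context` and `right_context` are hypothetical and do not come from the paper, and the window sizes are arbitrary.

```python
import math


def time_restricted_mask(num_frames, left_context, right_context):
    """Build a boolean mask where mask[i][j] is True if frame i may attend to frame j.

    Hypothetical helper for illustration: frame i sees at most `left_context`
    past frames and `right_context` future (look-ahead) frames.
    """
    return [
        [(i - left_context) <= j <= (i + right_context) for j in range(num_frames)]
        for i in range(num_frames)
    ]


def masked_softmax(scores, allowed):
    """Softmax over one row of attention scores; disallowed positions get weight 0."""
    visible = [s for s, ok in zip(scores, allowed) if ok]
    peak = max(visible)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) if ok else 0.0 for s, ok in zip(scores, allowed)]
    total = sum(exps)
    return [e / total for e in exps]


# Assumed toy setup: 6 frames, 2 frames of history, 1 frame of look-ahead.
mask = time_restricted_mask(num_frames=6, left_context=2, right_context=1)

# With uniform scores, frame 3 spreads its attention evenly over frames 1..4.
weights = masked_softmax([0.0] * 6, mask[3])
print(mask[3])
print(weights)
```

In a sketch like this, a smaller `right_context` lowers the look-ahead each frame needs, at the cost of less future context for every frame.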
Science & Technology
