ICNLSP 2023: Comparing Modular and End-To-End Approaches in ASR for Well-Resourced and ...

ICNLSP Conference

Views: 11

Published 14 Oct 2024
Title of the presentation: Comparing Modular and End-To-End Approaches in ASR for Well-Resourced and Low-Resourced Languages.
By: Aditya Parikh, Louis ten Bosch, Henk van den Heuvel, Cristian Tejedor-Garcia,
Centre for Language and Speech Technology, Radboud University, Nijmegen, The Netherlands.
6th International Conference on Natural Language and Speech Processing.
icnlsp.org/202...
Abstract:
We present a comparative study of a state-of-the-art traditional modular Automatic Speech
Recognition system (Kaldi ASR) and an end-to-end ASR system (wav2vec 2.0) for a well-resourced
language (Spanish) and a low-resourced language (Irish). We created ASRs for both languages
and evaluated their performance under different update regimes. Our results show that the
end-to-end wav2vec 2.0 outperforms the modular ASR for both languages in terms of Word
Error Rate (WER) but performs worse in terms of real-time decoding. We also addressed the
issue of non-lexical words in wav2vec 2.0's output. We found that integrating a language model
(LM) into wav2vec 2.0 with shallow fusion and increasing the LM weight to 0.7 for Spanish and
0.8 for Irish gave the best ASR performance by reducing non-lexical words.
However, this does not eliminate all non-lexical words. Finally, our study found that, on
minimal infrastructure, Kaldi ASR performs better for real-time decoding of longer audio inputs
than a wav2vec 2.0 model trained on the same dataset, although wav2vec 2.0's performance can be
improved with GPU acceleration in the backend. These results may have
significant implications for creating real-time ASR services, especially for low-resourced languages.
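
The two quantities the abstract relies on, WER and an LM-weighted hypothesis score, can be illustrated with a small sketch. This is not the authors' pipeline: the function names, the toy unigram LM, the out-of-vocabulary penalty and the example scores are all hypothetical. It also simplifies shallow fusion, which normally combines acoustic and LM scores at each beam-search step, to rescoring a finished n-best list with the same combination `am_logprob + lm_weight * lm_logprob`.

```python
from typing import Callable, List, Tuple


def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: word-level edit distance divided by reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the previous ref prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr[j] = min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            )
        prev = curr
    return prev[-1] / len(ref)


def shallow_fusion_rescore(
    nbest: List[Tuple[str, float]],
    lm_logprob: Callable[[str], float],
    lm_weight: float,
) -> str:
    """Return the hypothesis maximising am_logprob + lm_weight * lm_logprob."""
    best_text, _ = max(nbest, key=lambda h: h[1] + lm_weight * lm_logprob(h[0]))
    return best_text


# Hypothetical toy LM: known words get a mild log-probability,
# non-lexical (out-of-vocabulary) words get a heavy penalty.
TOY_VOCAB = {"hola": -1.0, "mundo": -1.0}
OOV_LOGPROB = -10.0


def toy_lm_logprob(text: str) -> float:
    return sum(TOY_VOCAB.get(word, OOV_LOGPROB) for word in text.split())


# Hypothetical n-best list of (text, acoustic log-probability) pairs.
NBEST = [("ola mundo", -1.0), ("hola mundo", -1.2)]

if __name__ == "__main__":
    for weight in (0.0, 0.7):
        best = shallow_fusion_rescore(NBEST, toy_lm_logprob, weight)
        print(weight, best, wer("hola mundo", best))
```

With an LM weight of 0 the acoustically preferred but non-lexical "ola mundo" wins; with a weight of 0.7 the LM penalty on the unknown word shifts the choice to "hola mundo". The weight value here only mirrors the figure quoted in the abstract and says nothing about how the authors tuned it.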
