Sepp Hochreiter: Memory Architectures for Deep Learning
- Published 24 Nov 2024
- Currently, the most successful Deep Learning architecture is the transformer. The attention mechanism of the transformer is equivalent to modern Hopfield networks and is therefore an associative memory. However, this associative memory has disadvantages: quadratic complexity in the sequence length when mutually associating sequence elements, a restriction to pairwise associations, limited ability to modify the memory, and insufficient abstraction capabilities. In contrast, recurrent neural networks (RNNs) like LSTMs have linear complexity, associate each sequence element with a representation of all previous elements, can directly modify memory content, and have high abstraction capabilities. However, RNNs cannot store sequence elements that were rare in the training data, since RNNs have to learn to store. Transformers can store rare or even new sequence elements, which, besides their high parallelization, is one of the main reasons why they outperformed RNNs in language modelling. Future successful Deep Learning architectures should comprise both of these memories: attention for implementing episodic memories and RNNs for implementing short-term memories and abstraction.
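The contrast between the two memory types can be made concrete with a minimal NumPy sketch (not part of the lecture; all names and shapes are illustrative): attention builds an explicit T × T association matrix over the sequence, hence its quadratic cost and pairwise associations, while an RNN-style recurrence compresses all previous elements into a fixed-size hidden state in a single linear pass.

```python
# Illustrative sketch only: contrasts attention's pairwise (quadratic) memory
# with an RNN's fixed-size recurrent (linear) memory. Not the lecture's code.
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention(X, Wq, Wk, Wv):
    """Associative memory: every element attends to every other element."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])   # (T, T) matrix -> quadratic in T
    return softmax(scores, axis=-1) @ V

def rnn(X, Wx, Wh, b):
    """Recurrent memory: a fixed-size state summarizes all previous elements."""
    h = np.zeros(Wh.shape[0])
    outputs = []
    for x_t in X:                              # single pass -> linear in T
        h = np.tanh(Wx @ x_t + Wh @ h + b)     # memory is modified in place
        outputs.append(h)
    return np.stack(outputs)

T, d = 6, 4
rng = np.random.default_rng(0)
X = rng.normal(size=(T, d))
Y_attn = attention(X, *(rng.normal(size=(d, d)) for _ in range(3)))
Y_rnn = rnn(X, rng.normal(size=(d, d)), rng.normal(size=(d, d)), np.zeros(d))
print(Y_attn.shape, Y_rnn.shape)               # (6, 4) (6, 4)
```

In the sketch, the attention output for each position depends on an explicit score against every stored element, which is why rare or new elements can still be retrieved, whereas the recurrent state must learn during training what to store.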
👉 More information about the lecture series "Machines that understand?": dm.cs.univie.a...
👉 Research Group Data Mining and Machine Learning at the University of Vienna: dm.cs.univie.a...
👉 Playlist Machines that understand? • Was bedeutet Generativ...