Localizing and Editing Knowledge in LLMs with Peter Hase - 679

  • Published May 30, 2024
  • Today we're joined by Peter Hase, a fifth-year PhD student in the University of North Carolina's NLP lab. We discuss "scalable oversight" and the importance of developing a deeper understanding of how large neural networks make decisions. We learn how interpretability researchers probe a model's weight matrices, and explore the two schools of thought on how LLMs store knowledge. Finally, we discuss the importance of deleting sensitive information from model weights, and how "easy-to-hard generalization" could increase the risks of releasing open-source foundation models.
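    For context on "probing": interpretability researchers often fit a small supervised classifier (a "probe") on a network's frozen hidden activations to test whether some property is linearly decodable from them. Below is a minimal sketch with synthetic activations standing in for real hidden states; the dimensions, data, and planted "property" are illustrative assumptions, not anything from the episode.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-in for hidden states: 1000 examples of 768-dim activations.
# In practice these would be captured from a real model with forward hooks.
X = rng.normal(size=(1000, 768))

# Plant a binary property (e.g., "the subject token names a city")
# along a random direction, with some label noise.
w_true = rng.normal(size=768)
y = (X @ w_true + 0.5 * rng.normal(size=1000) > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The probe itself: a linear classifier trained on the frozen activations.
# High held-out accuracy suggests the property is linearly decodable.
probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(f"probe test accuracy: {probe.score(X_test, y_test):.3f}")
```

    Note that high probe accuracy only shows the property is decodable from the activations, not that the model actually uses it; that gap is one reason the episode also covers causal editing methods.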
    🔔 Subscribe to our channel for more great content just like this: ua-cam.com/users/twimlai?sub_confi...
    🗣️ CONNECT WITH US!
    ===============================
    Subscribe to the TWIML AI Podcast: twimlai.com/podcast/twimlai/
    Join our Slack Community: twimlai.com/community/
    Subscribe to our newsletter: twimlai.com/newsletter/
    Want to get in touch? Send us a message: twimlai.com/contact/
    Follow us on Twitter: / twimlai
    Follow us on LinkedIn: / twimlai
    📖 CHAPTERS
    ===============================
    00:00 - Introduction
    03:57 - Knowledge localization in LLMs
    14:16 - Model editing methods
    29:11 - Deleting information from model weights
    33:17 - Scalable oversight and easy-to-hard generalization
    46:29 - Shoutouts
    48:00 - Different frameworks for LLM reasoning
    49:45 - Conclusion
    🔗 LINKS & RESOURCES
    ===============================
    Peter Hase's personal page - peterbhase.github.io/
    The Unreasonable Effectiveness of Easy Training Data for Hard Tasks - arxiv.org/abs/2401.06751
    Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback - arxiv.org/abs/2307.15217
    A Unified Framework for Model Editing (MEMIT) - arxiv.org/abs/2403.14236
    Locating and Editing Factual Associations in GPT (ROME) - arxiv.org/abs/2202.05262 (a rank-one editing sketch follows below)
    📸 Camera: amzn.to/3TQ3zsg
    🎙️Microphone: amzn.to/3t5zXeV
    🚦Lights: amzn.to/3TQlX49
    🎛️ Audio Interface: amzn.to/3TVFAIq
    🎚️ Stream Deck: amzn.to/3zzm7F5
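    As a concrete illustration of the "locating and editing" line of work linked above: ROME edits a fact by applying a rank-one update to a single MLP weight matrix so that a chosen "key" activation (encoding the subject) maps to a new "value" activation (encoding the edited fact). The numpy sketch below shows only the underlying rank-one trick, with illustrative dimensions and random vectors; the real method also whitens the key using covariance statistics gathered over many inputs, which is omitted here.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out = 64, 48

W = rng.normal(size=(d_out, d_in))  # stand-in for an MLP projection matrix
k = rng.normal(size=d_in)           # "key": activation selecting the subject
v = rng.normal(size=d_out)          # "value": activation encoding the new fact

# Minimal-Frobenius-norm rank-one update enforcing W_new @ k == v.
W_new = W + np.outer(v - W @ k, k) / (k @ k)

print(np.allclose(W_new @ k, v))        # True: the edited "fact" is stored
print(np.linalg.norm(W_new - W, "fro")
      / np.linalg.norm(W, "fro"))       # relative size of the perturbation
```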
  • Science & Technology

COMMENTS • 1

  • @squarehead6c1 · A month ago

    Although it is interesting to investigate the internal properties of deep neural networks, in practice it seems very difficult to guarantee that a fact has been completely removed from an LLM. Conversely, it would be interesting if one could find a way to "clamp down" facts in LLMs such that the model always returns the same (correct) fact regardless of how the question is formulated. This would possibly require an adapted neural-network structure for the model.
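    The commenter's first point can at least be spot-checked behaviorally: prompt the (edited) model with paraphrases of the same question and see whether the supposedly deleted fact still surfaces. A minimal sketch using Hugging Face transformers, with GPT-2 and the Eiffel Tower/"Paris" fact as illustrative stand-ins; as the comment notes, passing such a check still does not guarantee the fact is gone from the weights.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

# Paraphrases that should all elicit the fact if it is still present.
paraphrases = [
    "The Eiffel Tower is located in the city of",
    "Where is the Eiffel Tower? It is in",
    "Tourists who want to see the Eiffel Tower travel to",
]

for prompt in paraphrases:
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model.generate(ids, max_new_tokens=3, do_sample=False,
                             pad_token_id=tok.eos_token_id)
    completion = tok.decode(out[0, ids.shape[1]:])
    print(f"{prompt!r} -> {completion!r}  leaked: {'Paris' in completion}")
```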