We have a study investigating text classification performance at different levels of reasoning. We do find a positive effect of 0-shot-CoT in these cases, but it is more apparent when 0-shot-CoT is accompanied by n-shot examples. Level-k reasoning is more mechanistic, due to its game-theoretic nature, than, say, sentiment analysis. This may be why CoT is observed to help in our case.
Clear my confusion, I am a newbie in this field: the goal is to maximize reward, and for self-correction we are providing a bonus. With this approach, aren't we encouraging the model to make more mistakes on the first attempt and then self-correct on the second attempt to get the maximum reward?
Great video! Could you share the link to the ChatGPT chat you demonstrated (the share link at the top right, as seen in the video)? It's a little difficult to follow towards the end.
Have they actually achieved any positive results? The most successful RL methods usually limit how much the policy can change per update to avoid instabilities. Anyone can claim anything if there are no results to prove it.
This is awesome. Imagine what a musician who really understands music could do with this tool.
What kind of stuff are you imagining? Musician here
I’ve been thoroughly impressed with o1 and the quality/insights of its responses.
Strawberry is so exhaustive that people will be afraid to ask it questions lol. :)
I love the part where you say "I am sorry, but I am going to ask you something personal, strawberry".
Hope your account is still active 😅
I feel watched...