So basically an AlphaGo Master (?) architecture? It seems like AlphaGo Zero was kind of an appendix, in the sense that it just got rid of its planner-driven System 2 in favor of a hugely overgrown System 1. That's good enough against humans, who can't possibly analyse that many possibilities either and who often revert to System 1 themselves. But maybe that's actually an inferior architecture for generalization, at least until somebody actually makes progress on NN-driven System 2s.
I think you have shown that LLMs cannot reason or plan by your definition of planning. But they can compose essays, and doesn't the very act of composing an essay involve a kind of planning -- the organization of ideas, breaking them down into paragraphs, and expressing them through carefully chosen words? They seem to be doing planning, just in the domain of words and linguistically expressed ideas.
What you describe can be extracted statistically: given enough essay training data, you can learn where to put which words to produce something that looks like a convincing essay, without ever really thinking about creating an essay.
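To make that concrete, here's a toy sketch (purely illustrative, nobody's actual model): a trigram model that "places words" from co-occurrence counts alone, with no outline, goal, or plan behind it.

```python
# Toy illustration: a trigram model that "places words" purely from
# co-occurrence counts. It has no plan or intent; it only knows which
# word tends to follow which pair of words in its training text.
import random
from collections import defaultdict

training_text = (
    "the essay begins with a thesis . the thesis is developed in "
    "paragraphs . each paragraph supports the thesis . the essay "
    "ends with a conclusion ."
).split()

# Count every observed continuation of each two-word context.
continuations = defaultdict(list)
for a, b, c in zip(training_text, training_text[1:], training_text[2:]):
    continuations[(a, b)].append(c)

# Generate by repeatedly sampling a plausible next word.
state = ("the", "essay")
output = list(state)
for _ in range(15):
    next_word = random.choice(continuations.get(state, ["."]))
    output.append(next_word)
    state = (state[1], next_word)

print(" ".join(output))
```

Scale the same idea up by many orders of magnitude and the output can look organized without any explicit planning step ever happening.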
Prof. Rao, I've had a short discussion with Liron Shapira, and we were wondering if you feel strongly enough about this argument to make a prediction about what GPT-5 *won't* be able to do. Assuming GPT-5 is just a bigger transformer with more training data, more parameters, and better RLHF, would you predict that it still won't be able to solve your Randomized Mystery Blocksworld problems past, say, 10%?
Does solving 10% of the problems make it impressive?
@@billykotsos4642 Maybe not impressive, but it would be surprising. At 20:16, Rao shows that GPT-4 only gets to 2% on the Randomized Mystery Blocksworld, while humans solve it at close to 100%. Going from 2% to 10% would at least be a bit of a signal that there's more to transformer-based LLMs than expected.
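For anyone unfamiliar with the benchmark: as I understand it, the "Mystery" variants keep the planning problem identical but consistently rename the actions and objects, so surface-level pattern matching stops helping. A rough sketch of the idea (the token names below are made up, not the paper's actual obfuscation):

```python
# Rough sketch of the "Mystery Blocksworld" idea: consistently rename
# the domain vocabulary with meaningless tokens. The underlying
# planning problem is unchanged; only the surface form differs.
# (Token names here are invented for illustration.)
import random

plain = "unstack A from B . put-down A . pick-up C . stack C on B ."
vocabulary = ["unstack", "from", "put-down", "pick-up", "stack", "on",
              "A", "B", "C"]

rng = random.Random(0)
obfuscated = [f"tok{n}" for n in rng.sample(range(100), len(vocabulary))]
rename = dict(zip(vocabulary, obfuscated))

mystery = " ".join(rename.get(word, word) for word in plain.split())
print(plain)
print(mystery)
```

A solver that actually reasons over the action semantics does equally well on both versions, which is why the 2% vs. near-100% gap is the telling number.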
@@BrianPeiris I wonder how o1 would fare here.
@@billykotsos4642 Indeed, or o3 for that matter. I would also like to see updated stats on the Blocksworld problems, but the ARC-AGI scores for o3 are pretty surprising. Chollet thinks that ARC-AGI-2 will bring the scores down considerably though, so it's possible that Blocksworld is still a challenge.
@@BrianPeiris I just had a look, and there is a new paper on arXiv by the author covering o1-preview. It seems there is a significant step up compared to plain LLMs (the paper calls o1-like models LRMs, 'Large Reasoning Models'). I need to go through the paper thoroughly though… planning is clearly something that simple LLMs, and even LRMs, can't do out of the box. It would also be great to see how the DeepSeek models fare on these benchmarks.