NEW CriticGPT by OpenAI: RLHF + FSBS

GraphRAG or SpeculativeRAG ?

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

Губин вернулся? Творческий вечер певца

Statue of Liberty Helps Blind Man Cross Road #shorts

Оригинальный способ подобрать кольцо @stas.yornik

Improve AGENTIC AI (Princeton)

code_your_own_AI

Переглядів 3 484

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 7 вер 2024

КОМЕНТАРІ • 13

@Kriss-studios Місяць тому ⁺⁴
StoneAge, IronAge, ModernAge, AgenticAge❤
@davidwynter6856 Місяць тому
Thank you, I read a lot of papers working full time on GenAI projects, but missed the ground changing paper you presented. A comment on the economics of GenAI, it is clear to me that new models like Jamba, with linear complexity, with their equivalent performance to Transformer based LLMs, with quadratic complexity, will come to the fore. I have experience using Ray Tune, so that will be my optimizer :)
@user-vs3tt8xc6j Місяць тому
Regarding the comparison of complex agents and retry.
Did the agents provide 1 answer or a choice from the top 10?
It is incorrect to compare the top 1 with the top 10.
I would like to see a comparison of the top 1. After all, in practical tasks, I most often need one specific correct answer, not a bunch of answers among which there is a correct one.
Also, the agent explains its actions. They are divided into stages. It's easier to find errors in its reasoning. All else being equal, this can be an extremely important criterion for solving the task.
@code4AI Місяць тому
Some commercial agents can be black boxes. And it is not uncommon, that agents perform internal majority voting to present the "correct" answer to you, an answer with the highest probability score. As with the example of SWE, I can't follow several hundred of thousand tokens for a $4 run.
@user-vs3tt8xc6j Місяць тому
@@code4AI It seems that the agent-based approach does not improve the reasoning capabilities of networks, BUT:
It allows for the decomposition of reasoning into stages, the correctness of which can be verified by instrumental means (checking the validity of the logical construction, code compilation, passing tests, etc.).
It allows for an increase in the length of the correct reasoning chain, i.e., to improve the perplexity of the response in a long context. For example, to write a coherent, logically, and stylistically correct book.
And the complexity of real tasks lies precisely in their multi-stage nature. This involves a long context of reasoning and actions, the correctness of which needs to be maintained. Are agent systems evaluated by the right benchmarks?
However, I do have questions about the feasibility of agent systems. Won't they be eventually overtaken by LLMs that can maintain a very long context and independently generate requests for various actions?
Are there any fundamental reasons to consider the agent-based approach as something unique and irreplaceable in the near future?
@user-vu4or4ih8p Місяць тому
Thanks
@user-de9hv2gu9z Місяць тому
very insightful! thanks
@code4AI Місяць тому
Thank you.
@ProgressRobotics Місяць тому
Can I do optimization on langgraph agents?
@code4AI Місяць тому
You can run an optimization on almost any system ...
@christopherc168 Місяць тому
Get out of my bubble
@code4AI Місяць тому
See you.

Наступне

Автоматичне відтворення

NEW CriticGPT by OpenAI: RLHF + FSBS

NEW CriticGPT by OpenAI: RLHF + FSBS

GraphRAG or SpeculativeRAG ?

GraphRAG or SpeculativeRAG ?

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"

Губин вернулся? Творческий вечер певца

Губин вернулся? Творческий вечер певца

Statue of Liberty Helps Blind Man Cross Road #shorts

Statue of Liberty Helps Blind Man Cross Road #shorts

Оригинальный способ подобрать кольцо @stas.yornik

Оригинальный способ подобрать кольцо @stas.yornik

大家都拉出了什么#小丑 #shorts

大家都拉出了什么#小丑 #shorts

AI Game Theory explained for Multi-Agents

AI Game Theory explained for Multi-Agents

🔴 This Agentic AI Workflow Will Take Over 🤯 Algorithm + Papers Explained

🔴 This Agentic AI Workflow Will Take Over 🤯 Algorithm + Papers Explained

"Don't Learn to Code, But Study This Instead..." says NVIDIA CEO Jensen Huang

"Don't Learn to Code, But Study This Instead..." says NVIDIA CEO Jensen Huang

NEW AGENTLESS AI Software Development

NEW AGENTLESS AI Software Development

Find the research gap with AI in ONE day: Groundbreaking new process

Find the research gap with AI in ONE day: Groundbreaking new process

Andrew Ng On AI Agentic Workflows And Their Potential For Driving AI Progress

Andrew Ng On AI Agentic Workflows And Their Potential For Driving AI Progress

No Priors Ep. 80 | With Andrej Karpathy from OpenAI and Tesla

No Priors Ep. 80 | With Andrej Karpathy from OpenAI and Tesla

NEW TextGrad by Stanford: Better than DSPy

NEW TextGrad by Stanford: Better than DSPy

AI Research - 40 New Papers this Monday

AI Research - 40 New Papers this Monday

Throwing Swords From My Blue Cybertruck

Throwing Swords From My Blue Cybertruck

У ГОРДЕЯ ПОЖАР в ОФИСЕ!

У ГОРДЕЯ ПОЖАР в ОФИСЕ!

ВОЛОДИМИР ДАНТЕС В КЛУБІ ДИЛЕТАНТІВ #40

ВОЛОДИМИР ДАНТЕС В КЛУБІ ДИЛЕТАНТІВ #40

Russian soldier catches Ukraine FPV drone with his bare hands and runs with it

Russian soldier catches Ukraine FPV drone with his bare hands and runs with it

Никогда не Спасай АДМИНА на Сервере и Вот Почему... #майнкрафт

Никогда не Спасай АДМИНА на Сервере и Вот Почему... #майнкрафт

A$AP Rocky - Tailor Swif (Official Video)

A$AP Rocky - Tailor Swif (Official Video)

skibidi toilet 77 (part 2)

skibidi toilet 77 (part 2)

大家都拉出了什么#小丑 #shorts

大家都拉出了什么#小丑 #shorts