Deep Dive Into The Toolformer

  • Published Oct 4, 2024

COMMENTS • 2

  • @KennethFeur 7 months ago +1

    The big downside of this approach is that you have to fine-tune the LLM every time you want to add a new tool, and fine-tuning is complicated. It's much easier to use constrained-generation frameworks like sglang or guidance.
    And if you are restricted to a small LLM, you can always fine-tune it the regular way and use it with sglang. Actually, it would be interesting to see which would win: Toolformer or a regular fine-tuned transformer + sglang.

    • @oxen-ai 7 months ago +1

      Totally agree! I think it's a good framework for thinking about how an LLM could learn to use tools, but to be practical you need to let it pick arbitrary tools from a codebase or toolchain without fine-tuning each time.
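
A minimal sketch of the alternative the commenters describe: instead of fine-tuning the model Toolformer-style, constrain decoding so the model can only emit one of a fixed set of tool names, then free-generate the arguments. The sketch assumes SGLang's Python frontend (`sgl.function`, `sgl.select`, `sgl.gen`, a local runtime endpoint); the endpoint URL, tool names, and prompt format are illustrative assumptions, not a reproduction of anything in the video.

```python
import sglang as sgl

# Pick a tool via constrained generation, no fine-tuning of the LLM required.
@sgl.function
def pick_tool(s, question):
    s += "Question: " + question + "\n"
    # sgl.select restricts decoding to exactly one of the listed strings.
    s += "Tool: " + sgl.select(
        "tool", choices=["Calculator", "Calendar", "WikiSearch", "QA"]
    ) + "\n"
    # sgl.gen free-generates the tool arguments, stopping at end of line.
    s += "Arguments: " + sgl.gen("args", max_tokens=32, stop="\n")

if __name__ == "__main__":
    # Assumes an SGLang server is running locally; the URL is illustrative.
    sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))
    state = pick_tool.run(question="What is 1400 divided by 400?")
    print(state["tool"], state["args"])  # e.g. "Calculator" and "1400 / 400"
```

Adding a new tool here is just another string in the `choices` list, which is the practical advantage the commenters are pointing at; the open question they raise is whether a Toolformer-style fine-tuned model would choose and invoke tools more accurately than an off-the-shelf model steered this way.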