Automating Enterprises With Foundation Models - Avanika Narayan & Michael Wornow | Stanford MLSys#99

Stanford MLSys Seminars

2,032 views

Published Sep 12, 2024
Episode 99 of the Stanford MLSys Seminar Series!
Teaching LLMs to Use Tools at Scale
Speakers: Avanika Narayan & Michael Wornow
Abstract:
Automating enterprise workflows could unlock $4 trillion/year in productivity gains. Despite being of interest to the data management community for decades, the ultimate vision of end-to-end workflow automation has remained elusive. Current solutions rely on process mining and robotic process automation (RPA), in which a bot is hard-coded to follow a set of predefined rules for completing a workflow. Through case studies of a hospital and a large B2B enterprise, we find that the adoption of RPA has been inhibited by high set-up costs (12-18 months), unreliable execution (60% initial accuracy), and burdensome maintenance (requiring multiple FTEs). Multimodal foundation models (FMs) such as GPT-4 offer a promising new approach for end-to-end workflow automation given their generalized reasoning and planning abilities. To study these capabilities, we propose ECLAIR, a system to automate enterprise workflows with minimal human supervision. We conduct initial experiments showing that multimodal FMs can address the limitations of traditional RPA with (1) near-human-level understanding of workflows (93% accuracy on a workflow understanding task) and (2) instant set-up with a minimal technical barrier (based solely on a natural language description of a workflow, ECLAIR achieves end-to-end completion rates of 40%). We identify human-AI collaboration, validation, and self-improvement as open challenges, and suggest ways they can be solved with data management techniques.
--
Stanford MLSys Seminar hosts: Avanika Narayan, Benjamin Spector, Michael Zhang
Twitter:
/ avanika15
/ bfspector
/ mzhangio
--
Check out our website for the schedule: mlsys.stanford.edu
Join our mailing list to get weekly updates: groups.google....
#machinelearning #ai #artificialintelligence #systems #mlsys #computerscience #stanford

COMMENTS • 2

@samsgregson 3 months ago
What is the paper being referred to at 55:40? "Step"?
@laurenpinschannels 4 months ago
related to this, I'd recommend looking up the story "a disneyland without children", by strataoftheworld.
