LoRA Bake-off: Comparing Fine-Tuned Open-source LLMs that Rival GPT-4

  • Published Dec 31, 2024

COMMENTS •

  • @ml-simplified
    6 months ago

    @55:12: Wouldn't it be more appropriate to use the instruction format of the underlying LLM (or whatever format it expects) instead of relying on a customized instruction format? You can use the same prompt, but the formatting should follow the underlying LLM.
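    The commenter's point can be sketched as follows: keep one prompt, but wrap it in each model family's native instruction format rather than a single custom one. The `[INST]` wrapper is Mistral-Instruct's published format and `### Instruction:` is the Alpaca-style format; the function name and family labels are illustrative.

    ```python
    def format_prompt(instruction: str, model_family: str) -> str:
        """Wrap one instruction in the given model family's native format.

        Illustrative sketch: real pipelines would use the tokenizer's own
        chat template rather than hand-rolled strings.
        """
        if model_family == "mistral-instruct":
            # Mistral-Instruct expects [INST] ... [/INST] wrapping
            return f"<s>[INST] {instruction} [/INST]"
        if model_family == "alpaca":
            # Alpaca-style instruction-tuned models expect this layout
            return f"### Instruction:\n{instruction}\n\n### Response:\n"
        raise ValueError(f"unknown model family: {model_family}")
    ```

    The same `instruction` string then flows through both branches unchanged; only the wrapper differs per base model.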

  • @AhmedKachkach
    9 months ago +1

    This is great! The slide comparing base performance vs. performance after fine-tuning alone makes this exercise worthwhile: it shows that the differences between foundation models are not *that* large, and that pure prompting is not sufficient to reach good performance (and once you fine-tune, most differences between base models disappear; though the Mistral models do seem to be significantly ahead!).
    Thanks for putting this together! If you're considering a similar comparison in the future, I'd be curious to see the effect of int4 quantization (with and without Quantization-Aware Training) on prediction quality. It's hard to find proper experiments testing this; most evals report latency alone without a proper analysis of the quality cost (and how to reduce it, e.g. with QAT).
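    As a rough illustration of the quality cost the commenter is asking about, here is a minimal sketch of absmax int4 quantization of a weight vector and the round-trip error it introduces. This is a pure-Python toy, not any library's scheme: production int4 formats (e.g. in bitsandbytes) use per-block scales and more sophisticated encodings.

    ```python
    def quantize_int4(weights):
        """Absmax-quantize floats to signed 4-bit ints in [-8, 7].

        Returns (quantized ints, scale). Illustrative sketch only.
        """
        scale = max(abs(w) for w in weights) / 7.0
        q = [max(-8, min(7, round(w / scale))) for w in weights]
        return q, scale

    def dequantize_int4(q, scale):
        """Map 4-bit ints back to approximate floats."""
        return [v * scale for v in q]

    weights = [0.42, -1.37, 0.05, 0.91, -0.66]
    q, scale = quantize_int4(weights)
    recovered = dequantize_int4(q, scale)
    # Round-trip error is bounded by half a quantization step (scale / 2);
    # that per-weight error is the "quality cost" QAT tries to train around.
    max_err = max(abs(w - r) for w, r in zip(weights, recovered))
    ```

    QAT simulates exactly this round trip during fine-tuning so the model learns weights that survive it, which is why comparing with and without QAT (as the commenter suggests) isolates the recoverable part of the quality loss.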

  • @jeffg4686
    9 months ago +1

    @5:08 - 😂😂😂