The Complete Guide to Building AI Agents for Beginners

How does OpenAI Function Calling work?

SDK for Stellar Qt

Что будет если украсть в магазине шоколадку 🍫

Правильный подход к детям

Как найти себе жену? Больше - тут @stas.yornik.shorts

"Make Agent 10x cheaper, faster & better?" - LLM System Evaluation 101

AI Jason

Переглядів 19 022

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 13 січ 2025

КОМЕНТАРІ • 31

@Jim-ey3ry 7 місяців тому ⁺²⁵
This is gold, most of people just show you how to build toy demo, but not many actually get into details of how to get into production; Thank you Jason!
@xXWillyxWonkaXx 7 місяців тому
Couldnt agree more. This is gold.
@tkp2843 7 місяців тому ⁺⁵
This is great. Loved the use of firecrawl (as a scrape tool) to get the website's data. Feel like it always helps improve the model output quality. Cheers!
@kenchang3456 7 місяців тому ⁺⁵
Way excellent video that goes well beyond demo. Thank you very much for this guidance.
@jasonfinance 7 місяців тому ⁺³
Amazing work as always Jason!
@darrenhinde2971 7 місяців тому
Been looking for more detail on eval on LLMs and been scratching around for a while. Thanks for this.
@apereiracv 7 місяців тому ⁺⁸
I recently be created a whole testing system for our LLM chatbots and we did exactly this:
LLM as evaluator and code
We created it as a series of unit tests with LLM generated cases.
Since our results were mostly conversational, we made tests pass/fail according to a scoring system
@contractorwolf 7 місяців тому
goddamn Jason your videos just blow my mind each time. Thanks for such a thorough explanation and example.
@kayshidow 7 місяців тому ⁺¹
I've used promptfoo for some of my test with local llm to test the ai workflow. It allow you to write assertion like you'll do with software
@titusblair 7 місяців тому
Awesome! Keep up the great work!
@humanish_ai 7 місяців тому ⁺¹
Finally you back 🎉
@jimmy-ef2ow 7 місяців тому ⁺¹
jason can we get another video about comfy ui?
@agenticmark 7 місяців тому ⁺¹
fine tune llama 3 (8bit) - you will get exactly the behavior you want - its what I do
@JorritvanGinkel 7 місяців тому
This is so good, thanks man!
@techfren 7 місяців тому ⁺¹
lesgooo!! ❤‍🔥❤‍🔥❤‍🔥
@someshfengade9623 7 місяців тому ⁺¹
I found langfuse metric monitoring little bit better.
@Joe-bp5mo 7 місяців тому
Sick, whats the best practice metrics for evaluating agents?
@MatrixCodeBreaker88 7 місяців тому
Great Video
@CorkyBallasdancewithme 6 місяців тому
great stuff, as new to hearing this, very interesting, can this be built by a novice . . .
@fullgazz 7 місяців тому ⁺¹
Who never spent 4 hours to save 10 min? That's our hobby spent time to save time.
@AGI-Bingo 7 місяців тому ⁺¹
If 25 people or more use it successfully then you literally gave humanity more time to live and be free
@jordanz9580 7 місяців тому
fireeee content!
@Ms.Robot. 7 місяців тому
I love how my Ai girl insults the competion with flame balls,then tells me.she loves me.❤🎉😊
@KalLif-k3i 7 місяців тому
Why not use Gemini as the LLM? It is free.
@HyperUpscale 7 місяців тому ⁺¹
Lets me share my experience about any google AI model ... because it doesn't understand human and it hallucinate way too much.
Practically ... in my cases 75% of the time what I get back is totally useless result. You cant use for anything... To be considered for evaluation ... you must be joking
@irql2 7 місяців тому
I dont see the value of "Agents". All of this stuff is easily done with basic function calling. I think I'm going to need to see some more creative use cases before I jump on board, i just dont get it yet.
@ayoubfr8660 7 місяців тому
Maybe we can discuss this, I am trying to jump on in but not until I find a decent idea to apply.
@symbol9new 7 місяців тому
when your assistant has a lot of functions, he starts giving out hallucinations, have you ever encountered this?
@SydneyF-eg5lt 7 місяців тому
Good content but so hard to listen to his Engrish. Monotonous Pitch n sped up delivery didn’t seem to help either.

Наступне

Автоматичне відтворення

The Complete Guide to Building AI Agents for Beginners

The Complete Guide to Building AI Agents for Beginners

How does OpenAI Function Calling work?

How does OpenAI Function Calling work?

SDK for Stellar Qt

SDK for Stellar Qt

Что будет если украсть в магазине шоколадку 🍫

Что будет если украсть в магазине шоколадку 🍫

Правильный подход к детям

Правильный подход к детям

Как найти себе жену? Больше - тут @stas.yornik.shorts

Как найти себе жену? Больше - тут @stas.yornik.shorts

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

The REAL cost of LLM (And How to reduce 78%+ of Cost)

The REAL cost of LLM (And How to reduce 78%+ of Cost)

Make your agents 10x more reliable? Flow engineer 101

Make your agents 10x more reliable? Flow engineer 101

EASIEST Way to Train LLM Train w/ unsloth (2x faster with 70% less GPU memory required)

EASIEST Way to Train LLM Train w/ unsloth (2x faster with 70% less GPU memory required)

Better than Cursor? Future Agentic Coding available today

Better than Cursor? Future Agentic Coding available today

INSANELY Fast AI Cold Call Agent- built w/ Groq

INSANELY Fast AI Cold Call Agent- built w/ Groq

This Social Media AI System Creates Unique Content Daily! (100% Automated)

This Social Media AI System Creates Unique Content Daily! (100% Automated)

Unlock AI Agent real power?! Long term memory & Self improving

Unlock AI Agent real power?! Long term memory & Self improving

“Wait, this Agent can Scrape ANYTHING?!” - Build universal web scraping agent

“Wait, this Agent can Scrape ANYTHING?!” - Build universal web scraping agent

Qwen Just Casually Started the Local AI Revolution

Qwen Just Casually Started the Local AI Revolution

СОЛДАТ КНДР: ВТЕЧА/ВІЙНА В УКРАЇНІ/10 РОКІВ ШПИГУВАВ У ПІВНІЧНІЙ КОРЕЇ/ТОРГУЮТЬ НАРКОТИКАМИ І ЗБРОЄЮ

СОЛДАТ КНДР: ВТЕЧА/ВІЙНА В УКРАЇНІ/10 РОКІВ ШПИГУВАВ У ПІВНІЧНІЙ КОРЕЇ/ТОРГУЮТЬ НАРКОТИКАМИ І ЗБРОЄЮ

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

Син ПОВАЛІЙ ПЛЮНУВ ЇЙ в ОБЛИЧЧЯ! Скандальне ПРИВІТАННЯ для ЗРАДНИЦІ! | OBOZ.LIFE

Син ПОВАЛІЙ ПЛЮНУВ ЇЙ в ОБЛИЧЧЯ! Скандальне ПРИВІТАННЯ для ЗРАДНИЦІ! | OBOZ.LIFE

REAL or FAKE? #beatbox #tiktok

REAL or FAKE? #beatbox #tiktok

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

СКАНДАЛЬНЫЙ бой Али, когда в ринге ему противостояли сразу ДВОЕ #shorts

СКАНДАЛЬНЫЙ бой Али, когда в ринге ему противостояли сразу ДВОЕ #shorts

Cat mode and a glass of water #family #humor #fun

Cat mode and a glass of water #family #humor #fun

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments