Get your copy of "Building LLMs for Production": amzn.to/4bqYU9b
RAG is just 'full text indexing' on the local data with the ranked results fed into the context window and sent to the LLM along with the question.
As a database guy for the last 30 years, every time I see it described, all I see are new words describing long-solved problems.
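For illustration, here is a minimal sketch of the pattern that comment describes: rank local chunks against the question, keep the top few, and build the prompt that gets sent to the LLM. The keyword-overlap ranking is a stand-in; a real system would use BM25 (as Elasticsearch does) or vector embeddings.

```python
# Minimal sketch of "ranked retrieval -> context window -> LLM".
# Ranking here is naive keyword overlap, purely for illustration.

def score(chunk: str, query: str) -> int:
    # Rough relevance score: how many query words appear in the chunk.
    return sum(word.lower() in chunk.lower() for word in query.split())

def build_prompt(query: str, chunks: list[str], top_k: int = 3) -> str:
    # Keep the best-matching chunks and stuff them into the prompt.
    ranked = sorted(chunks, key=lambda c: score(c, query), reverse=True)
    context = "\n\n".join(ranked[:top_k])
    return (
        "Use the context below to answer the question.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )

# The returned prompt is then sent to whichever LLM client you use.
```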
You mean like how Elasticsearch does indexing?
Well, new cars have wheels, a technology that has existed for thousands of years. That doesn't mean new cars are 'obsolete'; using old tech to improve a new one is a great way of doing engineering!
What happens to the information retrieved by RAG if the original request already occupies the entire context window?
It depends on how the system is implemented! Most will put a check in place to detect it and then summarize or extract key points to make the retrieved content shorter.
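A rough sketch of one such guard, assuming a tokenizer like tiktoken; the window size and the crude key-point extractor below are placeholders (a real system would often call the LLM itself to summarize).

```python
# Sketch: if the request plus retrieved passages would overflow the context
# window, compress the passages before building the prompt.
import tiktoken

MAX_CONTEXT_TOKENS = 8192  # assumed window size; depends on the model
enc = tiktoken.get_encoding("cl100k_base")

def key_points(chunk: str, max_sentences: int = 2) -> str:
    # Stand-in summarizer: keep only the first sentences of each chunk.
    return ". ".join(chunk.split(". ")[:max_sentences])

def fit_context(request: str, retrieved: list[str]) -> str:
    # Token budget left over after the user's request.
    budget = MAX_CONTEXT_TOKENS - len(enc.encode(request))
    context = "\n\n".join(retrieved)
    if len(enc.encode(context)) > budget:
        # Too long: fall back to summaries / key points instead of raw text.
        context = "\n\n".join(key_points(chunk) for chunk in retrieved)
    return context
```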
Great video. Would you make a video on the different types of RAG? Or on how to prepare data for RAG, for example when your document has tables, math formulas, or references to images? I haven't seen much content about how to handle diverse data inside a document with RAG.
Cheers
Great idea, thank you! Will definitely look into multimodal RAG! :)
I think this is the best video I have seen on this topic. Wanted to ask if we can use RAG offline, maybe with a Mistral model?
Of course you can host everything locally if you have the capacity! :)
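As a hedged sketch of what a fully local setup could look like: local embeddings via sentence-transformers plus a local Mistral model served by Ollama. The model name, endpoint, and document list below are assumptions (it presumes `ollama pull mistral` has been run and the default server is up at localhost:11434).

```python
# Fully offline RAG loop: local embeddings + a local Mistral model via Ollama.
import requests
from sentence_transformers import SentenceTransformer, util

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # runs locally, no API key

docs = ["...your local documents, split into chunks..."]  # placeholder data
doc_emb = embedder.encode(docs, convert_to_tensor=True)

def local_rag(question: str, top_k: int = 3) -> str:
    # Embed the question and find the most similar local chunks.
    q_emb = embedder.encode(question, convert_to_tensor=True)
    hits = util.semantic_search(q_emb, doc_emb, top_k=top_k)[0]
    context = "\n\n".join(docs[h["corpus_id"]] for h in hits)
    prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    # Send the prompt to the local Ollama server (assumed default endpoint).
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={"model": "mistral", "prompt": prompt, "stream": False},
    )
    return resp.json()["response"]
```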
By any chance, do you know which RAG system/framework gives the best performance?
In our work we like to use LlamaIndex for many parts and adapt our own code for more personalized settings!
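For reference, a minimal LlamaIndex example looks roughly like this; the import paths are for recent releases and change between versions, and it assumes an LLM is configured (by default an OpenAI key in the environment).

```python
# Minimal LlamaIndex sketch: load local files, index them, then query.
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("data").load_data()   # load files from ./data
index = VectorStoreIndex.from_documents(documents)      # embed + index them
query_engine = index.as_query_engine()                  # retrieval + LLM answer
print(query_engine.query("What does the document say about X?"))
```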
Very Informative and useful!! Thanks
Thanks but what's the point of sound effects?
Truly excellent video!
Wow. Thanks a lot for this amazing explanation
Thanks , very clear excellent explanation
Thank you! :)
Now I understand what RAG (Retrieval Augmented Generation) is. Very informative video, liked it 👍
Great video, straight to the point. Thanks again
Thank you Sabri! :)
How do you protect a company's information with this technology?
You'd only provide placeholders for the company's information within these prompts and make sure they follow a specific format.
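A small sketch of that placeholder idea, assuming simple regex patterns for the sensitive fields; the patterns and the internal ID format are hypothetical, and a real setup would use a proper PII/secret-detection step.

```python
# Swap sensitive values for placeholders before the text reaches the LLM,
# then map them back into the answer afterwards.
import re

PATTERNS = {
    "EMAIL": r"[\w.+-]+@[\w-]+\.[\w.]+",
    "ACCOUNT_ID": r"\bACC-\d{6}\b",  # hypothetical internal ID format
}

def redact(text: str) -> tuple[str, dict]:
    # Replace each detected value with a labeled placeholder and remember it.
    mapping = {}
    for label, pattern in PATTERNS.items():
        for i, match in enumerate(re.findall(pattern, text)):
            placeholder = f"[{label}_{i}]"
            mapping[placeholder] = match
            text = text.replace(match, placeholder)
    return text, mapping

def restore(text: str, mapping: dict) -> str:
    # Put the original values back into the model's answer.
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text
```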
Really clear and precise, thank you.
Subbed
Et cetera, of course, my friend.
thx. i enjoyed this video
Glad to hear so my friend! 😊
AI algorithms facilitate better decision-making in business by providing actionable insights from data analysis. This enhances strategic planning and operational efficiency.
After integrating with RAG, latency increased...
That is for sure! There are some downsides, but the added latency is very small.
The accent of the speaker is pretty heavy.
Hope it’s still easy to understand!
Google launched Gemini Advanced 1.5, a RAG killer 💀
A database can be much larger than the context window and much more efficient, I believe. It's unclear how good these models are vs GPT-4 yet. Plus, sending millions of tokens with every prompt would be extremely expensive per request, haha! It's good for some use cases, like sending a full repo once and asking questions about it, but not for working with customers and handling many requests, I believe.
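A back-of-the-envelope calculation of that cost point, with an assumed input-token price purely for illustration (not an actual quote for any provider):

```python
# Why stuffing a huge context into every request adds up versus RAG.
PRICE_PER_1K_INPUT_TOKENS = 0.01  # assumed price in USD, for illustration only

def cost_per_request(context_tokens: int) -> float:
    return context_tokens / 1000 * PRICE_PER_1K_INPUT_TOKENS

print(cost_per_request(1_000_000))  # ~$10 per request when sending ~1M tokens
print(cost_per_request(4_000))      # ~$0.04 per request with a few retrieved chunks
```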