I believe you are the hero every AI engineer needs. Unlike most YouTubers who copy and paste code from documentation, you address the real problems AI engineers face.
Appreciate that!
So good at simplifying concepts in these tutorials. Loved this Dave!
Thanks Brock 🙏🏻
Fucking awesome. Best AI youtuber for me. Keep up the good work, Dave!
Very helpful, Dave! Many companies use naive chunking because there are so many examples of it on the web, in YouTube videos, etc. You gave us a very good way to do smarter chunking and get more useful results. This is the future for RAG use cases.
Thanks Clarke!
Your video is detailed and very helpful, thank you for these type of techniques.
Thanks a lot for sharing this knowledge, it's really useful!
Good stuff! We use the exact same technique with markdown-based chunking and extra metadata for the chunks. Works really well!
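A minimal sketch of the markdown-based chunking with metadata this comment describes (the function and the heading-as-metadata scheme are illustrative assumptions, not the commenter's actual code):

```python
import re

def chunk_markdown(text):
    """Split a markdown document on level-1/2 headings and attach the
    heading as chunk metadata, so chunks can be filtered at query time.
    Illustrative sketch only."""
    chunks = []
    current_heading = "Untitled"
    buffer = []
    for line in text.splitlines():
        match = re.match(r"^(#{1,2})\s+(.*)", line)
        if match:
            if buffer:  # flush the section collected so far
                chunks.append({"heading": current_heading,
                               "text": "\n".join(buffer).strip()})
                buffer = []
            current_heading = match.group(2)
        else:
            buffer.append(line)
    if buffer:  # flush the final section
        chunks.append({"heading": current_heading,
                       "text": "\n".join(buffer).strip()})
    return chunks

doc = "# Intro\nSome intro text.\n## Results\nQ4 revenue grew 12%."
for c in chunk_markdown(doc):
    print(c["heading"], "->", c["text"])
```

Keeping the heading alongside each chunk is what makes the extra-metadata part work: the retriever can boost or filter chunks whose section title matches the query.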
I think this is currently the best approach for RAG.
great - thank you for sharing:) Please explore the topic more - )
Super helpful. Can you please make a tutorial on how to use AWS Textract too?
Hi Dave, thanks for the awesome content. A client came to me for a RAG solution; he has a library of hundreds of thousands of pages (about 60 GB). The simplest RAG techniques don't seem to work for this case. I came up with a solution using a hybrid retriever and a reranker with llama-index; the results were good but not perfect. If you were me, how would you tackle this problem?
We are working on a solution for this that can be white-labeled on release! Does your client have an API endpoint or some kind of bucket containing all the files? It really depends on what formats the data comes in. If it is just text, then you can use a hybrid approach with semantic chunking, parent-document retrieval, or other metadata-filtering techniques. The main point is to make sure the data is pre-processed and cleaned before being chunked and embedded. Entity extraction is expensive but can be very helpful. A second-best option is to extract metadata. One is used for semantic extraction (entities) and the other for additional filtering.
GraphRAG is the best solution, using entities, but it costs a massive amount of resources and development time, making it accessible only to enterprise clients ($10-50k+).
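A toy sketch of the parent-document retrieval mentioned above: search over small child chunks, but return the larger parent section for context. All names and data here are hypothetical, and simple term overlap stands in for a real embedding search:

```python
# Parent sections: large spans of the source document (placeholders).
parents = {
    "p1": "Full annual-report section on revenue ...",
    "p2": "Full annual-report section on staffing ...",
}
# Child chunks: small, precisely matchable pieces, each pointing at a parent.
children = [
    {"parent": "p1", "text": "q4 revenue rose 12 percent"},
    {"parent": "p1", "text": "q3 revenue was flat"},
    {"parent": "p2", "text": "headcount grew in engineering"},
]

def retrieve_parent(query):
    """Score children by term overlap (a stand-in for embedding
    similarity) and return the best child's parent document."""
    q_terms = set(query.lower().split())
    best = max(children, key=lambda c: len(q_terms & set(c["text"].split())))
    return parents[best["parent"]]

print(retrieve_parent("how did q4 revenue change"))
```

The small chunks keep retrieval precise, while handing the LLM the whole parent section avoids losing surrounding context.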
Hey Dave, really nice video. I was wondering if I could help you with higher-quality, more engaging editing while maintaining a consistent brand colour for your YouTube channel, which could get you more engagement on your videos and build your unique personal brand. Please let me know what you think.
Thanks I'm a newbie and your videos helped get me started. Can you please also share pdf_ingester?
Enjoyed this. Thank you.
Have you tried passing the PDF to Jina Reader API? The Markdown output is quite clean too! (but it's only usable for public documents)
Could you do a GraphRAG tutorial?
Awesome video, but where can we find the `from config.settings import get_settings` module?
Will try this with Textract. For my use case I am just sending a CSV (exported from an Excel file) and it's working, but I think that is not a systematic, luck-proof way. Do you think a RAG approach would be better, i.e. less prone to context- and structure-related hallucinations?
could you please share the pdf_ingester code too. I would like to play around with it
Does it work with scanned pdf docs?
Yes!
This seems similar to GraphRAG. What is the difference?
GraphRAG is a more powerful solution than this baseline RAG. In GraphRAG, the data is stored in a graph with entities and relationships, and detailed community summaries are also built, which excels in the retrieval flow. For example, a question like "Did the company underperform in Q4 vs Q3?" would be difficult to answer using baseline RAG but can be answered easily using GraphRAG.
@@sahiljain9376 You can enhance RAG with agentic frameworks to handle these questions, e.g. an SQL agent with metadata filtering. I love GraphRAG, but it (a) is super expensive, since entity extraction requires a ton of LLM calls, (b) takes a lot of time to set up the graph, and (c) has additional challenges to overcome before it can really be used outside the enterprise.
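A toy illustration of the metadata-filtering idea in this reply: if chunks are tagged with a quarter, plain RAG tooling can answer a "Q4 vs Q3" comparison by fetching both sides explicitly, without a knowledge graph. The field names and figures are entirely hypothetical:

```python
# Chunks tagged with structured metadata at ingestion time (made-up data).
chunks = [
    {"quarter": "Q3", "metric": "revenue", "value": 10.0},
    {"quarter": "Q4", "metric": "revenue", "value": 9.1},
    {"quarter": "Q4", "metric": "headcount", "value": 120},
]

def fetch(quarter, metric):
    """Metadata-filtered lookup: return the value for a
    (quarter, metric) pair, or None if absent."""
    for c in chunks:
        if c["quarter"] == quarter and c["metric"] == metric:
            return c["value"]
    return None

# Answer the comparison by retrieving each side separately.
q3, q4 = fetch("Q3", "revenue"), fetch("Q4", "revenue")
print("underperformed" if q4 < q3 else "did not underperform")
```

An SQL agent does the same thing at scale: the LLM translates the comparison question into filtered queries rather than hoping one retrieved chunk contains both quarters.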
@@sahiljain9376 I was unaware of GraphRAG, and it looks really interesting, thanks. It looks like it's beyond my skill level for now, but hopefully MS integrates it into Azure soon.