Why your ChatBot is stupid - Focus on the important parts! - The ultimative RAG-Tierlist

Coding Crash Courses

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 14 лис 2024

КОМЕНТАРІ • 41

@codingcrashcourses8533 2 місяці тому ⁺¹
You see anything different? Change my mind in the comments ;-)
@itsalexpaul 2 місяці тому ⁺³
This helped me a lt sorting some concepts in my brain. You have a huge talent for explaining things simply. It's clear that you have real hands-on experience. keep it up! 🥂
@codingcrashcourses8533 2 місяці тому
Thank you :]
@KristianSchou1 2 місяці тому ⁺³
As a person who is developing an internal RAG application, multiquerying/rephrasing seems useless for one who understands the system, but I've found that users, especially new GenAI users, love it as it lowers the entrybarrier for usage.
"Remember the stand user could be anyone"
@codingcrashcourses8533 2 місяці тому
For multiquery i totally agree (that why I put it into D tier). But rephrasing is not useless. How do you handle follow up questions without rephrasing in a RAG System?
@KristianSchou1 2 місяці тому
@@codingcrashcourses8533 it is very early days, im primarily an SCADA engineer and this is a side project at work, so it dosen't handle multiple queries at the time.
The project scope is iterating through BID/NOBID questions for every tender we plan to bid on, and then use some LDA magic to sort chunks into project roles to minimize wasted time reading through materials
@davidtindell950 2 місяці тому ⁺¹
Thank You ! A good and very timely review !!!!
@bastienmoenaert6940 2 місяці тому
Hey man, love yor vids!! I am trying to build my own RAG application and with my background in backend/data engineering I think I'll be able to come up with a pretty good RAG system. My big problem is building I nice front end, any plans on doing a tutorial (or udemy course) on building professional front ends to go with a RAG application?
@codingcrashcourses8533 2 місяці тому
No sorry I don´t plan on that. There are tons of tutorials on building frontends from people who are much better at that than I am. I would learn that somewhere else and than you will be good enough to come up with solutions for that on your own ;-).
@Investwithwisdome 2 місяці тому
I hope you're doing well! I’ve been really excited about your LangGraph course and was wondering if you could provide an estimated release date. I’m looking forward to diving into it!
Thanks so much!
@codingcrashcourses8533 2 місяці тому
Hello! Sorry for the late response, there is currently a lot of spam on my channel. My estimation is the beginning of november, since I will have one week holiday then
@juanes1793 2 місяці тому
You’ve helped me a lot by explaining all these topics related to RAG so well :) . Keep it up, you're a pro, man! I have a question: How is it possible to keep a Langchain course updated knowing that almost every week new things come out in the documentation or others become outdated? You are the best :D
@codingcrashcourses8533 2 місяці тому
Thank you! It´s actually not possible, especially since I a have a full time job and a life beside my job and this channel. I currently work on an LCEL Deep dive update for my first course. My courses do not cover EVERYTHING too, I think good courses are not bloated and don´t cover every single edge case or in the LangChain World the integration of 1000 different VEctorstores or AI models.
@pabloguatibonza5992 2 місяці тому ⁺¹
Great Video ! Which vector store would you recommend to use? I was using supabase, but I will use azure OpenAI, so I’m not sure if change to an azure vector store
@codingcrashcourses8533 2 місяці тому ⁺²
You can use Azure AI Search which is fine. I use PgVector as a container there, since it makes me independent from the cloudprovider.
@pabloguatibonza5992 2 місяці тому
@@codingcrashcourses8533 thank you very much :D
@arslanabid2245 2 місяці тому
Thankyou soo much, I really needed this video. It just cleared so much confussions.
@codingcrashcourses8533 2 місяці тому
@@arslanabid2245 thank you for that comment:)
@rorycawley 2 місяці тому
Incredible content! Danke!
@codingcrashcourses8533 2 місяці тому
Danke für den comment :)
@dumbol8126 2 місяці тому
what would rate 4o-minis chunking capabilities, what about distilling some of 4o's chunking to a smaller opensource model wonder if that would work and make a lot cheaper
@dumbol8126 2 місяці тому ⁺¹
since small models would require like exponential (or quadratically?) less vram and compute to make the chunks i think it would be a lot cheaper
@tanyongsheng4561 2 місяці тому
What about technique like query decomposition?
@codingcrashcourses8533 2 місяці тому ⁺¹
Better than something like Multiquery! But I have not experimented too much with it.
@micbab-vg2mu 2 місяці тому
Thank you - I will try your courses -)
@codingcrashcourses8533 2 місяці тому
thought you are already subscribed? ;)
@張博文-n6l 2 місяці тому
S:
rawdata, chat model
A:
document chunking,
prompt enginnering
GuardraiIs
B:
embedding model
agent RAG
rephrasing
tool calling
document compression
C :
vector store
reranking
D:
base routing
Multiquery Retrieval
RAPTOR
graphRAG
finetuning
@Sar2393 2 місяці тому ⁺¹
You mentioned gpt4o is a great semantic chunker, please can you share the notebook for the same
@codingcrashcourses8533 2 місяці тому ⁺²
That´s something I covered in-depth in my udemy course, where I build a custom chunker on top of the LangChain interface.
@Mostafa_Sharaf_4_9 2 місяці тому
@@codingcrashcourses8533 please make a free video here on youtube about that topic (gpt4o as a great semantic chunker) 😁😁❤❤
@ayushjhabak2851 2 місяці тому
with the new hnsw indexing, supabase pg vector seems better than pinecone and for scalability too , or am i missing something ?
@codingcrashcourses8533 2 місяці тому
Interesting question, but I honestly can´t answer it. I never ran into issues with my normal PgVector and I doubt the current similarity search approach is suited to support that many docs that hnsw indexing or not really matters. But I am not 100% sure about that.
@kukubanana 2 місяці тому
So it is better to use llm to for intent classification and entities extraction instead of using tool like RASA?
@codingcrashcourses8533 2 місяці тому
Maybe - wd actually got rid of rasa:)
@ramishelh9998 2 місяці тому
Hi, I am a big fan of your youtube videos. Wanted to ask if you offer any consultation / project dev services?
@codingcrashcourses8533 2 місяці тому
Sorry, I only make youtube videos and courses, no consulting or active development.
@rgolanng 2 місяці тому
graph rag like trying to apply NP hard problem to production, should be F tier..
@codingcrashcourses8533 2 місяці тому
The software I used had nothing below D ;-)
@harshithdr3952 2 місяці тому
Well informative video. But I will recommend some good images, instead of shitty AI generated images which are unreadable.....
@codingcrashcourses8533 2 місяці тому
Get your point

Наступне

Автоматичне відтворення

Local Agentic RAG with LLaMa 3.1 - Use LangGraph to perform private RAG