What is Agentic RAG?

  • Published 11 Jan 2025

COMMENTS • 54

  • @norbertschmidt 2 months ago +20

    You make the best LLM + RAG explainers in the world. Thanks so much.

  • @lesmoe524 2 months ago +10

    Fantastic description, it seems using agents to help you route to potentially different databases is a game changer, thank you.

  • @sqweepsrussell9412 2 months ago +3

    Thanks for this well-explained video. This is the simplest explanation of agentic RAG I have seen, and it actually sank into my grey matter.

  • @ChielReemer 1 month ago +5

    The most amazing thing about all this is that he can write so easily in reverse.

  • @nbamastermind 1 month ago +2

    Excellent explanations! Simple without additional fluff. Thank you.

  • @vincenttima 26 days ago +2

    Thanks David, well done presentation.

  • @whitesaladchips 28 days ago +1

    Can someone confirm: instead of using similarity search, here we are using an LLM to decide which database is more appropriate to query? Am I correct?

  • @TiếnNguyễnMinh-t8h 12 days ago

    Thanks so much, you're talented. I'm really interested in the way you share your useful knowledge and information. I love it, best wishes to you. I'm looking forward to your videos every day. Once again, thanks a lot.

  • @jaffarbh 2 months ago +1

    Thanks. This is pretty useful and much better than the naïve approach of overloading the model with lots of irrelevant data from the vector DB.

  • @PriyeshYadav 2 months ago +3

    So the agent is also a pretrained LLM, with those 2 vector DBs?

  • @SasoCasablanca63 9 days ago

    I want to know about the display you are using. I like the way this whiteboard works and gives the illusion of being there. What tool is that?

  • @afzaalawan 13 days ago

    Excellent concept -- many thanks.

  • @averlyrolf7970 25 days ago +1

    Where are these agents installed/located? How do they get into the pipeline connected to the LLM?

  • @digambardagade288 15 days ago

    This helps when you have multiple documents or subsections and you store them in different vector databases.
    In that case, based on the user's query, the agent works out which vector database it needs to access.

  • @ErickM.Joseph 1 month ago

    Makes sense, I’ll be implementing this in our agent workflow.

  • @ssdm444 1 month ago

    Simple, clear explanation! Thanks!!

  • @sofluzik 17 days ago

    What is the logic or rule that helps the LLM decide where to go? Is it only key-value pairs, or looking for keywords, or context-based understanding?

  • @Plife-507 21 days ago

    Would this be good for training an agentic swarm for a trading system based on in-depth guides curated from PDFs?

  • @Trazynn 1 month ago

    4:23 It doesn't have to decline to answer. It is probably able to answer such questions by virtue of already being a sufficiently big LLM. It could say "Though this isn't part of our internal database (creating a clear signal that it goes beyond its domain), the World Series was won by..."

    • @shubham.1172 1 month ago

      Yes, but it's best to decline for most customer-facing use cases, right? For example, I wouldn't want my Amazon bot to answer questions about the Super Bowl; it's best if it just declines to answer. That also reduces legal exposure?

  • @sterlingabbot695 2 months ago +8

    Please do RAG + Big Data next

    • @IBMTechnology 2 months ago +1

      An interesting suggestion. Is there any specific element or question you have about the topic?

  • @AbdullahTBukhari 1 month ago

    Easy-to-understand explanation. 👍🏼

  • @Ilovepotatoes-t4t 2 months ago

    Thank you!! One question: do you know how to evaluate an agentic RAG? Do I have to take anything into account compared to a normal RAG evaluation (e.g. with RAGAs)? Best regards

  • @CDR4445 2 months ago

    We need to know how to implement the agent, and what it actually is. Is it another LLM that determines the context and routes to the right DB?

  • @imVishalKS 1 month ago

    Really good explanation.

  • @BizAutomation4U 1 month ago

    I asked Perplexity how best to decide whether separate vector DBs are preferable to one large vector DB with metadata to contextualize the pool of vector data. I guess it's tantamount to asking whether we should normalize vector tables similar to OLTP or use a data warehouse approach. It recommended the metadata approach to help the LLM decide which portion of the vector data to contextualize, which makes sense to me. Maybe it's the same thing described in this video in a different way.
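
A minimal sketch of the metadata approach described in this comment, assuming a single in-memory store where every chunk carries a `domain` tag. The `Chunk` class, the `search` helper, the tag values, and the 2-dimensional vectors are all invented for illustration and do not correspond to any particular vector database API.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    vector: list      # toy embedding (hypothetical values)
    metadata: dict    # e.g. {"domain": "hr"}


def dot(a, b):
    """Plain dot product, used here as the similarity score."""
    return sum(x * y for x, y in zip(a, b))


def search(store, query_vector, where, top_k=2):
    """Filter by metadata first, then rank the survivors by similarity."""
    candidates = [
        c for c in store
        if all(c.metadata.get(k) == v for k, v in where.items())
    ]
    ranked = sorted(candidates, key=lambda c: dot(c.vector, query_vector), reverse=True)
    return ranked[:top_k]


# One pooled store instead of separate databases (hypothetical data).
STORE = [
    Chunk("vacation policy", [1.0, 0.0], {"domain": "hr"}),
    Chunk("expense policy", [0.6, 0.4], {"domain": "hr"}),
    Chunk("market trends", [0.9, 0.1], {"domain": "industry"}),
]

# Whatever decides the routing would supply the `where` filter.
hits = search(STORE, [1.0, 0.0], where={"domain": "hr"})
print([c.text for c in hits])
```

In this arrangement the routing decision becomes a choice of filter rather than a choice of database, which is one way to read "the same thing described in a different way".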

  • @BernardoCortinaCeballos 1 month ago

    This video is so well explained, thank you.

  • @ugurkaraaslan9285 1 month ago

    Wouldn't there be a huge response delay due to the additional step?

  • @StalinDeLaTorre 2 months ago +1

    Where can I learn this approach?

  • @RohitGulati309 2 months ago

    I'm confused... Andrew Ng says just RAG is agentic AI. But RAG does not need an LLM during the query phase. So what's correct here?

  • @MikewasG 2 months ago +2

    What is the difference between this and semantic routing?

    • @IBMTechnology 2 months ago +1

      Semantic routing uses more straightforward methods like cosine similarity and other predefined rules to make the decision on which route to take. The example I drew used an LLM agent, which can understand and interpret more complex/nuanced queries, understand context, but is much heavier (compute and latency) because it is using an LLM.
      -David
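
As a rough illustration of the semantic-routing side of this comparison, here is a minimal sketch that picks a route by cosine similarity against fixed route descriptions, with no LLM involved. The route names, their descriptions, the bag-of-words `embed` function, and the 0.1 threshold are all assumptions made for the example; a real system would use a learned embedding model.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding' (an assumption, not a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Hypothetical routes, each described by a handful of representative words.
ROUTES = {
    "internal_docs": "company policy vacation remote work handbook procedures",
    "general_knowledge": "industry market news public general world",
}


def semantic_route(query: str, threshold: float = 0.1) -> str:
    """Return the best-matching route name, or 'decline' if nothing is close enough."""
    q = embed(query)
    scores = {name: cosine(q, embed(desc)) for name, desc in ROUTES.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else "decline"


print(semantic_route("what is the company vacation policy"))
print(semantic_route("hello there"))
```

An LLM agent would replace `semantic_route` with a model call that reads the query and returns a route name, trading the cheap similarity check for better handling of nuanced queries at a higher compute and latency cost.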

  • @nitingaurz 1 month ago

    Since the agent makes use of an LLM for decision making, it would be clearer to show that connection in the drawing.

  • @Agent221314 1 month ago

    Which frameworks support this? Ag2?

  • @simonthompson1099 1 month ago

    How about a citation that shows it works?

  • @Ijmeisner 2 months ago +2

    In the reflection I think that’s the teleprompter… or maybe I am hallucinating 🤣

  • @peabody3000 8 days ago

    retro encabulator vibes 😊

  • @vichupayyan 7 days ago

    Every day new concepts keep popping up. It's too hard to grasp and follow a concept before it changes into another form :)

  • @hi5wifi-s567 2 months ago +1

    “More responsible, more accurate, more adaptable,”
    plus more secure as well?

  • @sergiogonzart 28 days ago

    I'd love a recommendation for an LLM that actually admits it doesn't have an answer instead of just making something up.

  • @sjmediaonline 2 months ago +5

    IBM’s main task now is to create catch-up teaching videos. No innovation or breakthroughs are coming out of them. I see so many IBM old-timers watching with amazement how smaller open source innovators are moving lightning fast in GenAI, and then they start recording teaching videos… I do not see any new things / new demos coming out of IBM.

    • @JC-kb8vb 26 days ago +1

      Yeah. On the other hand, your company is booming… 😆

  • @AK-be7jh 2 months ago +2

    So the agent acts like a controller here.

  • @MandoDatingCoach 8 hours ago

    yo

  • @marcomaiocchi5808 2 months ago +9

    This pipeline doesn't make a lot of sense.

    • @JustinKahrs 2 months ago +1

      skill issue

    • @scycer 1 month ago +20

      Got a particular question?
      A standard RAG pipeline just takes your question, converts it into a vector, and searches the data source to find content that is similar to it.
      By adding an agent in front of it, the question can be interpreted to figure out which data sources should be used to fetch that content.
      It's like the difference between asking a single book a question and asking a librarian, who can find the right book for you before looking into it for the answer.
      It's hard with all the unique fancy names they give these architectures; they obscure the simplicity of what is really happening underneath.
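
The librarian analogy in this reply can be sketched in a few lines of Python. Everything here is hypothetical: the two sources, their names, and `stub_llm_router`, which stands in for a real LLM call with a keyword rule purely so the example runs offline. The decline branch reflects the out-of-domain behaviour discussed elsewhere in this thread.

```python
from typing import Callable, Dict, List, Optional


def search_internal(query: str) -> List[str]:
    """Placeholder for a similarity search over an internal-documents store."""
    return [f"[internal doc matching: {query}]"]


def search_general(query: str) -> List[str]:
    """Placeholder for a similarity search over a general-knowledge store."""
    return [f"[general doc matching: {query}]"]


# The "books" the librarian can choose between (hypothetical names).
SOURCES: Dict[str, Callable[[str], List[str]]] = {
    "internal": search_internal,
    "general": search_general,
}


def stub_llm_router(query: str) -> str:
    """Stand-in for an LLM that returns a source name or 'none'.

    A real agent would prompt a model with the query and the list of
    sources; this keyword rule is only an assumption for the sketch.
    """
    q = query.lower()
    if "policy" in q:
        return "internal"
    if "industry" in q or "market" in q:
        return "general"
    return "none"


def agentic_retrieve(query: str, choose_source: Callable[[str], str] = stub_llm_router) -> dict:
    """Step 1: the agent picks a source. Step 2: ordinary retrieval runs against it."""
    choice = choose_source(query)
    retriever: Optional[Callable[[str], List[str]]] = SOURCES.get(choice)
    if retriever is None:
        # Out-of-domain query: decline instead of retrieving anything.
        return {"source": None, "context": [], "answer": "Sorry, that is outside what I can help with."}
    return {"source": choice, "context": retriever(query), "answer": None}


print(agentic_retrieve("What is the remote work policy?")["source"])
print(agentic_retrieve("Who won the World Series?")["answer"])
```

Keeping the router as a plain callable means the keyword stub can be swapped for a real model call without touching the retrieval step.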

    • @Whynot83848 1 month ago +1

      @scycer Very cool example. Thanks

    • @egopgogo-job8614 19 days ago

      @scycer Really great explanation. Thanks!

    • @ClifHDunn 8 days ago

      @scycer Totally going to swipe this librarian analogy and may even give you credit on occasion 😊

  • @box4soumendu4ever 1 month ago

    👏👏🥰🥰🕊🕊🙏🙏👍👍🪻

  • @jojojojojojojo2 9 days ago

    Why do you say "We'll" instead of "We"? Are you only just starting to understand it theoretically? This stuff is more than 2 years old already... And on top of that, it is not only old but outdated as well...

  • @funkfreeze 2 months ago

    Overkill for most consumer-facing applications and, as the answer to generalist queries, not specific enough a system for internal tooling. Lots of noise introduced here.