Agentic RAG: Make Chatting with Docs Smarter

Поділитися
Вставка
  • Опубліковано 11 жов 2024

КОМЕНТАРІ • 29

  • @engineerprompt
    @engineerprompt  2 місяці тому

    Checkout the Advanced RAG course here: prompt-s-site.thinkific.com/courses/rag

    • @criticalnodecapital
      @criticalnodecapital 2 місяці тому

      thanks.. Can you become the ISHOWSPEED of AI. also are you based in USA or Subcontinent?

    • @engineerprompt
      @engineerprompt  2 місяці тому

      @@criticalnodecapital haha, that would be a good achievement :D I am based in the USA.

  • @camerongolinsky
    @camerongolinsky 9 днів тому +1

    Cool idea! When a course comes out focused on csv or databases, then I'll be there!

  • @unclecode
    @unclecode 2 місяці тому +4

    So clear and simple compared to other libraries for building genetic pipelines. Intuitive and feels like it should've been in Hugging Face libraries from the start. Makes other libraries seem overly complex and unnecessary. Easy to create an LLM engine with just a callable class. You can build any structure, with complexity only from yourself, not the library. Not surprising from Hugging Face, just like how fine-tuning models with HF library is intuitive and easy. Love a simple, powerful library that doesn't over-abstract. This is the way. Thanks for sharing.

    • @engineerprompt
      @engineerprompt  2 місяці тому +2

      Yeah, really like their implementation. Clean and straightforward.

  • @rocio6454
    @rocio6454 Місяць тому +1

    Thanks for the video! it would be interesting to see a multiagent and routing approach with 2 sources like a vector store for rag and a sql db, each one with their agents

  • @olympiasaha7165
    @olympiasaha7165 2 місяці тому +2

    It would have been interesting to see if you would have used GPT-4o as the LLM engine in the traditional RAG method to compare it with the agentic RAG response.

  • @anubisai
    @anubisai 2 місяці тому +4

    Agentic RAG + Knowledge Graph would be bad ass. Someone steal my idea, please. 😂 🙏

  • @barackobama4552
    @barackobama4552 2 місяці тому

    THANKS!

  • @BamiCake
    @BamiCake 2 місяці тому +3

    In your video the agentic rag takes about 4 times longer (15 sec). Is there a way to speed up agentic rag?

    • @engineerprompt
      @engineerprompt  2 місяці тому

      Unfortunately, using agents in the loop with take longer than standard RAG since it has to make additional calls to the LLM and do retrieval again. Over time you can cache queries and responses for faster retrieval.

  • @legendchdou9578
    @legendchdou9578 2 місяці тому +1

    Great video can we use GROQ API for the LLM?

    • @Parthi97
      @Parthi97 2 місяці тому

      It depends upon the prompt message you give.. Yes we can utilize GROQ models for simpler agentic RAG process

  • @CreativeEngineering_
    @CreativeEngineering_ 2 місяці тому +1

    I dont remember the last time I had and issue with hallucinations.

  • @nobody84980
    @nobody84980 2 місяці тому +1

    Why do I need an agent when I can add the agent description as a system prompt

    • @engineerprompt
      @engineerprompt  2 місяці тому +2

      Agent has the ability to do multiple passes of retrieval if it's not able to find the info in the first pass. If you add this to the system prompt, I will just run once and can't repeat the process with reasoning and Planning.

  • @paragshah2943
    @paragshah2943 2 місяці тому

    OP, Under what circumstances might you have duplicate chunks? Is it becuase two files that are same with differnt names?

    • @engineerprompt
      @engineerprompt  2 місяці тому

      Yes, that happens a lot. In big datasets, there can be duplicates.

  • @sauxybanana2332
    @sauxybanana2332 2 місяці тому +1

    how does this compare to graph rag?

    • @iukeay
      @iukeay 2 місяці тому

      It really depends on your use case.
      GraphRDF is currently ten to twenty times more expensive. Also, depending on the type of data and the type of query, it could be useful for you or not.
      It also increases lag by a very substantial margin. I have not found any startups or ideas implementing graph-lag effectively and usable yet.
      If you do, please keep me in the loop.

  • @nichtverstehen2045
    @nichtverstehen2045 2 місяці тому +1

    those silly comments like "Import necessary modules" or "Set up logging" make me sick. i stopped watching once i saw them.

  • @finalfan321
    @finalfan321 2 місяці тому

    too technical. where are friendly user interfaces websites/apps?