What is Retrieval Augmented Generation (RAG) - Augmenting LLMs with a memory

  • Published 8 Jan 2024
  • ► Jump on our free RAG course from the Gen AI 360 Foundational Model Certification (Built in collaboration with Activeloop, Towards AI, and the Intel Disruptor Initiative): learn.activeloop.ai/courses/rag
    ►Twitter: / whats_ai
    ►My Newsletter (My AI updates and news clearly explained): louisbouchard.substack.com/
    ►Support me on Patreon: / whatsai
    ►Join Our AI Discord: / discord
    How to start in AI/ML - A Complete Guide:
    ►www.louisbouchard.ai/learnai/
    Become a member of the YouTube community, support my work and get a cool Discord role:
    / @whatsai
    #ai #llm #rag
  • Science & Technology

COMMENTS • 30

  • @smritisrinivas7885 4 days ago +1

    Wow. Thanks a lot for this amazing explanation

  • @Parsley1965 4 months ago +4

    Truly excellent video!

  • @letseat3553 2 months ago +6

    RAG is just 'full text indexing' on the local data, with the ranked results fed into the context window and sent to the LLM along with the question.
    Every time I see it described, as something of a database guy for the last 30 years, all I see are new words describing long-solved problems.

    • @rajeshbasnet4454 2 months ago +2

      You mean like how Elasticsearch does indexing?

    • @ahmedzouaoui8177 1 month ago

      Well, new cars have wheels, a technology that has existed for thousands of years. That does not make new cars 'obsolete'; using an old technology to improve a new one is a great way of doing engineering!
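
The pipeline described in @letseat3553's comment (index the local data, rank the matches, place the top results in the context window next to the question) can be sketched as follows. This is a minimal illustration using only the Python standard library, assuming a simple TF-IDF overlap score for ranking; production RAG systems usually rank with embedding similarity instead. The documents, the function names and the prompt wording are all hypothetical, and the final LLM call is left out.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Return per-document term counts and an inverse document frequency table."""
    term_counts = [Counter(tokenize(d)) for d in docs]
    doc_freq = Counter()
    for counts in term_counts:
        doc_freq.update(counts.keys())
    n = len(docs)
    idf = {t: math.log((1 + n) / (1 + df)) + 1.0 for t, df in doc_freq.items()}
    return term_counts, idf


def retrieve(question, docs, term_counts, idf, k=2):
    """Rank documents by TF-IDF overlap with the question and keep the top k."""
    q_terms = tokenize(question)
    scored = []
    for i, counts in enumerate(term_counts):
        score = sum(counts[t] * idf.get(t, 0.0) for t in q_terms)
        if score > 0:
            scored.append((score, i))
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [docs[i] for _, i in scored[:k]]


def build_prompt(question, passages):
    """Put the retrieved passages and the question into one prompt string."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


# Hypothetical local data standing in for a company knowledge base.
DOCS = [
    "Refunds are processed within 14 days of the return request.",
    "Our office is closed on public holidays.",
    "Shipping to Canada takes 5 to 7 business days.",
]

term_counts, idf = build_index(DOCS)
question = "How long do refunds take?"
top = retrieve(question, DOCS, term_counts, idf, k=1)
prompt = build_prompt(question, top)
print(prompt)  # this string is what would be sent to the LLM
```

Only the retrieved passage travels with the question, which is why the size of the underlying data set does not affect the prompt length.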

  • @Plink2120 5 months ago +1

    Really clear and precise, thank you

  • @user-oh4jz9zu5v 5 months ago +1

    Now I understand what RAG (Retrieval Augmented Generation) is. Very informative video, liked your video 👍

  • @sabriboubaker 4 months ago +1

    Great video, straight to the point. Thanks again

    • @WhatsAI 4 months ago

      Thank you Sabri! :)

  • @Kama45 1 month ago +2

    Subbed

  • @MK-ce7im 3 months ago +2

    I think this is the best video I have seen on this topic. Wanted to ask if we can use RAG offline, maybe with a Mistral model?

    • @WhatsAI 3 months ago

      Of course, you can host everything locally if you have the capacity! :)

  • @prattipatimanojsai 5 months ago +1

    Very Informative and useful!! Thanks

  • @bhanujinaidu 2 months ago +2

    Thanks, very clear, excellent explanation

    • @WhatsAI 2 months ago

      Thank you! :)

  • @finn_the_dog 5 months ago +3

    Great video. Would you make a video on the different types of RAG? Or on how to prepare data for RAG, for example when your document has tables, math formulas, or references to images? I haven't seen much content about how to handle diverse data inside a document with RAG.
    Cheers

    • @WhatsAI 5 months ago +2

      Great idea, thank you! Will definitely look into multimodal RAG! :)

  • @chairwood 5 months ago +2

    thx. i enjoyed this video

    • @WhatsAI 5 months ago +1

      Glad to hear it, my friend! 😊

  • @helainz7198 1 month ago +1

    Et cetera, of course, my buddy

  • @JavierTorres-st7gt 6 days ago

    How can a company's information be protected with this technology?

  • @martinkrueger937 3 months ago

    By any chance, do you know which RAG system/framework gives the best performance?

    • @WhatsAI 3 months ago +1

      In our work, we like to use LlamaIndex for many parts and adapt it with our own code for more personalized settings!

  • @rhans6598 4 months ago

    Thanks but what's the point of sound effects?

  • @Mr_Arun_Raj 5 months ago

    After integrating RAG, latency increased...

    • @WhatsAI 5 months ago

      That is for sure! There are some downsides, but the added latency is very small.

  • @paulwillisorg 2 months ago

    The accent of the speaker is pretty heavy.

    • @WhatsAI 2 months ago +1

      Hope it’s still easy to understand!

  • @kunjs 4 months ago

    Google launched Gemini Advanced 1.5, a RAG killer 💀

    • @WhatsAI 4 months ago +4

      A database can be much larger than this context window and much more efficient, I believe. It's not yet clear how good the models are compared to GPT-4. Plus, sending millions of tokens with every prompt will be extremely expensive, haha! It's good for some use cases, like sending a full repo once and asking questions, but not for working with customers and handling many requests, I believe.
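
The cost argument in the reply above can be made concrete with some back-of-the-envelope arithmetic. The sketch below compares sending a full one-million-token context on every request with sending only a few retrieved passages. The price, the token counts and the request volume are illustrative assumptions, not real pricing for any model.

```python
def input_cost(tokens_per_request, requests, price_per_million_tokens):
    """Total input-token cost for a batch of requests."""
    return tokens_per_request * requests * price_per_million_tokens / 1_000_000


# All numbers below are hypothetical, chosen only to show the scaling.
PRICE = 5.0                      # assumed USD per million input tokens
REQUESTS = 1_000                 # assumed number of customer requests
FULL_CONTEXT_TOKENS = 1_000_000  # whole corpus sent with every prompt
RAG_TOKENS = 2_000               # question plus a few retrieved passages

full_context = input_cost(FULL_CONTEXT_TOKENS, REQUESTS, PRICE)
rag = input_cost(RAG_TOKENS, REQUESTS, PRICE)
ratio = full_context / rag

print(f"Full context: ${full_context:,.2f}")
print(f"RAG:          ${rag:,.2f}")
print(f"Ratio:        {ratio:.0f}x")
```

Under these assumptions the cost ratio is simply the ratio of tokens sent per request, so it holds whatever the actual price per token is. The one-off repository question mentioned in the reply corresponds to a single request, where the absolute difference stays small.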