The Power of Vector Databases For Knowledge Search

Поділитися
Вставка
  • Опубліковано 11 гру 2024

КОМЕНТАРІ • 57

  • @andythedishwasher1117
    @andythedishwasher1117 Рік тому +4

    I've been looking out/waiting for this functionality to pop up somewhere for months! Thanks a million for the heads up! Gonna go try that this evening.

    • @codetothemoon
      @codetothemoon  Рік тому

      nice! yeah it's definitely worth taking for a spin.

  • @AmeKniteCS
    @AmeKniteCS Рік тому +14

    Connecting something like this with open source projects would be great to attract more people to contribute, as it will save a lot of time trying to understand all the code.

  • @funnynews341
    @funnynews341 Рік тому +22

    Your videos are very valuable and of high quality, I feel you are very knowledgeable. Thank you so much!

  • @AngelinaGokhale
    @AngelinaGokhale 20 днів тому +1

    This was a really very informative and helpful video! Just what I needed! Thank you so much!

  • @Incertophile
    @Incertophile Рік тому +5

    Dude this is awesome!
    So many ideas spinning in my head. Most interesting project for me would be to have a vdb of all the posts from my best friends in our group chat, and we also meet every week where we talk about a movie or book or tv show that we all agree to watch, and I have recordings of all that.
    Would be great to create this and integrate it into our private slack or discord channel as an interactive bot. "What would think about this?" "Answer this question in the form of . Hilarity should ensue and maybe some really interesting use cases.

  • @DreamsAPI
    @DreamsAPI Рік тому +2

    Dude, you have the best youtube channel in terms of helping individuals with minimal programmable skills to learn and use advanced topics, and thank you for going in and explaining everything, I appreciate that.

    • @codetothemoon
      @codetothemoon  Рік тому

      thank you so much for the kind words! I definitely aim to make these as accessible as possible, I'm very happy whenever I hear that I've been successful 😎

  • @rsjeyt
    @rsjeyt Рік тому +2

    I tried this tool and I don’t recommend it. They say it’s open source but need an auth key to use it. You can’t get away from giving them your information. They give you a 15 day free trial for the front end. It’s a waste of time and so now I know why more people don’t use it.

  • @EranRiesenfeld
    @EranRiesenfeld Рік тому +3

    Love your videos 👍🏻 and „great, good, excellent“ on the opposite end of a cartesian plane to „JavaScript“ 😂

    • @codetothemoon
      @codetothemoon  Рік тому

      thank you! Glad someone caught this, was worried the joke would slip past everyone 😎

  • @seanfitzpatrick1531
    @seanfitzpatrick1531 Рік тому +2

    Wow this product looks great. I can tell it's gonna be very successful !

    • @codetothemoon
      @codetothemoon  Рік тому +1

      I agree that the potential here is huge!

  • @mozabeta7003
    @mozabeta7003 Рік тому +30

    I miss the coding aspects of your channel. I feel like I’m just watching adverts from the highest bidder now. This is a trend among quite a few channels nowadays.

    • @codetothemoon
      @codetothemoon  Рік тому +5

      thanks for the feedback! ooc are you not interested in vector databases in general, or you are you interested but just want to see things more from the perspective of the code? you may not have made it this far, but fwiw in this one the coding part starts at about 14:45. I think it's important for software developers to have vector databases on their radar and in their toolbox these days, be it Nuclia or something else.

    • @mozabeta7003
      @mozabeta7003 Рік тому +2

      @@codetothemoon I’m interested in learning I guess. I’m definitely interested in all things databases and would love to see a rudimentary implementation.
      I’d guess, like me, most of your viewers are rustaceans who are looking to enhance our understanding after reading the usual books.

    • @cmelgarejo
      @cmelgarejo Рік тому +2

      Well he has to maintain the channel and provide us good content, and to be fair, Nuclia is a nice option for vectorDBing.
      Keep em coming, @codetothemoon !

    • @AdrianMark
      @AdrianMark Рік тому +3

      Honestly, coming from web dev with javascript, I find these videos highly informative. Would happily sit through a bunch of ads for the comp sci info alone. That said, nuclia is actually useful, and I plan to work with it this weekend. Spent last weekend setting up private GPT only to realise the limitations, this looks so much more polished.

  • @Christian-op1ss
    @Christian-op1ss Рік тому +2

    Thank you, very interesting. A question: lets say you have 1000 documents stored, and you ask a natural language question to a system like this, am I correct in assuming it does this:
    - take the question and transform it into a vectorised search
    - takes the top results, look up the original text
    - feed these resulting texts into a natural language bot like chatgpt
    - have the bot write an answer based on this data
    If so, would this not be severely limiting to what you can ask it? It would be a bit like the Bing integration of GPT-4, it has no overview of the information, it can only get snippets then, due to the limited input buffer of GPT models. Would this mean that it is impossible to get for example, a summary of information in some documents you stored? (since that would require full document access, not snippets and search results)

    • @codetothemoon
      @codetothemoon  Рік тому +3

      I believe what you describe is roughly how it works. To your point, it seems like there'd be limitations of this approach - for example if part of a document was deemed relevant to answering a question, but additional context was provided elsewhere in the knowledge box that changes the meaning of the excerpt deemed "relevant". Maybe that scenario would yield an answer that isn't accurate. I haven't personally tested these boundaries myself, so I can't say for sure whether this issue exists and whether it is severe or not. It seems like LLMs that have accept massive prompts (tens of thousands of tokens) are an active area of research and would probably help if this problem does exist.

    • @Christian-op1ss
      @Christian-op1ss Рік тому

      @@codetothemoon Yes I remember seeing a paper on models with over 10k tokens without much performance degradation. That would certainly help. Thank you for the reply, and really enjoy your videos!

  • @kerodfresenbetgebremedhin1881
    @kerodfresenbetgebremedhin1881 Рік тому +2

    So potentially could upload a book into a knowledge base and query it in natural language ergo making reading books close to obsolete?

    • @Brianjp93
      @Brianjp93 Рік тому +2

      If you think that knowing things without having to ask an llm is useless, then sure

    • @alext5497
      @alext5497 Рік тому +1

      How exactly do you use books...

    • @codetothemoon
      @codetothemoon  Рік тому

      if your goal in reading books is to answer specific questions that you already have, then I think the answer is probably yes. But the thing about getting answers to questions is that it's only helpful when there's a gap in your knowledge that you're aware of - I think a lot of the knowledge to be gained from reading books comes from filling in gaps in our knowledge that we didn't even know existed.

    • @kerodfresenbetgebremedhin1881
      @kerodfresenbetgebremedhin1881 Рік тому

      @@codetothemoon fair point

  • @method_actor
    @method_actor Рік тому +3

    Really enjoy all your videos, you do a great job. I would be really interested in a tutorial about how you make your videos.

    • @codetothemoon
      @codetothemoon  Рік тому +1

      thanks so much for the kind words! I'd love to make such a tutorial - just need to figure out what the right medium for distributing such a thing would be. In the meantime, check out this video from Jeff of Fireship if you haven't already - he's got some pretty helpful tips - ua-cam.com/video/N6-Q2dgodLs/v-deo.html

    • @method_actor
      @method_actor Рік тому

      @@codetothemoon Thanks, I'll check it out

  • @DaveParr
    @DaveParr 11 місяців тому

    Lol at the subtle dunk on JS 1:30

  • @hicamajig
    @hicamajig Рік тому

    Rewind is also doing this tied with locally recording everything you do in a day on your computer so you can search backwards every content you have viewed and meetings you have had etc. Would love to figure out how I can vectorize a large sql server in some way to have conversational queries against data within the system.

  • @powpowpony9920
    @powpowpony9920 Рік тому

    Awesome introduction. Thank you.

  • @robbbieraphaelday999
    @robbbieraphaelday999 Рік тому +1

    How does it do the question answering? What is going on under the hood?

    • @codetothemoon
      @codetothemoon  Рік тому

      my understanding is that it first does a semantic search of your entire knowledge box to find information that is likely relevant to answering the question, then feeds all the relevant information, along with the question, into an LLM. Nuclia says there are also some other clever techniques involved that they haven't gone over with me, but that's the high level idea.

  • @2ndBison
    @2ndBison Рік тому

    Thank you for this. Any clues as to the pricing of the api? Couldn't find anything on their website (they need a little search box themselves :P)

  • @asatorftw
    @asatorftw Рік тому +1

    Damn this is inspiring. Gonna try it out making a knowledge Db

  • @TheDCEntertainment
    @TheDCEntertainment Рік тому

    i replicated the code of this tutorial letter for letter, but the results wont print, it doesnt seem to enter the for loop

  • @guilherme5094
    @guilherme5094 Рік тому +1

    👍Thanks.

  • @tiagodev5838
    @tiagodev5838 Рік тому

    I wonder if I add this video to NucliaDB and someone searches for Tupac, they will arrive at this video 14:43 😂
    Thank you for this fantastic content!

  • @thesimplicitylifestyle
    @thesimplicitylifestyle 5 місяців тому +1

    😎🤖

  • @Marek-mq5wu
    @Marek-mq5wu 10 місяців тому

    Thanks for the video, this topic very much interests me.
    But why do you keep saying `semantic meaning`? The word semantic means "relating to meaning in language". Just say meaning or semantic not both.

  • @longbranchgooberdapple2238
    @longbranchgooberdapple2238 Рік тому

    Rust nation!

  • @herrbanane
    @herrbanane Рік тому

    Right in front if my face 😂

  • @nikhilsathe5956
    @nikhilsathe5956 Рік тому +1

    I request you to not use Good, great, excellent words with javascript.

  • @endian675
    @endian675 Рік тому +1

    Poor JavaScript 🤣

  • @RoganRicheart
    @RoganRicheart Рік тому +1

    I was excited about this until I learned you need a professional web account.. I will stick with flexgpt.

    • @codetothemoon
      @codetothemoon  Рік тому

      yeah my guess is that they do that to throttle bot signups, maybe try using login with Github or Google Workspaces?

  • @Akitando
    @Akitando Рік тому +1

    %s/sytem/system/g

    • @codetothemoon
      @codetothemoon  Рік тому

      I know! I didn't notice this until after posting the video, was hoping it'd slip under everyone's radar, but you caught me.... 😎