Run All-in-One Local AI Infrastructure In MINUTES! (LLMs, RAG & More)

Поділитися
Вставка
  • Опубліковано 10 січ 2025

КОМЕНТАРІ • 68

  • @TyBohannon
    @TyBohannon Місяць тому +14

    And the physical requirements to run all of this without watching paint dry as tokens trickling?

  • @anshulgoel
    @anshulgoel 7 днів тому

    Great work Mervin! But there is an issue in this setup - While setting up vector database you are using postgres vector store. It should be Qdrant vector instead.

  • @josephshawa
    @josephshawa Місяць тому

    You are awesome! I am at the beginning of a large project and this just might be my gateway. Thank you.

  • @pabloweinxakapaulsynth966
    @pabloweinxakapaulsynth966 Місяць тому

    I'm completely frozen. You're a boss!!! Increíble video.

  • @Deeneeshsu
    @Deeneeshsu Місяць тому

    I haven’t made notification ON for UA-cam so far but you made me to do it man… lovely keep it up and do like this.
    Love and support from Ireland ❤

  • @paulmuriithi9195
    @paulmuriithi9195 Місяць тому

    This is priceless. Here in kenya, buisnesses are deploying agentic workflows as we speak, and we r using these tools

    • @Corteum
      @Corteum Місяць тому +1

      To do what specifically?

  • @samsquamsh78
    @samsquamsh78 Місяць тому

    yet another great video Mervin, well done!!

  • @Dr.UldenWascht
    @Dr.UldenWascht Місяць тому +4

    Seeing the 0$ in the thumbnail, I clicked so fast that I injured my index finger. I have to pay 100$ minimum to the doctor 😄

  • @KvikDeVries
    @KvikDeVries 28 днів тому

    Awesome work! I wonder though, it seems perfect for prototyping and DIY purposes, but can it be scaled to handle multiple users? I've not heard about n8n before, so it might be a dumb question :) ...thinking about it, probably the ideal answer to this would be something like 1. create workflow in n8n, 2. extract code that does all the work behind scenes, 3. add it as a module to whatever project you are working on, with all the credentials and variables configured with simple yaml file. One can dream :)

  • @MJackson-d9h
    @MJackson-d9h 26 днів тому

    great demo

  • @parthwagh3607
    @parthwagh3607 Місяць тому

    This is really great. Thank you so much for this information.
    Could you please create a video that will tell us how to create an entire book using this.
    Thank you.

  • @IdPreferNot1
    @IdPreferNot1 Місяць тому

    This detailed implementation of an n8n is a great video. Not big on no code since want to learn actual p[rocesses, but interested in these for their triggerr functions. Maybe you could do a video on how to do those outside of these no-code flows.

  • @yaseenal-wesabi5964
    @yaseenal-wesabi5964 Місяць тому

    Great work!

  • @sr.modanez
    @sr.modanez Місяць тому +1

    agora sim, em Portugues, top top top 👏👏👏👏👏👏👏

  • @Alen_115
    @Alen_115 Місяць тому +4

    The video was complete and impressive but because the llm requirements locally, i think it’s hard for now for most of people. I would love to see if you create a video on free agent framework with api like gemini or Deepseek and QwQ, which are cheap alternatives

    • @tonyclif1
      @tonyclif1 Місяць тому +3

      If I'm not mistaken, when he creates the AI agent at about 5:50, just select another option. OpenAI is one option (4o Mini is pretty cheap), but others can be added instead.

    • @ShubzGhuman
      @ShubzGhuman 23 дні тому

      u can use mistral free api

  • @moses5407
    @moses5407 Місяць тому +4

    hardware requirements?

  • @pabloescobar2738
    @pabloescobar2738 17 днів тому

    Thank for audio😊

    • @pabloescobar2738
      @pabloescobar2738 17 днів тому

      But i think docker its no good because in linux its more vulnerability, Microsoft azure create this app for him server, its not good worked 😒😔

  • @Techonsapevole
    @Techonsapevole Місяць тому +3

    great, why using qdrant when you can add pgvector support to postgres?

    • @vanmoofdiaries4955
      @vanmoofdiaries4955 Місяць тому +1

      Initially it was PGVector (ua-cam.com/video/TIMOfVhnjQA/v-deo.html), later he switched changed to Qdrant (ua-cam.com/video/TIMOfVhnjQA/v-deo.html).
      I reckon it's the same Qdrant vector store has been re used for Indexing.
      Should 've mentioned that.

    • @MervinPraison
      @MervinPraison  Місяць тому +1

      Sorry it’s my mistake , it should be the same on both indexing and querying

    • @justrobiscool4473
      @justrobiscool4473 Місяць тому

      @@MervinPraison so i should extract to postgres or get qdrant? im super noobish sorry if you explained it already but never used postgres and it just finished downloading should i dl qdrant too?

    • @MervinPraison
      @MervinPraison  Місяць тому +1

      ​@@justrobiscool4473 Yes please use Qdrant on both indexing and querying.
      I have fixed it in this mer.vin/2024/12/self-hosted-ai-starter-kit/
      You can just copy the JSON and paste it in the n8n workflow page. Everything should work as expected

    • @MervinPraison
      @MervinPraison  Місяць тому +1

      @Techonsapevole 's question. Yes you could even use pgvector on both indexing and querying, if you wish. Just compare the performance and use whatever you prefer.

  • @serhiilytvyn8753
    @serhiilytvyn8753 Місяць тому

    Cool system! Thanks!!!💛💙

  • @delcatta
    @delcatta 26 днів тому

    amazing! small typo on the blog for the docker-compose file name

  • @justrobiscool4473
    @justrobiscool4473 Місяць тому +2

    does anyone know any free chatrooms for chatting about things like this...i havent used n8n yet and was just wondering how to set it up

  • @souvickdas5564
    @souvickdas5564 23 дні тому

    I want to get some insights on a specific problem in RAG. suppose I want to provide domain specific knowledge to llm so that the LLM can analyse specific type of document based on the knowledge I provided. What will be the suitable way to address this kind of problems? I do not want to fine-tune the LLM .

  • @latelierduravinauloup994
    @latelierduravinauloup994 Місяць тому

    Wonderful, but I have a problem with qdrant config, my qdrant collection is empty :-(

  • @trendfollowers470
    @trendfollowers470 Місяць тому +1

    I am using office laptop is it possible without using docker

  • @jaggyjut
    @jaggyjut Місяць тому

    Nice but is n8n a scalable solution in production environment?

  • @moraddesign8641
    @moraddesign8641 Місяць тому

    thank you are the best

  • @BizAutomation4U
    @BizAutomation4U 19 днів тому

    I wish the 100% private dream was feasible with a sub-second inferecne. It's not, without hefty investments in really expensive GPUs (even with 8B models) particularly when considering agentic RAG against big data sources. That's why the only way to go is to use Groq. Maybe we'll get powerfull enough GPUs in the future for AI inferecne that compete with Groq.

  • @frageperez6958
    @frageperez6958 22 дні тому

    may i know if it's possible to migrate from qdrantDB to chromaDB (OLLAMA EMBEDDING)?

  • @davidpinalopez7467
    @davidpinalopez7467 23 дні тому

    When I'm trying setting up Qdrant Vector Sore, the qdrant collection is empty and there is an error "Parameter Quadrant Collection is required", Anybody happen the same?

  • @MrTapan1994
    @MrTapan1994 Місяць тому

    I wonder, The last time i tried to Create client id and secret it was not allowing to take localhost as host name in gcp ! how did it didn't show any error

  • @prasanthss420
    @prasanthss420 Місяць тому

    Vera level

  • @iltodes7319
    @iltodes7319 Місяць тому

    good jobe🙂

  • @justrobiscool4473
    @justrobiscool4473 29 днів тому

    so i have the workflow hooked up and all my creds locked in but for some reason i still cant pull simple info from the my docs the whole workflow runs but it doesn't seem to get the info if anyone can help that would be awesome i know mervin says even a beginner can do it so obviously i aint that experienced....thanks in advanced

  • @adityapanwar1220
    @adityapanwar1220 Місяць тому

    Impressive

  • @RetiredVet
    @RetiredVet Місяць тому +1

    This video looks fantastic. I am new programming AI, so I have some questions. The first question is, can I integrate crewai to work with this n8n AI solution? I have gotten deep into using crewai and I like how they do a lot of things, and putting the two together seems like it would be interesting. Second question is how do I figure out what should I look for in a laptop? I am thinking in the next year to buy a laptop that is good for running AI, but in looking around I am confused at what laptop would be best. I know chips like nvidia are required, but there are so many of them and reviews I have seen are confusing. They are more like sales pitches than a critical review of the hardware. Maybe I just haven't found the right website. Anyway, your video looks interesting and I can't wait to try it.

    • @paulmuriithi9195
      @paulmuriithi9195 Місяць тому

      Dont purchase yet. Best performance will be in deploying agentic swarms like as is given in this video. This can only be done using cloud compute. Are you deploying agents in yoir work?

    • @RetiredVet
      @RetiredVet Місяць тому

      @@paulmuriithi9195 Thanks for the feedback. I am retired. I have been learning AI just for fun and to keep active and to keep my mind alert. There are so many fascinating AI projects, but I settled on CrewAI because it makes sense to me. Other projects look great, but you have to start somewhere.

  • @premrajm1329
    @premrajm1329 8 днів тому

    can i use GPU instead of CPU

  • @TechWhizTutorials
    @TechWhizTutorials Місяць тому

    Plz make a video on how to use GPT 4o realtime audio and voice using Azure AI Foundry with Python.

  • @FirstLast-z2e
    @FirstLast-z2e 27 днів тому

    Thanks for the vid.
    how to solve error - Something went wrong
    Sorry, something went wrong there. Please try again.
    When trying to link Google Drive gmail account. Have setup GCP OAuth client creds. enter into n8n and accept scopes, then error/denied.

    • @FirstLast-z2e
      @FirstLast-z2e 27 днів тому

      Using a different browser helped and I was shown the consent prompt which was not being shown. approved the app, and OAuth connection is now working.

  • @MELTEDCREWCBA
    @MELTEDCREWCBA Місяць тому

    I'm trying to chat with the agent and i keep getting this error on the postgres node under the agent. i did the ingestion as you did, but it doesn't work. cant figure why

  • @HyperUpscale
    @HyperUpscale Місяць тому

    nice!

  • @loicbaconnier9150
    @loicbaconnier9150 Місяць тому

    You forget to add the error you made in the querying part using Postgress instead of Qdrant as vectordatabase..
    Your final solution works as you change it but noy in the video part 3.

  • @lalamax3d
    @lalamax3d Місяць тому +2

    how to put this bot on my website. so it can actually pop up like a chat window and talk to my clients...., must be a way... please add that , thanks in adv

  • @sais6027
    @sais6027 Місяць тому

    Are there any workflow tools that are cheaper or free?

    • @MervinPraison
      @MervinPraison  Місяць тому

      All these tools are open source

    • @sais6027
      @sais6027 Місяць тому

      @MervinPraison thanks, do I need any subscription or tokens for n8n. I am a newbie, so the basic questions

  • @christiandekorsy3089
    @christiandekorsy3089 Місяць тому

    Hey, Google Drive is not local!

    • @MervinPraison
      @MervinPraison  Місяць тому +1

      True, that is an example . Instead you could get the file from your local computer

  • @MH-xx6df
    @MH-xx6df Місяць тому

    Is this all free?

    • @MervinPraison
      @MervinPraison  Місяць тому +1

      Yes

    • @MH-xx6df
      @MH-xx6df Місяць тому

      @ and do you feel - for production ready app - llama is genuinely on par with frontier models now?

  • @MVPAJYT
    @MVPAJYT Місяць тому

    I’m in search of Workflow building steps similar to AutogenAI

  • @alx8439
    @alx8439 20 днів тому

    I think you're dealing with vector database in a bit sloppy way - whenever some file is changed on a source you should be first deleting the old relevant info from the vector database, and only then to upload the new one. Otherwise you're just adding new stuff side to side with the old stuff which is already there.

  • @unimposings
    @unimposings 17 днів тому

    Marvin you dont Speak German, so why the frik you use German Title? Can you not Butcher your Own Language. This Bad Translation is your channel not worth it. With a german Title, I Expect german Speaking content! if you cant deliver that, then do not trick me!

    • @MervinPraison
      @MervinPraison  17 днів тому

      Sorry, that’s not me. UA-cam automatically doing it