Build your own local o1 - here’s how

Поділитися
Вставка
  • Опубліковано 9 лют 2025
  • Wanna start a business with AI Agents? Go here: www.skool.com/...
    Work with David directly: gvw0h8ku6fc.ty... (limited to 5 people)
    Get early access to David's startup: forms.gle/SpuE...
    Ollama: ollama.com/
    Nemotron: ollama.com/lib...
    Cursor: www.cursor.com/
    Follow me on Instagram - / thedavit
    Follow me on Twitter - x.com/DavidOnd...
    Subscribe if you're serious about AI.
    Here's how to build your own 100% local o1 assisstant.

КОМЕНТАРІ • 120

  • @DavidOndrej
    @DavidOndrej  3 місяці тому +9

    Wanna build your own AI Startup? Go here: www.skool.com/new-society

    • @startingoverpodcast
      @startingoverpodcast 3 місяці тому

      Why aren't you using Msty?

    • @aaaaaaaaooooooo
      @aaaaaaaaooooooo 3 місяці тому

      Wait, my data is not private with o1? I didn't know that. Where can I check this? Where is this notified to the user, or did they bury it in small text?

  • @indiemusicvideoblog
    @indiemusicvideoblog 3 місяці тому +48

    Great! Now build a local agent with lama that can control your computer like Antropic

    • @orthodox_gentleman
      @orthodox_gentleman 3 місяці тому +8

      Very doable with Open-Interpreter which is open source and free

    • @Bllakez
      @Bllakez 3 місяці тому +5

      @@orthodox_gentleman How much should I pay someone to setup for me?

    • @alexrayoalv
      @alexrayoalv 3 місяці тому +6

      I literally did this 6 months ago.

    • @anubisai
      @anubisai 3 місяці тому +2

      You build it.😂

    • @marilynlucas5128
      @marilynlucas5128 3 місяці тому

      Skyvern!

  • @samimejri8079
    @samimejri8079 3 місяці тому +10

    I just used Llama 3.2 locally and asked about starting a 3d printing business as a 3D beginner. It gave a similar output of what you spent a good time building in this video... Maybe do it the next time, show a before and after response from an LLM.

  • @chrystofferaugusto1194
    @chrystofferaugusto1194 3 місяці тому +1

    Btw, the concept you reached in this video of undetermined number of agents is far superior than it was from a video from 5 days ago. Really awesome 👏🏻

  • @DCinzi
    @DCinzi 3 місяці тому +11

    There is a model called Llama3.3B-Overthinker. I think it would fit the task quite nicely.

    • @JackGamerEuphoriaDev
      @JackGamerEuphoriaDev 3 місяці тому +1

      Is there available in Ollama or hugging face? If you don't mind the question. Thanks by the way for giving directions..

  • @eviv8010
    @eviv8010 3 місяці тому +51

    nice clickbait

    • @kylev.8248
      @kylev.8248 3 місяці тому

      It’s not clickbait tho

    • @bruce_x_offi
      @bruce_x_offi 3 місяці тому +2

      @@kylev.8248 You must be King of fools

  • @godned74
    @godned74 3 місяці тому +1

    You could try "When providing responses, use concise and primary representations. However, include additional details only when needed to ensure clarity and completeness of the task" and you should get short response's with out compromising the chain of thought.

  • @Luxcium
    @Luxcium 3 місяці тому +5

    😂 I love the way you have called out your mistake 4:00 it was just so delightful to see you handle it like a boss that I have had to replay it more than 3 times to enjoy the moment... You are definitely a smart man!!! I am eager to see the evolution over time!!! 😅

  • @MrMoonsilver
    @MrMoonsilver 3 місяці тому +2

    Cool new format with the presentation man

  • @qkb3128
    @qkb3128 2 місяці тому

    Would have loved to check this out yet I don’t have that kinda money to spend to see the code. Good luck to ya .

  • @foxusmusicus2929
    @foxusmusicus2929 3 місяці тому +2

    Great video. Which hardware specs do you have? :-)

  • @TheDarkLordAngel
    @TheDarkLordAngel 3 місяці тому

    That mark on your nose-it’s almost like a signature, something that’s so naturally you.🖖👍

  • @szebike
    @szebike 3 місяці тому +1

    Nice, your contribution to the open source community is awesome!

    • @ysh7713
      @ysh7713 3 місяці тому

      opensource?

    • @szebike
      @szebike 3 місяці тому

      @@ysh7713 Well kind of ~ better than giving all you data to a faceless big company who wills steal your data 100%.

  • @mariomanca7546
    @mariomanca7546 3 місяці тому +2

    If you instruct the agent to use the fewest possible lines, it's likely to eliminate comments, which is suboptimal but expected.

  • @AGINews-TogethWithAI
    @AGINews-TogethWithAI 3 місяці тому

    exactly what I needed thank you so much David🎉

  • @hrarung
    @hrarung 3 місяці тому

    awesome video David! How to train this model based on my dataset? and How to give it a nice UI?

  • @Plife-507
    @Plife-507 2 місяці тому

    I want to build an agent swarm to do coin margined futures btc trading. With each agent handing a serpearte part, ta, market sentiment, execution, risk tolerance, is there a way to keep each model small and only train it to focus on its task?

  • @Visualife
    @Visualife 3 місяці тому

    You should use Anything LLM and docker / Open WebUI

  • @hedi177
    @hedi177 Місяць тому

    What is your setup ?

  • @mihaitanita
    @mihaitanita 3 місяці тому +12

    So, you've used Claude 3.5 (2024 october update) within Cursor AI Editor to develop a (simple) python script that run some agenting on a 70b model on ollama?
    Where's the o1 in here?

    • @Dancoliio
      @Dancoliio 3 місяці тому +4

      o1 is a reasoning model which kept their reasoning 'recipe' private. This is his take (which resonates with the average user of locally owned open source models) to kind of hack the way the 70b model works and simulate reasoning to enhance the final output> a simple method which actually does provide better replies.

    • @BikramAdhikari89
      @BikramAdhikari89 3 місяці тому +1

      He is not sharing his research paper published in arxiv my man.

  • @Bakobiibizo
    @Bakobiibizo 2 місяці тому

    A terminal?! I'm freaking out man

  • @AK-ox3mv
    @AK-ox3mv 3 місяці тому +1

    How much you'r local O1 results has more accuracy in comparison to original nemotron 70b and llama 3 3b without uaing chain of thought?
    Was there any improvement in bechmarks like Humaneval and MMLU?

  • @FuZZbaLLbee
    @FuZZbaLLbee 3 місяці тому

    You can also use the ollama streaming output to generate text. This way you know what’s the generator is doing.
    Also I think that GPT o1 does more then split up a task and let agents fix the individual tasks. But nevertheless, a nice tutorial on making agents.

  • @FrankDecker-n9e
    @FrankDecker-n9e 3 місяці тому +1

    @DavidOndrej, what is your Mac specs? I have a Macbook Pro M3 Max 48 GB..

  • @devbites77
    @devbites77 3 місяці тому

    Inspiring stuff. Cheers!

  • @KiranMohan-dpthinkr
    @KiranMohan-dpthinkr 3 місяці тому

    Hey David, how can we reassure clients that their data is secure and won't be shared with the LLM provider for internal training purposes? What steps can we take to ensure their data privacy and address any concerns they might have?

    • @cdunne1620
      @cdunne1620 3 місяці тому +1

      You d to ask that in David’s classroom at skoool

    • @KiranMohan-dpthinkr
      @KiranMohan-dpthinkr 3 місяці тому

      @@cdunne1620 Sure

    • @haljohnson6947
      @haljohnson6947 3 місяці тому +1

      He mentions that in the video like four times

    • @KiranMohan-dpthinkr
      @KiranMohan-dpthinkr 3 місяці тому

      @@haljohnson6947 can you mention the specific timeline where he described about it.

    • @KiranMohan-dpthinkr
      @KiranMohan-dpthinkr 3 місяці тому

      @@haljohnson6947 pls mention the timeline where he mentioned it.

  • @eado9440
    @eado9440 3 місяці тому +10

    🎉 you actually made it. Thanks

  • @MiNiD33
    @MiNiD33 3 місяці тому +1

    "Comments are apologies in code." - Robert C Martin.
    Cursor is helping you.
    Also for the price of the spec of this machine, you can buy an insane number if tokens from anthropic or openai. It might be worth getting people started using a hosted service.

  • @jayhu6075
    @jayhu6075 3 місяці тому

    What a great explanation. Thnx

  • @bsiix1576
    @bsiix1576 3 місяці тому +1

    Maybe I missed it, but what hardware is needed for that nemotron - it is 43GB? Doesn't that mean you need at least that much VRAM? And here I thought I was a baller with my 16GB vram...

  • @costatattooz840
    @costatattooz840 3 місяці тому +3

    locally what hardware do you need to run this at minimum? i have a 64gb ram + 3060 12gb

    • @ticketforlife2103
      @ticketforlife2103 3 місяці тому

      Watch the video

    • @H3XM0S
      @H3XM0S 3 місяці тому +5

      You'll need over 40gb vram so like 2 x rtx 4090 might be a good option. No idea what hardware is being used in the video. Anyone saying 'watch the video' should provide a timestamp.

    • @bollvigblack
      @bollvigblack 3 місяці тому

      this guys is rich. not even joking so

    • @chrystofferaugusto1194
      @chrystofferaugusto1194 3 місяці тому +4

      He is on a MacBook Pro bro…

    • @neomatrix-r7b
      @neomatrix-r7b 3 місяці тому

      64GB RAM + 4070 Ti Super (16 VRAM) = Run Nemotron-70b-instruct-q2_K

  • @michaeltse321
    @michaeltse321 3 місяці тому +1

    You downloade nemotron and not the 70b version which is why you had the error

  • @VinceOmondi
    @VinceOmondi 3 місяці тому

    Good stuff, Ondrej!

  • @orthodox_gentleman
    @orthodox_gentleman 3 місяці тому +5

    Dude, there are very few people that can run nemotron locally….

  • @EtH-xf6br
    @EtH-xf6br 3 місяці тому

    What a beast Macbook you need to have to get such a fast response. I have 7800x3D and 4080 rtx and its waaay slower.

  • @skulltrick
    @skulltrick 3 місяці тому

    Very inspiring! Thanks

  • @zechariahprince5671
    @zechariahprince5671 2 місяці тому

    We have had AGI for over a year.

  • @borick2024
    @borick2024 3 місяці тому

    Have you had a chance to compare your results against GPT4o?

  • @TheAsianDude9999
    @TheAsianDude9999 3 місяці тому

    What vscode extension are you using for your ai?

  • @aaaaaaaaooooooo
    @aaaaaaaaooooooo 3 місяці тому

    Are my prompts on o1-preview used to train the AI even if I opt out? Where do I find this information?

  • @MrAndrew535
    @MrAndrew535 3 місяці тому

    I want to preserve a million-word dialogue between myself and my ChatGPT on multiple threads while upgrading to your recommendations. How do I achieve that?

  • @rafaelortega1376
    @rafaelortega1376 3 місяці тому

    No repo to share the code?

  • @jefferystartm9442
    @jefferystartm9442 3 місяці тому

    Brooooo , there are tools you are behind on . Agent s and Claude computer use?? E2B has an open source version tooo 😊 stay blessed Ondrej

  • @11metatron11
    @11metatron11 3 місяці тому

    Not a chance with my elderly MacBook Pro. Looks like I need some new gear…

  • @gauravrewaliya3269
    @gauravrewaliya3269 3 місяці тому

    How to make local ai with backpropogation feature ( if got wrong stuff, CEO instruct what's wrong and it improve sub local agent by time )

  • @olivert.7177
    @olivert.7177 3 місяці тому +4

    There is also an nemotron-mini model which is only 4b.

    • @samuelgarcia1802
      @samuelgarcia1802 3 місяці тому

      How good it is? In hugging face I saw nematron was in a bad place

    • @orthodox_gentleman
      @orthodox_gentleman 3 місяці тому

      Really??? Omg that is great

  • @aatheraj1667
    @aatheraj1667 3 місяці тому

    Yet, we don't one that could trade Nasdaq futures.

  • @immortalityIMT
    @immortalityIMT 3 місяці тому

    Cool!

  • @danieleduardo9800
    @danieleduardo9800 3 місяці тому

    How’d you get composer in the sidebar?

  • @hotlineoperator
    @hotlineoperator 3 місяці тому +2

    I have test o1 - and it is not so smart. People still need to quide its selections. Big problem with models is censorship, someone else have select what you can do and not to do with these tools.

  • @SjarMenace
    @SjarMenace 3 місяці тому +4

    why do you have that thing on your nose?

    • @babyjvadakkan5300
      @babyjvadakkan5300 3 місяці тому

      For correcting the nasal path/nose bridge (or something like that

    • @INeedMeme
      @INeedMeme 3 місяці тому

      More oxygen bro

    • @cdunne1620
      @cdunne1620 3 місяці тому

      Soccer players used to wear them years ago for example Robbie Fowler for Liverpool

  • @MrMoonsilver
    @MrMoonsilver 3 місяці тому +7

    Also, I hope the bruise on your nose heals soon. Been a long time now.

    • @Tetardo
      @Tetardo 3 місяці тому +1

      I think it’s a medical device that helps him breathe

  • @gaelfalez
    @gaelfalez 3 місяці тому +1

    Missing the comparison between result using multiple agents and result using just 1....
    Disappointing. We Don t even know if it is worth the work....

  • @avi7278
    @avi7278 3 місяці тому +4

    Oh yeah im sure openai is quaking in their boots, bro.

  • @slt
    @slt 3 місяці тому

    Dadusak!

  • @themax2go
    @themax2go 2 місяці тому

    modern day sham(mer) 👍

  • @aljosja3353
    @aljosja3353 3 місяці тому

    Which computer u can use for local llm

  • @SCHaworth
    @SCHaworth 3 місяці тому

    No. Not quite. You have to split the turns.

  • @supermandem
    @supermandem 3 місяці тому

    Bro llama is nowhere near o1 wtf

  • @blasterzm
    @blasterzm 3 місяці тому

    Lol, that's not how O1 works. You can't tell it in the system prompt

  • @claxvii177th6
    @claxvii177th6 3 місяці тому

    1 token per second is too slow for any pratical use...

  • @chrystofferaugusto1194
    @chrystofferaugusto1194 3 місяці тому

    You should have a discord community to people share projects and business

    • @chrystofferaugusto1194
      @chrystofferaugusto1194 3 місяці тому

      Never mind, now I got the business model on skool. Nice call, thinking about joining it

  • @dark_cobalt
    @dark_cobalt 3 місяці тому +2

    Already have it lol. Running it on my RX 7900XTX with q4m, but i think ill buy myself 1-2 Radeon W7900 Pro to gain a lot more performance. Alsp you don't need Ollama for it, because it's available in LM Studio and it's downloading from Huggingface.
    Btw what PC hardware specs do you have?

    • @rhadiem
      @rhadiem 3 місяці тому

      He's clearly using a 128gb Macbook Pro which can use the memory as vram. He's running un-quantized. How much vram do you have on your gaming gpu? Nobody asked about your hardware bro.

    • @dark_cobalt
      @dark_cobalt 3 місяці тому +1

      @@rhadiem Every PC can use the RAM as VRAM. It's how computers work. It's called virtual memory. If the VRAM fills up, the computer uses the RAM as backup memory, to stay stable and not crash. But the RAM is waaaaaaay slower than the VRAM, that's why I am asking him what specs he has. My GPU has 24GB of VRAM and even with the Quant 4M (around 32GB) model of Nemotron 70B my VRAM gets filled up completely and my RAM also to 50GB, which slows down the model to such an amount, that it's painfully slow. He is using a way bigger model, without any issues. If he has a GPU with this huge amount of VRAM, this would be totally understandable, but with the RAM? I don't understand why lol. 😄

  • @sushilsharma1621
    @sushilsharma1621 3 місяці тому +1

    clickbait or misleading title

  • @adithyansreeni7491
    @adithyansreeni7491 3 місяці тому

    i fkin slep bro

  • @dorukkurtoglu
    @dorukkurtoglu 3 місяці тому

    27:36 LOL🤪

  • @ShishuSud
    @ShishuSud 3 місяці тому +1

    😇

  • @Álvaro-o5e
    @Álvaro-o5e 3 місяці тому +3

    99% of free stuff sucks. One of them is this video. 20 minutes to answer "why is the sky blue?"

    • @overunityinventor
      @overunityinventor 3 місяці тому +1

      free stuff has a learning curve, it's not everyone's cup of tea

    • @tomwawer5714
      @tomwawer5714 3 місяці тому +2

      99% of paid software sucks and it hurts your wallet

  • @surendarreddys7298
    @surendarreddys7298 3 місяці тому +2

    1st one to comment 😄

  • @HimaLoubi
    @HimaLoubi 3 місяці тому +1

    😂 you need a graphic card with a price of a Tesla car to run that module locally ; btw you talk like 10.000word/min , 😅

  • @gustavramedies2901
    @gustavramedies2901 3 місяці тому

    David i would like to create sales agents,lead generators,receptionist,appointment setters and I want to sell them.Can you help 😢

  • @EduardoAlarconGallo
    @EduardoAlarconGallo 3 місяці тому

    Title is misleading. You are using Llama which is a LLM but not a Reasoner model

  • @stefanschz7589
    @stefanschz7589 3 місяці тому

    Awesome!

  • @TheBhushanJPawar
    @TheBhushanJPawar 3 місяці тому

    I am getting following error:
    bhushan@Bhushans-MacBook-Pro ~ % ollama run nemotron
    Error: llama runner process has terminated: signal: killed

    • @TheBhushanJPawar
      @TheBhushanJPawar 3 місяці тому

      After clearing some memory now it's started working...