FREE Local LLMs on Apple Silicon | FAST!

Alex Ziskind

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 9 тра 2024
Step by step setup guide for a totally local LLM with a ChatGPT-like UI, backend and frontend, and a Docker option.
Temperature/fan on your Mac: www.tunabellysoftware.com/tgp... (affiliate link)
Run Windows on a Mac: prf.hn/click/camref:1100libNI (affiliate)
Use COUPON: ZISKIND10
🛒 Gear Links 🛒
* 🍏💥 New MacBook Air M1 Deal: amzn.to/3S59ID8
* 💻🔄 Renewed MacBook Air M1 Deal: amzn.to/45K1Gmk
* 🎧⚡ Great 40Gbps T4 enclosure: amzn.to/3JNwBGW
* 🛠️🚀 My nvme ssd: amzn.to/3YLEySo
* 📦🎮 My gear: www.amazon.com/shop/alexziskind
🎥 Related Videos 🎥
* 🌗 RAM torture test on Mac - • TRUTH about RAM vs SSD...
* 🛠️ Host the PERFECT Prompt - • Hosting the PERFECT Pr...
* 🛠️ Set up Conda on Mac - • python environment set...
* 🛠️ Set up Node on Mac - • Install Node and NVM o...
* 🤖 INSANE Machine Learning on Neural Engine - • INSANE Machine Learnin...
* 💰 This is what spending more on a MacBook Pro gets you - • Spend MORE on a MacBoo...
* 🛠️ Developer productivity Playlist - • Developer Productivity
🔗 AI for Coding Playlist: 📚 - • AI
Repo
github.com/open-webui/open-webui
Docs
docs.openwebui.com/
Docker Single Command
docker run -d --network=host -v open-webui:/app/backend/data -e OLLAMA_BASE_URL=127.0.0.1:11434 --name open-webui --restart always ghcr.io/open-webui/open-webui:main
- - - - - - - - -
❤️ SUBSCRIBE TO MY UA-cam CHANNEL 📺
Click here to subscribe: / @azisk
- - - - - - - - -
Join this channel to get access to perks:
/ @azisk
- - - - - - - - -
📱 ALEX ON X: / digitalix
#machinelearning #llm #softwaredevelopment
Наука та технологія

КОМЕНТАРІ • 281

@camsand6109 Місяць тому ⁺²⁶
This channel is the gift that keeps on giving.
@JosepCrespoSantacreu Місяць тому ⁺⁴
Another great video Alex, I really enjoy your videos. And I really appreciate your perfect diction in English, which makes it easy to follow your explanations even for those who do not have English as their first language.
@isaquehg 16 днів тому ⁺²
I really appreciated your approach showing "behind the scenes" instead of just running Docker. Great video, as always!
@asnifuashifj91274 Місяць тому ⁺¹⁴
Great video Alex! yes please make videos on image generation!
@3monsterbeast Місяць тому
This channel is going to be growing so fast; you make great videos that are very helpful!
@gustavohalperin Місяць тому ⁺⁵
Great video!! And yes, please add a video explaining how to add the images generator.
@ReginaldoKono Місяць тому ⁺¹
Yes Alex, you will help us more if we could learn with you how on how to add an image generator as well. We thank you for your time and colaboraron. Your channel is a must have subscription in it now-a-days.
@aldousroy Місяць тому ⁺¹
Awesome thing waiting for more videos on the way
@7764803 Місяць тому ⁺²
Thanks Alex for videos like this 👍
I would like to see Image generation follow up video 😍
@ChrisHaupt Місяць тому ⁺¹
Very interesting, will definitely be trying this when I get a little downtime!
@Ginto_O Місяць тому ⁺¹
Thank you, got it to work without docker
@cstenger Місяць тому
I like the manual installation process because it uses less resources than having docker running all the time on your mac.
thanks for the tutorial, I really enjoyed doing it and seeing how it works.
@mrdave5500 Місяць тому
Woot woot! great stuff. Nice easy tutorial and I now have a 'smarter' Mac. Thanks :)
@AaronHiltonSPD Місяць тому ⁺⁶
Amazing tutorial. Great stuff!
@AZisk Місяць тому ⁺²
Thank you! Cheers!
@kaorunguyen7782 Місяць тому
Alex, I love this video very much. Thank you!
@loveenjain Місяць тому
Excellent Video giving it a try tonight on my M3 Max 14 inch model and see what are the results will share probably...
@ilkayayas Місяць тому
Nice. Image generation and integrating new chatgpt in to this will be great.
@iv4sik Місяць тому ⁺¹
if ur trying docker, make sure it is version 4.29+, as host network driver (for mac) revealed there as a beta feature
@erenyeager655 Місяць тому ⁺¹
One thing for sure... I'll be implementing this on my menu bar for easy access :D
@brunosanmartin1065 Місяць тому ⁺²⁸
These videos are so exciting for me; this channel is the number one on UA-cam. That's why I subscribe and gladly pay for UA-cam Premium. A hug, Alex!
@AZisk Місяць тому ⁺⁵
thanks for saying! means a lot
@RealtyWebDesigners Місяць тому ⁺³
Now we need 1TB MEMORY DRIVES (Like the Amiga used to have 'fast ram' )
@MrMrvotie Місяць тому
@@AZisk Is their any chance you could incorporate a PC GPU Relative Performance Equivalence to each new apple silicon microchip that you review?
@mendodsoregonbackroads6632 Місяць тому
Yes I’m interested in an image generation video. I’m running llama3 in Bash, haven’t had time to set up a front end yet. Cool video.
@moranmono Місяць тому ⁺¹
Great video. Awesome 👏
@sungm2n Місяць тому
Amazing stuff. Thank you
@marcel948 26 днів тому
Excellent video! Keep up!
@jorgeluengo9774 Місяць тому ⁺¹
by the way, I just joined your channel, I really enjoyed these videos, very helpful, thanks!
@AZisk Місяць тому
awesome. welcome!
@DaveEtchells Місяць тому
I was gonna spring for a maxed M3 Max MBP, but saw rumors that the M4 Max will have more AI-related chops, so just picked up a maxed M1 Max to tide me over 😁
Really excited about setting all this up, finding this vid was very timely, thanks!
@vadim487 Місяць тому
Alex, you are awesome!
@johnsummers7389 Місяць тому ⁺¹
Great Video Alex. Thanks.
@AZisk Місяць тому
Glad you liked it!
@sikarinkaewjutaniti4920 Місяць тому
Thx for sharing good stuff for us. Nice onec
@BenjaminEggerstedt Місяць тому
This was interesting, thanks
@OrionM42 Місяць тому
Thanks for the video.😊😊
@guyguy467 Місяць тому ⁺³
Thanks! Very nice video
@AZisk Місяць тому
Wow! Thank you!
@bvlmari6989 Місяць тому ⁺¹
Amazing video omg, incredible tutorial man
@AZisk Місяць тому
Glad you liked it!
@WokeSoros Місяць тому
I was able to, by tracking down your Conda video, get this running.
I have some web dev and Linux experience, so it wasn’t a huge chore but certainly not easy going in relatively blind.
Great tutorial though. Much thanks.
@dibyajit9429 Місяць тому ⁺¹
I've just started my career as a Data Scientist, and I found this video to be awesome! 🤩🥳Could you please consider making a video on image generation (in LLama 3) in a private PC environment?🥺🥺
@AzrealNimer Місяць тому ⁺¹
I would love to see the image generation tutorial 😁
@lucerocj 23 дні тому
Would love to see an extended video on working with local files for this option.
@gligoran Місяць тому
Amazing video! I'd just recommend Volta over nvm.
@AC-cg6mf Місяць тому ⁺²¹
I really like that you showed the non-docker install first. I think too many rely on docker black-boxes. I prefer this. Thanks!
@philipo1541 Місяць тому ⁺⁵
Dockers are not a black-box. You can get it in them, and change stuff!!!
@veccio Місяць тому
Respectfully, Docker need not be a black box. Don’t be afraid to tinker and dig in. :) But I get how doing it manually forces you to touch different parts.
@willmartin4715 Місяць тому
i believe my laptop has 80 Tensor cores. for starters. This looks like a really good shift for a fri night! thanks.
@RealtyWebDesigners Місяць тому ⁺⁵
BTW - One of the BEST programmer channels!
@Meet7 Місяць тому
thanks alex
@AdityaSinghEEE Місяць тому
Can't believe, I found this video today because I just started searching for Local LLMs yesterday and today, I found the complete guide. Great video Alex :)
@scorn7931 Місяць тому
You live in Matrix. Wake up
@haralc Місяць тому
Oh you got distracted! You're a true developer!
@akhimohamed Місяць тому ⁺¹
As a game dev, this is so good to have. Btw am gonna try this on parallels for my m1 pro
@Lucas-fl8ug Місяць тому
You mean in windows through parallels? why would it be useful?
@filipjofce Місяць тому
So cool, and it's free (if we don't count the 4 grands spent for the machine). I'd love to see the images generation
@Dominickleiner Місяць тому ⁺¹
instant sub, great content thank you!
@AZisk Місяць тому
Welcome aboard!
@soulofangel1990 Місяць тому
Yes, we do.
@agnemedia624 Місяць тому
Thanks 👍🏻
@Daydream_Dynamo 20 днів тому
Please do a video about image generation!!
@keithdow8327 Місяць тому ⁺⁴
Thanks!
@AZisk Місяць тому
Wow 🤩 thanks so much!
@XinYue-ki3uw Місяць тому
i like this tutorial, it is computer dummy friendly~
@shapelessed Місяць тому ⁺¹⁰
YO! Finally hearing of a big Svelte project!
Like really, it's so much quicker and easier to ship with Svelte than others, why am I only seeing this now?
@AZisk Місяць тому ⁺⁴
Svelte for the win!
@precisionchoker Місяць тому ⁺¹
Well.. Apple, Brave, New York times, IKEA among other big names all use svelte
@shapelessed Місяць тому
@@precisionchoker But they do not acknowledge that too much..
@ontime8109 Місяць тому
thanks!
@davidgoncalvesalvarez Місяць тому ⁺¹³⁰
My M1 Mac 16GB be real frightened on the side rn.
@blackandcold Місяць тому ⁺¹²
I ran 7b variants no problem on my now sold m1 air 16g
@ivomeadows Місяць тому ⁺⁵
got macbook with the same specs. tried to run 15b starcoder2 quantized k5m in LM studio on it, max GPU layers, getting me around 12-13 tokens per sec, not good but manageable
@RobertMcGovernTarasis Місяць тому ⁺¹⁰
Don't be, unless you are using other things that are super heavy as well. Llama3 8B(?) takes up about 4.7GB of Ram, with the Silicon's event use of the Nvme and Swap you'll be fine. (I prefer using LM Studio now to Ollama as it has CLI and Web built in, no need for Docker/OrbStack but, Ollama on its own without a WebUI works too)
@martinseal1987 Місяць тому
😂
@DanielHarrisCodes Місяць тому
Great video. What format are LLM models download as? Looking into how I can use those downloaded with OLLAMA with other technologies like .NET
@LucaCilfoneLC Місяць тому
Yes! Image generation, please!
@jakubpeciak429 Місяць тому
Hi Alex, I would like to see the image generation video
@Raptor235 Місяць тому
Great video Alex, is there anyway to have an LLM execute local shell scripts to perform tasks?
@pixelplay1098 Місяць тому
Amazing stuff as Usual. Now make a tutorial on Automatic 1111
@erwintan9848 Місяць тому ⁺¹
Is it fast on mac m1 pro too?
How many storage used for all instalation sir?
Your video is awesome!
@innocent7048 Місяць тому ⁺²⁰
Here you have a super like - and a cup of coffee 🙂
@AZisk Місяць тому ⁺⁷
Yay, thank you! I haven't been to Denmark in a while - beautiful country.
@youssefragab2109 Місяць тому ⁺¹
This is really cool, love the channel and the videos Alex! Just curious, how is this different to an app like LM Studio? Keep up the good work!
@yuanyuanintaiwan Місяць тому
My guess is that this web UI has more capabilities such as image generation which LM Studio doesn’t have. If the goal is simply to have text interaction, then I agree that this may not be necessary
@MohammedAraby Місяць тому
Well be happ to see a tutorial for automatic 1111 ❤
@engr.hashimali758 11 днів тому
Hi Alex, I really appreciate your work, can you please sugggest me any model where I need a detailed summary and breakdown of quantities from this construction PDF drawings. Which organize the quantities by type (linear feet, square feet, each) and categorize them by division (e.g., General Requirements, Sitework). Waiting for your response!
@ashesofasker Місяць тому
Great video! So are you saying that we can get ChatGPT like quality just faster, more private and for free by running local LLM's on our personal machines? Like, do you feel that this replaces ChatGPT?
@yianghan751 Місяць тому ⁺¹
Alex, excellent video!
Can my MacBook air m2 with 16G RAM host these AI engines smoothly?
@kdnz_music Місяць тому
Just install LM Studio and get any model and good native ui in a couple clicks
@reyaustineechavez9600 28 днів тому
This hungs up my M1 Air everytime I gave a prompt Lol
@Mikoaj-ie6gt Місяць тому
very intresting
@rickymassi Місяць тому
Why not doing a deployment with Electron, so you have a desktop application. Btw I love this thing!!!
@matteobottazzi6847 Місяць тому ⁺³
A video on how you could incorporate these LLMs in your applications would be super interesting! Let's say that in your application you have a set of pdfs or html files that provide documentation on your product. If you let these LLMs analyse that documentation, then the user could get very useful information just asking and not searching through all of the documentation files!
@FelipeViaud Місяць тому ⁺²
+1
@neoqe6lb Місяць тому ⁺¹
Ollama has api endpoints that you can integrate in your apps. Check their documentation.
@AIandVisuals 26 днів тому
Great video. Congrats. But how to install node? Would it not work with a Python front end?
@OlegShulyakov Місяць тому
When there will be a video to run LLM on an iPhone or iPad? Like using LLMFarm
@FaysalBDev Місяць тому
lets do some image generation please it would be super helpful
@cookiebinary Місяць тому ⁺²
Tried llama3 on 8GB ram M1 :D ... I guess I was too optimistic
@toddbristol707 Місяць тому ⁺¹
Great channel! I just did a build something similar with lm studio and flask based web ui. I’m going to try this method now. Btw, what was the ‘code .’ command you ran? Are you using visual studio code? Thanks again!
@AZisk Місяць тому
Thanks! and thanks for joining. I did the flask thing a few videos ago, but it's just another thing to maintain. I find this webui a lot more feature rich and better looking. And yes, the 'code .' command just opens the current folder in VSCode
@113bast Місяць тому ⁺⁴
Please show image generation
@99cya Місяць тому ⁺¹
Hey Alex, would you say Apple is in a very good position when it comes to AI and the required hardware? So far Apple has been really quiet and lots of ppl dont think Apple can have an edge here. Whats your thought in general here?
@jehad4455 Місяць тому
Mr. Alex Ziskind
Could you clarify whether training deep learning models on a GPU for the Apple Silicon M3 Pro might reduce its lifespan?
Thank you.
@truenetgmx Місяць тому
now benchmark it vs mac air :) also wonder how much these are usefull tools and not just toys
@zorawarsingh11 Місяць тому
Yes do images please 🙏🏻
@gayanperera7273 Місяць тому
Thanks @Alex, by the way is there a reason it can only use GPU, any reason not taking advantage of NPUs ?
@ChiliadStudios 29 днів тому
please make a video on how to run huggingface models on this thing. i cant for the life of me figure it out. such a headache
@tyron2854 Місяць тому ⁺¹
What about a new M4 iPad Pro video?
@MW-mn1el Місяць тому ⁺³
I use Ollama with Continue plugin with VSCode. And Chatbox GUI when not code related. Work well with both Mac and Linux with Ryzen 7000 CPU. On linux it's running in a podman(docker) container. But best experience is with MacBook Pro, apple silicon and unified memory make it speedy.
@sergey_c Місяць тому
Было бы здорово ещё дать краткое описание каждой из моделей и рейтинг популярности или узконаправленности. А то установишь какие-то неизвестные модели себе на мак)
@uwegenosdude Місяць тому
Thanks for the video. I tried to download the code companion. Do you know why when the download of this LLM is going on, happens an upload of a couple of GBytes?
@motocar1585 21 день тому ⁺¹
how to do it with docker and also can i live this host it on vercel or netlify ??
@howfakeisfake Місяць тому
great job, I had some issues with finding the backend directory (it was missing) and the node thing ... btw Ollama is great
@bisarothub1644 Місяць тому
Great video. But I think Jan AI is a lot easier to configure and setup for mac users
@cjchand Місяць тому ⁺¹
Just some food for thought for future vids: Anaconda's licensing terms changed to require any org > 200 employees to license it. For this reason, many Enterprises are steering their devs away from Anaconda. Would be helpful if the tutorials used "vanilla" Python (e.g.: venv) unless Conda were truly necessary. Thanks for the vids and keep up the great work!
@AZisk Місяць тому
good to know. thanks
@swapwarick Місяць тому ⁺²⁶
I am running llama, code Gemma on my laptop for local files intelligence. It's slow but damm it reads all my PDFs and give perfect overview
@devinou-programmationtechn9979 Місяць тому ⁺¹⁰
Do you do it through ollama and open webui ? I m curious as to how you can send files to be processed by llms
@ShakeAndBakeGuy Місяць тому
@@devinou-programmationtechn9979 GP4All works fairly well with attachments. But I personally use Obsidian as a RAG to process markdown files and PDFs. There are tons of plugins like Text Generator and Smart Connections that can work with Ollama, LM Studio, etc.
@TheXabl0 Місяць тому
Can you describe this “perfect overview”? Just curious what you mean by
@swapwarick Місяць тому
Yes running open webui for llama and code Gemma llms on windows machine. Running open webui on localhost gives textarea where you can upload the file. The upload takes time. Once it is done, you can ask questions like give me an overview of this document, tell me all the important points of this document etc
@TheChindoboi Місяць тому
Gemma doesn’t seem to work well on Apple silicon
@thetabletopskirmisher Місяць тому
What advantage does this have over using LM Studio that you can install directly as an app instead of using the Terminal? (Genuine question)
@thevirtualdenis3502 Місяць тому
Thanks ! Is Macbook air enough for that?
@jorgeluengo9774 Місяць тому
Thank You Alex, amazing video, I followed all steps and I enjoyed the process and the results with my m3 max. I wonder if there is a GPT that we can use from the laptop and have searches online since the cutoff knowledge date of these models seem to be over a year ago or more. For example when I ask questions of what is the terraform provider version for aws or other type of platform, is old and there is a potential to have deprecated code responses. What do you recommend in this case? not sure if you have already a video for that lol.
@AZisk Місяць тому ⁺¹
that’s a great question. you’ll need to use a framework like flowise or langchain to accomplish this I believe, but i don’t know much about them - it’s on my list of things to learn
@jorgeluengo9774 Місяць тому
@@AZisk makes sense, I will do some research about it and see what I can find out to test but I will look forward when you share a video with this type of model orchestration, will be fantastic.
@AlexLaslau Місяць тому ⁺¹
MBP M1 Pro with 16GB of RAM would be enough to run this?
@joaquincaballero4353 Місяць тому
Image generation video please
@justintie Місяць тому ⁺¹
the question is: are opensource LLMs just as good as say chatGPT or Gemini?
@sitrakaforler8696 Місяць тому
Letzzzz gooooo
@bekagelashvili2904 Місяць тому
easy question, if i am not developer, what's the benefit i get from installing LLM in my apple silicon, what's the difference, between free version, or paid version of ai models ?

Наступне

Автоматичне відтворення