These videos are really cool. I'm not a beginner, far from it, but it is soooo nice to get this information in such a distilled manner, and from a person that clearly knows what they are talking about. So natural!
And she has a great personality.😂
I see a lot of explainer videos and yours are the best! Just great content delivery and tone, perfection all around!
Can we use any Llama-based model? In the destination, can we use the LLM we have downloaded? I mean a custom LLM based on Llama.
Can you provide the WireGuard instructions you mentioned? Btw, perfect tutorial :)
That looks like a nice way to run an LLM for my personal use, but I’d also like to try out one of the bigger models.
Is that doable at all?
Or will I need to stick to models that fit within the 40 GB of GPU memory on the A100, for instance?
How big are you talking about? Generally, the amount of VRAM you need is the parameter count times the parameter size in bits, divided by 8 to get bytes, plus about 20% overhead (a rough calculation is sketched after the list below). Check this short for more info: ua-cam.com/users/shortstCE-awsKmmg
In general though:
* 13B or lower: any GPU works, no caveats
* 30B or lower: any GPU works, but you need at least Q8 or FP8 quantization
* 70B or lower: use the A100 80GB or the L40S
* greater than 80B: it depends; if you're lucky it'll work on one GPU, if not you'll need to use multiple GPUs
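To make that rule of thumb concrete, here's a tiny back-of-the-envelope sketch in Python (the 20% overhead is the fudge factor from the formula above, not an exact figure):

def vram_gb(params_billion, bits_per_param):
    # parameter count * bits per parameter / 8 -> bytes, plus ~20% overhead
    bytes_needed = params_billion * 1e9 * bits_per_param / 8
    return bytes_needed * 1.2 / 1e9  # convert to GB

# e.g. a 70B model: ~168 GB at FP16, ~84 GB at Q8, ~42 GB at Q4
for bits in (16, 8, 4):
    print(f"70B @ {bits}-bit: ~{vram_gb(70, bits):.0f} GB")

The real number also depends on context length and KV cache, so treat the estimate as a floor rather than an exact requirement.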
Cool vid, thanks!
ollama run llama3 why is fly cool?
Great video 👌👌
Thank you 👍
I don't understand, how is this self-hosting? Isn't this cloud hosting?
Hey, I tried setting this up but I have this error:
2024-08-24T00:27:36.386 runner[***] ord [info] Machine started in 3.517s
2024-08-24T00:27:37.133 app[***] ord [info] INFO Main child exited normally with code: 1
2024-08-24T00:27:37.152 app[***] ord [info] INFO Starting clean up.
2024-08-24T00:27:37.266 app[***] ord [info] INFO Umounting /dev/vdc from /root/.ollama
2024-08-24T00:27:37.268 app[***] ord [info] WARN could not unmount /rootfs: EINVAL: Invalid argument
2024-08-24T00:27:37.269 app[***] ord [info] [ 3.718685] reboot: Power down
Any ideas on what would cause this?
I got it, I had to play around with the memory sizes
@@TheloniousBird Which memory size? Can you explain?
@@dareljohnson5770 In the fly.toml, under vm -> memory, I had to set it to 16 where it was originally set to 8.
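For anyone hitting the same error, that change would look roughly like this in fly.toml (a sketch assuming Fly's [[vm]] section format and gigabyte units; the exact value you need depends on the model you're loading, so check Fly's docs for your config layout):

[[vm]]
  memory = "16gb"  # was "8gb"; the machine was running out of memory loading the model

Redeploying after bumping the memory should let the main process survive model load instead of exiting with code 1 as in the log above.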