Self-Hosted LLM Agent on Your Own Laptop or Edge Device - Michael Yuan, Second State

The Linux Foundation

108 views


Published Sep 15, 2024
Don't miss out! Join us at our upcoming conference: Open Source Summit + AI_Dev: Open Source GenAI & ML Summit in Tokyo from October 28-29, 2024. Connect with peers as the community gathers to further the education and advancement of open source and GenAI. Learn more at events.linuxfo...
As LLM applications evolve from chatbots to copilots to AI agents, the need for privacy, customization, cost control, and value alignment keeps growing. Running open-source LLMs and agents on personal or private devices is a great way to meet those needs. With the release of a new generation of open-source LLMs, such as Llama 3, the gap between open-source and proprietary LLMs is narrowing fast. In many cases, open-source LLMs already outperform SaaS-based proprietary LLMs. For AI agents, open-source LLMs are not just cheaper and more private: they also allow customization through fine-tuning and RAG prompt engineering using private data. This talk shows you how to build a complete AI agent service using an open-source LLM and a personal knowledge base. We will use the open-source WasmEdge + Rust stack for LLM inference, which is fast and lightweight and avoids complex Python dependencies. It is cross-platform and achieves native performance across operating systems, CPUs, and GPUs.
