LLMOps: Deploying LLMs and Scaling using Modal, LangChain and Hugging Face

  • Published 26 Jun 2024
  • In this video, you'll learn about LLMOps, the practice of deploying and scaling LLMs using Modal, LangChain and Hugging Face.
    In the rapidly evolving domain of Large Language Models (LLMs), businesses and researchers grapple with the challenges of efficiently deploying, monitoring and scaling these models. The operational complexities, from infrastructure management to ensuring context-aware responses and efficient token streaming, pose significant obstacles.
    Addressing the complexities of deploying, monitoring, and scaling Large Language Models requires a blend of specialized tools and methodologies.
    Our approach is built upon the following tools:
    🤗 Hugging Face: Tapping into their expansive repository of pre-trained models to establish a robust foundational base for LLM deployment.
    🚀 vLLM: An open-source toolkit designed specifically for accelerated inference and seamless serving of LLMs, ensuring both speed and reliability.
    ☁️ Modal: This cloud code execution platform is key for abstracting away infrastructure complexities, enabling streamlined deployment and scaling of a wide range of LLMs, encompassing both proprietary and open-source variants.
    🦜🔗LangChain: A dynamic framework tailored for the development of LLM-driven applications. With its modular components and readily available chains, it empowers LLMs to be both context-aware and adept at reasoning.
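    The prompt-template pattern that makes LangChain applications context-aware can be sketched with the standard library alone. This is a hypothetical stand-in to illustrate the idea, not LangChain's actual API; the function and template names are invented for illustration:

    ```python
    def build_prompt(template: str, **values: str) -> str:
        """Substitute named fields into a template -- a minimal stand-in for
        the prompt-template pattern that frameworks like LangChain provide."""
        return template.format(**values)

    # A context-aware QA template: retrieved context is injected alongside
    # the user's question so the model can ground its answer.
    QA_TEMPLATE = "Context: {context}\n\nQuestion: {question}\nAnswer:"

    prompt = build_prompt(
        QA_TEMPLATE,
        context="Modal abstracts away GPU infrastructure.",
        question="What does Modal abstract away?",
    )
    print(prompt)
    ```

    In a real LangChain application the same substitution is handled by a prompt-template object that is then chained to a model, but the underlying idea — composing external context into the model input — is exactly this.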
    By integrating these tools and techniques, we deliver a comprehensive solution that simplifies LLM operations, from deployment to scaling, while maintaining consistently strong performance without compromising functionality.
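    As a rough illustration of how the pieces could fit together, here is a hedged deployment sketch: a Modal function whose container image installs vLLM, loads a Hugging Face model, and generates text on a cloud GPU. The app name, model id, and GPU type are assumptions chosen for illustration, not the code from the linked repository:

    ```python
    import modal

    # Hypothetical app and image definitions; the actual repo may differ.
    app = modal.App("llmops-demo")
    image = modal.Image.debian_slim().pip_install("vllm")

    @app.function(image=image, gpu="A10G", timeout=600)
    def generate(prompt: str) -> str:
        # vLLM manages batching and KV-cache reuse for fast inference.
        from vllm import LLM, SamplingParams

        llm = LLM(model="mistralai/Mistral-7B-Instruct-v0.2")  # any HF model id
        params = SamplingParams(temperature=0.7, max_tokens=128)
        outputs = llm.generate([prompt], params)
        return outputs[0].outputs[0].text

    @app.local_entrypoint()
    def main():
        # Runs locally; Modal executes `generate` remotely on the GPU.
        print(generate.remote("Explain LLMOps in one sentence."))
    ```

    In this pattern, Modal owns the infrastructure (container image, GPU, scaling), vLLM owns fast serving of the Hugging Face model, and a LangChain client can then consume the deployed endpoint.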
    ----------------------------
    Resources:
    💻 Code: github.com/Blaizzy/LLMOps
    📋 Presentation Slides: docs.google.com/presentation/...
  • Science & Technology
