Lightning Talk: The Fastest Path to Production: PyTorch Inference in Python - Mark Saroufim, Meta

  • Published Oct 1, 2024
Historically, inference users have had to rewrite their models to be jit-scriptable, which required model rewrites and familiarity with C++ services. This is frustrating, especially since the vast majority of real-world PyTorch users actually deploy Python in production. When torch.compile was introduced, it encouraged a UX of gradual model rewrites to optimize models, but users would get value even without any. A C++-based option still represents a steep difficulty jump, and torch.compile still suffers from long compile times, which make it unsuited for server-side inference where cold-start times are critical. In this talk we introduce the options users have for the quickest possible path to production, including new APIs to cache compilation artifacts across devices, so users can compile models once for both training and inference, and Python bindings for AOT Inductor. We'll end with some real-world case studies inspired by users who faced the above problems in the context of TorchServe. By that point we hope you'll be fully convinced that it's possible to deploy Python in production and retain performance.
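For illustration, here is a minimal sketch of the two Python-side paths the abstract refers to: just-in-time compilation with torch.compile, and ahead-of-time compilation via the AOT Inductor Python bindings. The toy model, input shapes, and the torch._export.aot_compile / aot_load entry points are assumptions based on recent PyTorch releases, not details taken from the talk.

```python
import torch

# A toy model standing in for whatever you actually serve (assumed, not from the talk).
class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(16, 4)

    def forward(self, x):
        return torch.relu(self.linear(x))

model = TinyModel().eval()
example_inputs = (torch.randn(2, 16),)

# Path 1: torch.compile. No model rewrite needed, but compilation happens
# lazily on the first call, which is where server-side cold starts hurt.
compiled = torch.compile(model)
with torch.no_grad():
    jit_out = compiled(*example_inputs)

# Path 2: AOT Inductor from Python. Compile once offline to a shared library,
# then load and run it from a plain Python process at serving time.
# These entry points assume a recent PyTorch 2.x; check your version's docs.
so_path = torch._export.aot_compile(model, example_inputs)
runner = torch._export.aot_load(so_path, device="cpu")
with torch.no_grad():
    aot_out = runner(*example_inputs)

print(torch.allclose(jit_out, aot_out, atol=1e-5))
```

In this sketch the AOT path pays its compilation cost once, up front, so a Python serving process can load the prebuilt artifact and answer its first request without the warm-up penalty of the torch.compile path.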
