
Lightning Talk: Exploring PiPPY, Tensor Parallel and Torchserve for Large Model Inference

Description

This talk covers large model inference with TorchServe using PiPPy and Tensor Parallel, the challenges of distributed inference and the solutions available today, and the features TorchServe currently provides for serving LLMs in production.
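As a rough illustration of the tensor-parallel approach mentioned above (not code from the talk), the following is a minimal sketch using PyTorch's torch.distributed.tensor.parallel API, assuming PyTorch 2.2 or later and a launch such as `torchrun --nproc-per-node=2 tp_example.py`; the MLP module and its layer names are hypothetical.

```python
import os
import torch
import torch.nn as nn
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor.parallel import (
    parallelize_module,
    ColwiseParallel,
    RowwiseParallel,
)


class MLP(nn.Module):
    """Hypothetical two-layer block standing in for a transformer MLP."""

    def __init__(self, dim: int = 1024):
        super().__init__()
        self.up = nn.Linear(dim, 4 * dim)
        self.down = nn.Linear(4 * dim, dim)

    def forward(self, x):
        return self.down(torch.relu(self.up(x)))


def main():
    # Bind each rank to its own GPU (torchrun sets LOCAL_RANK / WORLD_SIZE).
    local_rank = int(os.environ.get("LOCAL_RANK", "0"))
    world_size = int(os.environ.get("WORLD_SIZE", "1"))
    torch.cuda.set_device(local_rank)

    # 1-D device mesh over all ranks in the job.
    mesh = init_device_mesh("cuda", (world_size,))

    model = MLP().cuda()

    # Shard the first linear column-wise and the second row-wise, so the
    # intermediate activation stays sharded and is all-reduced only once.
    model = parallelize_module(
        model,
        mesh,
        {"up": ColwiseParallel(), "down": RowwiseParallel()},
    )

    x = torch.randn(8, 1024, device="cuda")
    out = model(x)  # output is replicated across ranks
    print(out.shape)


if __name__ == "__main__":
    main()
```

Pipeline parallelism with PiPPy and the TorchServe handler configuration for large models follow the same general idea of splitting the model across devices; see the talk for how they fit together in production serving.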
