Contribute Media
A thank you to everyone who makes this possible: Read More

Accelerating Explorations in Vision and Multimodal AI Using Pytorch Libraries

Description

PyTorch Libraries provide building blocks (data processing transforms, modeling components, loss functions, etc.) on top of PyTorch as well as examples and tutorials on how to use these building blocks for training SoTA Models. In this talk, we’ll provide insights into ongoing work to accelerate exploration in multimodal understanding and generative AI using TorchMultimodal. We'll also present TorchVision's new transforms API, with added support for image detection, segmentation, and video tasks.

Details

Improve this page