PyTorch 2.0: Faster machine learning through dynamic Python bytecode translation
Speaker:
Jason Ansel
Meta AI
Host:
Professor Saman Amarasinghe
MIT CSAIL
Abstract:
PyTorch 2.0 uses compilers to deliver faster training and inference
without sacrificing the usability and flexibility PyTorch is known
for. PyTorch 2.0 is fully backwards compatible and continues to
provide an interactive, extensible, easy-to-debug, and Pythonic
programming environment for AI researchers, data scientists, and
engineers. Across 163 models in our benchmark set, we find 93% model
coverage and a geometric mean training speedup of approximately 43% on
an NVIDIA A100.
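The headline API is a single call, torch.compile. A minimal sketch,
using a toy model of our own for illustration:

    import torch

    # Any existing nn.Module works unmodified; this two-layer model is
    # purely illustrative.
    model = torch.nn.Sequential(
        torch.nn.Linear(128, 256),
        torch.nn.ReLU(),
        torch.nn.Linear(256, 10),
    )

    # torch.compile is the one-line entry point added in PyTorch 2.0.
    compiled = torch.compile(model)

    x = torch.randn(32, 128)
    out = compiled(x)  # first call triggers JIT compilation; later calls reuse it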
This talk will cover key technologies behind PyTorch 2.0: TorchDynamo
and TorchInductor. TorchDynamo is a Python-level JIT compiler
designed to make unmodified PyTorch programs faster. TorchDynamo hooks
into the frame evaluation API in CPython to dynamically modify Python
bytecode right before it is executed. It rewrites the bytecode to
extract sequences of PyTorch operations into an FX graph, which is
then just-in-time compiled by one of many extensible backends. It
creates this graph through bytecode analysis and is designed to
generate smaller graph fragments that can be mixed with Python
execution to get the best of both worlds: usability and performance.
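Because the backend interface is extensible, one way to see the
captured graphs is to pass a custom backend to torch.compile. In this
sketch (the backend name is ours), the backend prints each FX graph
fragment TorchDynamo extracts and then simply runs it eagerly; the
data-dependent branch forces a graph break, so more than one fragment
is captured:

    import torch

    def inspect_backend(gm: torch.fx.GraphModule, example_inputs):
        # TorchDynamo hands each captured graph fragment to the backend.
        gm.graph.print_tabular()
        # Returning the unmodified forward just runs the fragment eagerly.
        return gm.forward

    @torch.compile(backend=inspect_backend)
    def fn(x):
        y = torch.sin(x) + torch.cos(x)
        # A data-dependent branch cannot be traced into the graph, so
        # Dynamo breaks here and resumes capture on each side.
        if y.sum() > 0:
            return y * 2
        return y - 1

    fn(torch.randn(8))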
TorchInductor is the new compiler backend included in PyTorch 2.0. It
translates PyTorch programs into OpenAI's Triton for GPUs and
OpenMP/C++ for CPUs. TorchInductor handles the flexibility and
dynamism of PyTorch by using abstractions similar to those of PyTorch
eager mode. It introduces a new define-by-run, loop-level intermediate
representation (IR) that makes it easy to add new operator lowerings.
Finally, it is implemented in Python, so it is easy for PyTorch users
to extend and modify to meet their needs.
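TorchInductor is the default backend for torch.compile, so most users
get it without naming it; a minimal sketch that selects it explicitly
and opts into heavier autotuning:

    import torch

    def pointwise(x):
        # A chain of pointwise ops that Inductor can fuse into a single
        # Triton kernel on GPU (or a vectorized OpenMP/C++ loop on CPU).
        return torch.nn.functional.relu(x) * torch.sigmoid(x)

    # backend="inductor" is the default; mode="max-autotune" asks Inductor
    # to spend extra compile time searching for faster kernel configurations.
    fast = torch.compile(pointwise, backend="inductor", mode="max-autotune")
    print(fast(torch.randn(1024, 1024)).shape)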
Bio:
Jason Ansel is a Principal Research Scientist at Meta AI and a
technical lead for PyTorch compilers. He started the TorchDynamo and
TorchInductor projects, which bring flexible graph capture and a
high-performance compiler to PyTorch 2.0. He received a Ph.D. from MIT
CSAIL in 2014 (advisor Saman Amarasinghe) with research focusing on
the boundary of machine learning, compilers, and programming
languages.
Zoom Link: https://mit.zoom.us/j/92332482198