Nuno P. Lopes
Machine learning (ML) models keep getting larger and more complex. Whereas models used to be represented as static data-flow graphs, they are now implemented via arbitrary Python code. Eager-mode frameworks, such as PyTorch, are now the standard for developing new ML models. The semantics of eager-mode frameworks is that operations are executed straight away. This greatly simplifies the development process, and it enables more dynamic ML models.
Although eager-mode frameworks are more convenient, they are currently less efficient, as operations are dispatched to the hardware one at a time. This execution model precludes, for example, operation fusion, which is essential for executing ML workloads efficiently.
In this paper we present Torchy, a tracing JIT compiler for PyTorch. Torchy achieves performance similar to that of data-flow frameworks, while retaining the straight-away execution semantics of eager mode. Moreover, Torchy works with any PyTorch program, unmodified. Torchy outperforms PyTorch by up to 12x in microbenchmarks, and PyTorch's static compiler (TorchScript) by up to 5x.
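To illustrate the deferred-execution idea behind a tracing JIT, the sketch below records operations into a trace instead of executing them eagerly, and only executes the trace when a concrete value is demanded. This is a minimal, hypothetical illustration in plain Python; the class and method names are invented and do not reflect Torchy's actual implementation, which compiles and fuses the recorded trace rather than interpreting it.

```python
# Minimal sketch of deferred ("lazy") execution behind a tracing JIT.
# Operations on LazyTensor are recorded as a trace instead of being
# dispatched immediately; the trace is flushed only when a concrete
# value is needed. Illustrative only, not Torchy's API.

class LazyTensor:
    def __init__(self, value=None, op=None, args=()):
        self.value = value      # concrete data, if already computed
        self.op = op            # recorded operation name
        self.args = args        # operand LazyTensors

    def __add__(self, other):
        return LazyTensor(op="add", args=(self, other))

    def __mul__(self, other):
        return LazyTensor(op="mul", args=(self, other))

    def materialize(self):
        # Flush point: execute the recorded trace. A real tracing JIT
        # would compile the whole trace here, fusing operations into
        # fewer hardware kernels, instead of interpreting it node by node.
        if self.value is None:
            vals = [a.materialize() for a in self.args]
            if self.op == "add":
                self.value = vals[0] + vals[1]
            elif self.op == "mul":
                self.value = vals[0] * vals[1]
        return self.value

a = LazyTensor(2)
b = LazyTensor(3)
c = (a + b) * a          # nothing executed yet; trace is recorded
print(c.materialize())   # trace flushed on demand → 10
```

Because the user only observes values at flush points, this scheme preserves eager-mode semantics while giving the compiler a whole trace to optimize at once.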
N. P. Lopes. Torchy: A Tracing JIT Compiler for PyTorch. In Proc. of the ACM SIGPLAN 2023 International Conference on Compiler Construction (CC), Feb. 2023.
@inproceedings{torchy-cc23,
  title     = {Torchy: A Tracing {JIT} Compiler for {PyTorch}},
  author    = {Nuno P. Lopes},
  booktitle = {Proc. of the ACM SIGPLAN 2023 International Conference on Compiler Construction (CC)},
  doi       = {10.1145/3578360.3580266},
  month     = feb,
  year      = 2023
}
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.