
TorchDynamo-based ONNX Exporter
===============================

.. automodule:: torch.onnx
  :noindex:

The ONNX exporter leverages the TorchDynamo engine to hook into Python's frame evaluation API and dynamically rewrite the model's bytecode into an FX graph. The resulting FX graph is then polished before it is finally translated into an ONNX graph.

The main advantage of this approach is that the FX graph is captured using bytecode analysis that preserves the dynamic nature of the model instead of using traditional static tracing techniques.

In addition, during the export process, memory usage is significantly reduced compared to the TorchScript-enabled exporter. See the :doc:`memory usage documentation <onnx_dynamo_memory_usage>` for more information.

The ONNX exporter depends on the extra Python packages ``onnx`` and ``onnxscript``. They can be installed through pip:

.. code-block:: bash

  pip install --upgrade onnx onnxscript

onnxruntime can then be used to execute the model on a large variety of processors.

Below is a demonstration of the exporter API in action, using a simple Multilayer Perceptron (MLP) as the example:

.. code-block:: python

  import torch
  import torch.nn as nn

  class MLPModel(nn.Module):
      def __init__(self):
          super().__init__()
          self.fc0 = nn.Linear(8, 8, bias=True)
          self.fc1 = nn.Linear(8, 4, bias=True)
          self.fc2 = nn.Linear(4, 2, bias=True)
          self.fc3 = nn.Linear(2, 2, bias=True)

      def forward(self, tensor_x: torch.Tensor):
          tensor_x = self.fc0(tensor_x)
          tensor_x = torch.sigmoid(tensor_x)
          tensor_x = self.fc1(tensor_x)
          tensor_x = torch.sigmoid(tensor_x)
          tensor_x = self.fc2(tensor_x)
          tensor_x = torch.sigmoid(tensor_x)
          output = self.fc3(tensor_x)
          return output

  model = MLPModel()
  tensor_x = torch.rand((97, 8), dtype=torch.float32)
  onnx_program = torch.onnx.export(model, (tensor_x,), dynamo=True)

As the code above shows, all you need is to provide :func:`torch.onnx.export` with an instance of the model and its input. The exporter will then return an instance of :class:`torch.onnx.ONNXProgram` that contains the exported ONNX graph along with extra information.

:meth:`torch.onnx.ONNXProgram.optimize` can be called to optimize the ONNX graph with constant folding and elimination of redundant operators. The optimization is done in-place.

.. code-block:: python

  onnx_program.optimize()

The in-memory model available through ``onnx_program.model_proto`` is an ``onnx.ModelProto`` object in compliance with the ONNX IR spec. The ONNX model may then be serialized into a Protobuf file using the :meth:`torch.onnx.ONNXProgram.save` API.

.. code-block:: python

  onnx_program.save("mlp.onnx")

You can view the exported model using `Netron <https://netron.app/>`_.

*MLP model as viewed using Netron*

If the export fails, :func:`torch.onnx.export` should be called a second time with the parameter ``report=True``. A Markdown report is then generated to help the user resolve the issue.

.. toctree::
    :hidden:

    onnx_dynamo_memory_usage

.. autofunction:: torch.onnx.export
.. autoclass:: torch.onnx.ONNXProgram
    :members:
.. autofunction:: is_in_onnx_export
.. autoclass:: torch.onnx.OnnxExporterError
    :members:
.. autofunction:: torch.onnx.enable_fake_mode

The following classes and functions are deprecated and will be removed.

.. autofunction:: torch.onnx.dynamo_export
.. autoclass:: torch.onnx.ExportOptions