Name	Last commit message	Last commit date
cpu	[ROCm][Windows] Fix clang-cl error related to -Wmissing prototypes en…	Feb 18, 2025
cuda	[codemod] Fix unused-value issue in caffe2/aten/src/ATen/cuda/detail/…	Mar 1, 2025
README.md	Forbid trailing whitespace (pytorch#53406)	Mar 6, 2021
arg_spec.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
codegen.cpp	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
codegen.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
compiler.cpp	Revert "[Environment Variable][7/N] Use thread-safe getenv functions (p…	Feb 3, 2025
compiler.h	[22/N] Fix clang-tidy warnings in jit (pytorch#134829)	Sep 19, 2024
executor.cpp	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
executor.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
fallback.cpp	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
fallback.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
fused_kernel.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
interface.cpp	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
interface.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
kernel_cache.cpp	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
kernel_cache.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
kernel_spec.h	Fix clang-tidy warnings in torch/jit (pytorch#146963)	Feb 15, 2025
partition_desc.h	[21/N] Fix clang-tidy warnings in jit (pytorch#134537)	Aug 28, 2024
tensor_desc.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024
tensor_info.h	[20/N] Fix clang-tidy warnings in jit (pytorch#133399)	Aug 26, 2024

PyTorch Fuser

The fuser accepts subgraphs wrapped in "fusion nodes" and tries to execute them by just-in-time (JIT) compiling kernels that run all the graph operations.
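To make the benefit of fusion concrete, here is a minimal hand-written sketch that contrasts two elementwise operations run as separate passes with the same computation run as one fused loop. It is only an illustration of the idea, not the code the fuser generates; the function names `unfused_mul_add` and `fused_mul_add` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Unfused: each operation is its own pass over memory and the
// intermediate product is materialized in a temporary buffer.
std::vector<float> unfused_mul_add(const std::vector<float>& a,
                                   const std::vector<float>& b,
                                   const std::vector<float>& c) {
  assert(a.size() == b.size() && b.size() == c.size());
  std::vector<float> tmp(a.size());
  for (std::size_t i = 0; i < a.size(); ++i) {
    tmp[i] = a[i] * b[i];
  }
  std::vector<float> out(a.size());
  for (std::size_t i = 0; i < a.size(); ++i) {
    out[i] = tmp[i] + c[i];
  }
  return out;
}

// Fused: a single pass computes both operations per element,
// so no temporary buffer is allocated or re-read.
std::vector<float> fused_mul_add(const std::vector<float>& a,
                                 const std::vector<float>& b,
                                 const std::vector<float>& c) {
  assert(a.size() == b.size() && b.size() == c.size());
  std::vector<float> out(a.size());
  for (std::size_t i = 0; i < a.size(); ++i) {
    out[i] = a[i] * b[i] + c[i];
  }
  return out;
}
```

Both functions compute the same values for inputs of equal size; the fused form simply touches memory once per element, which is the kind of saving a fused kernel aims for.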

Code Organization

The fuser is designed hierarchically, with device-independent logic eventually deferring to device-specific logic and implementation. The device-specific code is (mostly) found in each device's subdirectory. The device-independent logic has six components:

The Interface (interface.h/cpp) has functions to register and run fusions, interrogate fusion functionality, and perform debugging.
The Compiler (compiler.h/cpp) performs "upfront" and "runtime" compilation. When fusions are registered, upfront compilation produces fallback code and performs some shape inference. When a fusion is run, runtime compilation invokes code generation and the device-specific compilation logic.
The Code Generator (codegen.h/cpp) produces the string to be compiled on the device.
The Executor (executor.h/cpp) runs requested fusions. It performs shape inference, expands tensors as necessary, determines the device to run on, acquires a cached compiled kernel or requests that the Compiler produce a new one, invokes device-specific code to launch the kernel, and updates the stack.
The Fallback (fallback.h/cpp) runs subgraphs that can't be fused because shape inference didn't determine a common tensor size or the device the tensors are on doesn't support fusion.
The Kernel Specification Cache (kernel_cache.h/cpp) is a thread-safe cache holding the device-independent specifications produced during upfront compilation. These specifications each have their own thread-safe stores of compiled kernels that the Executor checks before requesting runtime compilation.
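The two-level caching described above (a thread-safe cache of specifications, each owning its own thread-safe store of compiled kernels) can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the contents of kernel_cache.h or kernel_spec.h: the types `ToyKernel`, `ToySpec`, and `ToySpecCache`, their member functions, and the use of a string as the argument key are all hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <memory>
#include <mutex>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for a compiled kernel.
struct ToyKernel {
  std::string source;
};

// Hypothetical stand-in for a device-independent specification.
// Each spec owns a mutex-guarded store of kernels compiled for it.
class ToySpec {
 public:
  // Returns the kernel cached under `arg_key`, or calls `compile` on a
  // miss and caches the result. Holding the lock while compiling keeps
  // the sketch simple; it also serializes compilations for this spec.
  std::shared_ptr<ToyKernel> get_or_compile(
      const std::string& arg_key,
      const std::function<std::shared_ptr<ToyKernel>()>& compile) {
    std::lock_guard<std::mutex> guard(mutex_);
    auto it = kernels_.find(arg_key);
    if (it != kernels_.end()) {
      return it->second;
    }
    std::shared_ptr<ToyKernel> kernel = compile();
    assert(kernel != nullptr);
    kernels_.emplace(arg_key, kernel);
    return kernel;
  }

 private:
  std::mutex mutex_;
  std::unordered_map<std::string, std::shared_ptr<ToyKernel>> kernels_;
};

// Hypothetical top-level cache mapping integer keys to specifications.
class ToySpecCache {
 public:
  // Registers a new specification and returns its key.
  std::int64_t store() {
    std::lock_guard<std::mutex> guard(mutex_);
    const std::int64_t key = next_key_++;
    specs_.emplace(key, std::make_shared<ToySpec>());
    return key;
  }

  // Returns the specification for `key`, or nullptr if it is unknown.
  std::shared_ptr<ToySpec> retrieve(std::int64_t key) {
    std::lock_guard<std::mutex> guard(mutex_);
    auto it = specs_.find(key);
    if (it == specs_.end()) {
      return nullptr;
    }
    return it->second;
  }

 private:
  std::mutex mutex_;
  std::int64_t next_key_ = 0;
  std::unordered_map<std::int64_t, std::shared_ptr<ToySpec>> specs_;
};
```

In this sketch, registering a fusion corresponds to `store()`, and running one corresponds to `retrieve(key)` followed by `get_or_compile(...)`, so the compile callback runs only the first time a given argument key is seen.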

The device-specific components have logic for compiling and running code in FusedKernelCPU (cpu/fused_kernel.h/cpp) and FusedKernelCUDA (cuda/fused_kernel.h/cpp).
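The split between device-independent and device-specific code can be pictured with the following sketch: a device-independent step builds kernel source as a string, and a small factory hands it to a device-specific class behind a common interface. Everything here is a hypothetical simplification for illustration; `generate_source`, `ToyFusedKernel`, `ToyCpuKernel`, `ToyCudaKernel`, and `compile_for` are not names from the fuser, and no real compilation takes place.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical device-independent step: emit kernel source as a string.
std::string generate_source(const std::string& name, const std::string& expr) {
  assert(!name.empty());
  return "void " + name +
         "(const float* a, const float* b, float* out, int n) {\n"
         "  for (int i = 0; i < n; ++i) out[i] = " + expr + ";\n"
         "}\n";
}

// Hypothetical common interface that device-independent code talks to.
class ToyFusedKernel {
 public:
  explicit ToyFusedKernel(std::string source) : source_(std::move(source)) {}
  virtual ~ToyFusedKernel() = default;
  virtual std::string backend() const = 0;
  const std::string& source() const { return source_; }

 private:
  std::string source_;
};

// Hypothetical device-specific implementations. A real one would compile
// and launch the source; these only record which backend was chosen.
class ToyCpuKernel : public ToyFusedKernel {
 public:
  using ToyFusedKernel::ToyFusedKernel;
  std::string backend() const override { return "cpu"; }
};

class ToyCudaKernel : public ToyFusedKernel {
 public:
  using ToyFusedKernel::ToyFusedKernel;
  std::string backend() const override { return "cuda"; }
};

// Device-independent entry point that defers to device-specific code.
// Returns nullptr for an unsupported device, in which case a caller
// would take the fallback path instead.
std::unique_ptr<ToyFusedKernel> compile_for(const std::string& device,
                                            const std::string& source) {
  if (device == "cpu") {
    return std::make_unique<ToyCpuKernel>(source);
  }
  if (device == "cuda") {
    return std::make_unique<ToyCudaKernel>(source);
  }
  return nullptr;
}
```

A caller might write `compile_for("cpu", generate_source("kernel_0", "a[i] * b[i]"))` and treat a null result as the signal to run the unfused fallback.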
