[ROCm] better AMD CDNA4 and RDNA4 support for VAE by Apophis3158 · Pull Request #13411 · Comfy-Org/ComfyUI

Apophis3158 · 2026-04-14T23:25:11Z

Better support for AMD latest GPU arches: RDNA4 (gfx1200, gfx1201) and CDNA4 (gfx950), determined based on fp8 support.

Reference: __hip_fp8_e4m3 and __hip_fp8_e5m2 supports in HIP C++ type implementation support table at AMD data types and precision support

coderabbitai · 2026-04-14T23:27:34Z

📝 Walkthrough

Walkthrough

The PR changes AMD-specific logic to depend on SUPPORT_FP8_OPS. In comfy/model_management.py, pytorch_attention_enabled_vae() now returns False for AMD only when SUPPORT_FP8_OPS is false; if SUPPORT_FP8_OPS is true, it defers to pytorch_attention_enabled(). In comfy/sd.py, VAE_KL_MEM_RATIO in VAE.__init__ is set to 2.73 only when is_amd() is true and SUPPORT_FP8_OPS is false; otherwise it uses the non-AMD value. No public signatures changed.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description check	✅ Passed	The description is directly related to the changeset, explaining the motivation for the changes and referencing AMD documentation about FP8 support.
Title check	✅ Passed	The title accurately reflects the main objective of the PR: adding improved support for AMD GPU architectures CDNA4 and RDNA4 through conditional FP8 operations handling.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Apophis3158 · 2026-04-15T23:39:36Z

Updated inline comments.

These two improvements has already been tested by ROCm users on Windows and Linux long time ago:

and more.

Most feedback is from gfx120x, so it's better to use SUPPORT_FP8_OPS for the restriction.

ComfyUI/comfy/model_management.py

Lines 409 to 411 in 1de83f9

    
           if torch_version_numeric >= (2, 7) and rocm_version >= (6, 4): 
        
               if any((a in arch) for a in ["gfx1200", "gfx1201", "gfx950"]):  # TODO: more arches, "gfx942" gives error on pytorch nightly 2.10 1013 rocm7.0 
        
                   SUPPORT_FP8_OPS = True

Apophis3158 requested review from Kosinkadink, comfyanonymous and guill as code owners April 14, 2026 23:25

better AMD CDNA4 and RDNA4 support

693ce0a

Apophis3158 force-pushed the master/rocm branch from b043989 to 693ce0a Compare April 15, 2026 19:38

Apophis3158 changed the title ~~[ROCm] better AMD CDNA4 and RDNA4 support~~ [ROCm] better AMD CDNA4 and RDNA4 support for VAE Apr 17, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ROCm] better AMD CDNA4 and RDNA4 support for VAE#13411

[ROCm] better AMD CDNA4 and RDNA4 support for VAE#13411
Apophis3158 wants to merge 1 commit intoComfy-Org:masterfrom
Apophis3158:master/rocm

Apophis3158 commented Apr 14, 2026

Uh oh!

coderabbitai bot commented Apr 14, 2026 •

edited

Loading

Walkthrough

❌ Failed checks (1 warning)

Uh oh!

Apophis3158 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Apophis3158 commented Apr 14, 2026

Uh oh!

coderabbitai bot commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

❌ Failed checks (1 warning)

Uh oh!

Apophis3158 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

coderabbitai bot commented Apr 14, 2026 •

edited

Loading