[ET-VK][ops] Add bitwise_or / logical_or operators by SS-JIA · Pull Request #20382 · pytorch/executorch

SS-JIA · 2026-06-18T15:03:30Z

Stack from ghstack (oldest at bottom):

Adds Vulkan support for aten.bitwise_or.Tensor and aten.logical_or.default, mirroring the existing bitwise_and / logical_and implementation. This is the first of two ops needed to collapse the Llama4-mini TISO en_US backbone export to a single Vulkan partition: the discrete-speech mask OR-s several bool tensors via bitwise_or, which previously had no Vulkan implementation and forced a CPU fallback that split the delegated graph.

Implementation mirrors bitwise_and: a X | Y uint8 shader variant in binary_op_buffer.yaml / binary_op_texture.yaml, a DEFINE_BINARY_OP_FN(bitwise_or) dispatch with VK_REGISTER_OP for both aten.bitwise_or.Tensor and aten.logical_or.default in BinaryOp.cpp, and register_bitwise_or / register_logical_or OpFeatures (bool inputs) in op_registry.py.

This change was authored with Claude.

Differential Revision: D108457794

[ghstack-poisoned]

pytorch-bot · 2026-06-18T15:03:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20382

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 6dc4f6b with merge base 1227757 ():

NEW FAILURE - The following job has failed:

pull / unittest-nxp-neutron / linux-job (gh)
RuntimeError: Command docker exec -t 86877c7b324adfcd22e48c0fb7db4261e012a9f32269e76122bdce1d21732de6 /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2026-06-18T15:03:38Z

❌ The email address for the commit (529dd9a, 6dc4f6b, 7e0aceb, d0a67c3, e34f63e) is not linked to the GitHub account, preventing the EasyCLA check. Consult this Help Article and GitHub Help to resolve. (To view the commit's email address, add .patch at the end of this PR page's URL.) For further assistance with EasyCLA, please visit our EasyCLA portal and chat with our support bot.

github-actions · 2026-06-18T15:04:34Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

[ghstack-poisoned]

metascroy · 2026-06-26T18:40:23Z

@claude review this code

claude · 2026-06-26T18:41:04Z

Claude finished @metascroy's task in 1m 9s —— View job

Code Review: bitwise_or / logical_or

Read PR diff
Analyze shader/registry/test changes
Post review

Reviewed against origin/gh/SS-JIA/561/base. This is a clean, well-scoped change that faithfully mirrors the existing bitwise_and path. LGTM — no blocking issues.

Correctness ✅

BinaryOp.cpp: DEFINE_BINARY_OP_FN(bitwise_or) + the two VK_REGISTER_OP lines for aten.bitwise_or.Tensor and aten.logical_or.default are correct. Routing logical_or to the same uint8 X | Y shader is valid because the registry constrains inputs to BOOL_T (0/1), so bitwise-or and logical-or are equivalent on those values — same reasoning the existing logical_and → bitwise_and mapping relies on.
Shaders (binary_op_buffer.yaml / binary_op_texture.yaml): the X | Y uint8 variants match the bitwise_and pattern exactly. Kernel name (binary_bitwise_or_{buffer,texture3d}) lines up with what add_binary_op_node constructs at BinaryOp.cpp:76-80.

Nice improvements 👍

op_registry.py: consolidating five separate register_* functions into a single register_bool_binary_ops with a list of edge ops is a good simplification (net −12 lines) and keeps bitwise_not correctly grouped despite being unary, since it shares the same OpFeatures.
cases.py: dropping test_suite.data_gen = "make_seq_tensor" is a genuine test-quality fix, not just cosmetic. For at::kBool, make_seq_tensor fills values 1, 2, 3, … (all nonzero → all true), so OR/AND tests were effectively true | true everywhere. Reverting to the default make_rand_tensor (rand() > 0.5, see gen_correctness_base.py:315-316) exercises a real mix of true/false. The two added broadcasting shapes ((1, M2), (M1, M2)) and ((XS, 1, S1, 1), (1, S, 1, S2)) also broaden coverage — and since the suite is now shared, bitwise_and benefits from both changes too.

Minor observations (non-blocking)

The shared test suite means aten.logical_and.default is not in the @register_test_suite([...]) list, even though it's registered as an op. It was already untested before this PR, so no regression — just noting it for completeness if you want symmetric coverage with logical_or.
The CI failure (unittest-nxp-neutron) is in the NXP/Neutron backend and unrelated to these Vulkan changes.

Update

e34f63e

[ghstack-poisoned]

SS-JIA mentioned this pull request Jun 18, 2026

[ET-VK][patterns] Fuse torchao 4-bit quantized embedding to embedding_q4gsw #20381

Open

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 18, 2026

SS-JIA mentioned this pull request Jun 18, 2026

[ET-VK][ops] Add eq.Scalar operator #20383

Open

SS-JIA temporarily deployed to cadence June 18, 2026 15:03 — with GitHub Actions Inactive

meta-codesync Bot added the meta-exported label Jun 18, 2026

Update

529dd9a

[ghstack-poisoned]

SS-JIA mentioned this pull request Jun 24, 2026

[ET-VK][quantized] Store dq8ca per-token zero-point as fp32 #20491

Open

SS-JIA temporarily deployed to cadence June 24, 2026 18:04 — with GitHub Actions Inactive

Update

7e0aceb

[ghstack-poisoned]

SS-JIA temporarily deployed to cadence June 24, 2026 21:43 — with GitHub Actions Inactive

Update

d0a67c3

[ghstack-poisoned]

SS-JIA temporarily deployed to cadence June 26, 2026 04:39 — with GitHub Actions Inactive

Update

6dc4f6b

[ghstack-poisoned]

SS-JIA temporarily deployed to cadence June 26, 2026 16:37 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ET-VK][ops] Add bitwise_or / logical_or operators#20382

[ET-VK][ops] Add bitwise_or / logical_or operators#20382
SS-JIA wants to merge 5 commits into
gh/SS-JIA/561/basefrom
gh/SS-JIA/561/head

SS-JIA commented Jun 18, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Jun 18, 2026 •

edited

Loading

Uh oh!

linux-foundation-easycla Bot commented Jun 18, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 18, 2026

Uh oh!

metascroy commented Jun 26, 2026

Uh oh!

claude Bot commented Jun 26, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

SS-JIA commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20382

❌ 1 New Failure

Uh oh!

linux-foundation-easycla Bot commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 18, 2026

This PR needs a release notes: label

Uh oh!

metascroy commented Jun 26, 2026

Uh oh!

claude Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: bitwise_or / logical_or

Correctness ✅

Nice improvements 👍

Minor observations (non-blocking)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SS-JIA commented Jun 18, 2026 •

edited

Loading

pytorch-bot Bot commented Jun 18, 2026 •

edited

Loading

linux-foundation-easycla Bot commented Jun 18, 2026 •

edited

Loading

This PR needs a `release notes:` label

claude Bot commented Jun 26, 2026 •

edited

Loading