Qualcomm AI Engine Direct - Enable more HF LLM Model by winskuo-quic · Pull Request #20587 · pytorch/executorch

winskuo-quic · 2026-06-29T03:18:52Z

Summary

Enable more HF LLM models that are also tested on Optimum-ExecuTorch.

| Model (`--decoder_model`) | Use `--enable_spinquant_r3`? | Sample Prompt |
| --- | --- | --- |
| `qwen2_5-0_5b` | ✅ Yes | "My favourite condiment is " |
| `qwen3-0_6b` | ✅ Yes | "Give me a short introduction to large language model." |
| `smollm2_135m` | ✅ Yes | "My favourite condiment is " |
| `gemma-2b` | ✅ Yes | "Hello I am doing" |
| `llama3_2-1b` | ✅ Yes | "Simply put, the theory of relativity states that" |

Sample Script
python examples/qualcomm/oss_scripts/hf_causal_lm.py --ptq 16a8w --prompt "My favourite condiment is " --soc_model SM8750 --device $DEVICE_ID --build_folder build-android/ --decoder_model qwen2_5-0_5b --enable_spinquant_r3
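
To try every model in the table rather than just `qwen2_5-0_5b`, the sample command can be generated per model. The following is a minimal sketch, not part of this PR: `MODEL_PROMPTS` and `build_command` are hypothetical names, the flags are copied verbatim from the sample script above, and it assumes the same `--ptq 16a8w` setting applies to every listed model. It only prints the commands and does not run them.

```python
import os
import shlex

# Model -> sample prompt, copied from the table in the PR description.
MODEL_PROMPTS = {
    "qwen2_5-0_5b": "My favourite condiment is ",
    "qwen3-0_6b": "Give me a short introduction to large language model.",
    "smollm2_135m": "My favourite condiment is ",
    "gemma-2b": "Hello I am doing",
    "llama3_2-1b": "Simply put, the theory of relativity states that",
}


def build_command(decoder_model, device_id,
                  soc_model="SM8750", build_folder="build-android/"):
    """Return the argv list for one model (hypothetical helper).

    Only flags that appear in the sample script are used. Every model in
    the table is marked "Yes" for --enable_spinquant_r3, so the flag is
    always appended.
    """
    return [
        "python", "examples/qualcomm/oss_scripts/hf_causal_lm.py",
        "--ptq", "16a8w",  # assumption: same PTQ setting for every model
        "--prompt", MODEL_PROMPTS[decoder_model],
        "--soc_model", soc_model,
        "--device", device_id,
        "--build_folder", build_folder,
        "--decoder_model", decoder_model,
        "--enable_spinquant_r3",
    ]


if __name__ == "__main__":
    # DEVICE_ID mirrors the $DEVICE_ID variable in the sample script.
    device_id = os.environ.get("DEVICE_ID", "<device-id>")
    for model in MODEL_PROMPTS:
        # shlex.join quotes the prompt so its spaces survive copy-paste.
        print(shlex.join(build_command(model, device_id)))
```

Printing rather than executing keeps the sketch safe to run on a machine without a connected device; each printed line can be pasted into a shell once `DEVICE_ID` is set.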

Test plan

python backends/qualcomm/tests/test_qnn_delegate.py TestExampleLLMScript.test_hf_causal_lm --device $DEVICE_ID --soc_model SM8750 --build_folder build-android --executorch_root . --artifact_dir ./hf_qwen

pytorch-bot · 2026-06-29T03:18:57Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20587

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Unrelated Failure, 3 Unclassified Failures

As of commit 2cb77c1 with merge base 05b977d:

NEW FAILURES - The following jobs have failed:

pull / unittest / linux / linux-job (gh)
RuntimeError: Command docker exec -t 6923716f46be3e4415652d70056e7858288e55b828b6b037bfc8dd8c788ba8f4 /exec failed with exit code 1
pull / unittest / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1
pull / unittest-editable / linux / linux-job (gh)
RuntimeError: Command docker exec -t 09583de59cc503a47b84f0725b78a9a9c5a790aa9cf9a5a7a8da9ae68250410c /exec failed with exit code 1
pull / unittest-editable / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1

UNCLASSIFIED FAILURES - DrCI could not classify the following jobs because the workflow did not run on the merge base. The failures may be pre-existing on trunk or introduced by this PR:

Build Aarch64 Linux Wheels / pytorch/executorch / build-wheel-py3_10-cpu-aarch64 (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
/__w/executorch/executorch/pytorch/executorch/backends/apple/coreml/runtime/inmemoryfs/inmemory_filesystem.cpp:722:48: error: ‘inmemoryfs::InMemoryFileSystem::InMemoryNode::Kind’ has not been declared
Build Aarch64 Linux Wheels / pytorch/executorch / upload / upload-wheel-py3_10-cpu-aarch64 (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
Unable to download artifact(s): Artifact not found for name: pytorch_executorch__3.10_cpu_aarch64
pull / test-arm-backend-public-api-backward-compatibility / linux-job (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
RuntimeError: Command docker exec -t 4530942d6826035f65cfc66b7cfc07988855af413831b03680f8450537408cac /exec failed with exit code 127

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-qnn-testsuite-linux / test-backend-linux (qnn, models) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-06-29T03:20:10Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with `release notes:`. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Qualcomm AI Engine Direct - Enable more HF LLM Model

2cb77c1

winskuo-quic requested review from abhinaykukkadapu and psiddh as code owners June 29, 2026 03:18

meta-cla Bot added the CLA Signed label Jun 29, 2026

winskuo-quic wants to merge 1 commit into pytorch:main from CodeLinaro:dev1/winskuo/hf_llm_enablement
