Add Context Parallel tutorial #3319
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/3319
Note: Links to docs will display an error until the docs builds have completed. ❌ 1 new failure as of commit 5872433 with merge base 7cb6915.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
out = F.scaled_dot_product_attention(*qkv, is_causal=True)

# make a clean copy of QKV for output comparison
cp_qkv = [t.detach().clone() for t in qkv]
Ahh, so this is not even needed for CP, just for the reference?
I wonder if it's better to delete the reference. That's more appropriate for a unit test; for an example, people usually want something minimal and copyable, and this line might distract them.
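For context, here is a minimal sketch of the pattern under discussion, assuming the context_parallel API linked in the PR description and the tutorial's mesh/rank/world_size setup; the shard-vs-reference comparison is an assumption for illustration, not the tutorial's verbatim code.

from torch.distributed.tensor.experimental import context_parallel

# Run SDPA under context parallelism on the cloned buffers; `mesh`,
# `world_size`, and `rank` are assumed from the tutorial's setup code.
with context_parallel(mesh, buffers=cp_qkv, buffer_seq_dims=[2, 2, 2]):
    cp_out = F.scaled_dot_product_attention(*cp_qkv, is_causal=True)

# Each rank computes the output for its own sequence shard (dim 2), so
# compare it against the matching slice of the single-device reference.
torch.testing.assert_close(cp_out, out.chunk(world_size, dim=2)[rank])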
Just some minor formatting fixes.
There is a wrong statement about pass-KV; we should change that.
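(For readers: "pass-KV" is the ring-attention communication strategy in which K/V shards are exchanged between ranks. A hedged sketch, assuming the private set_rotate_method helper in the torch.distributed.tensor.experimental._attention module this PR documents; as a prototype API it may change without notice.)

from torch.distributed.tensor.experimental._attention import set_rotate_method

# Choose how K/V shards rotate between ranks during ring attention;
# "allgather" and "alltoall" are the two strategies the tutorial covers.
set_rotate_method("alltoall")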
Two final comments. LGTM.
Can you please rebase onto main?
Summary: The compiled model run takes the same inputs as eager mode, so there is no need to explicitly pack the args into a tuple.
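A minimal sketch of what the summary describes, reusing the qkv tensors from the snippet above: a torch.compile-d function is invoked with exactly the same arguments as its eager counterpart.

compiled_sdpa = torch.compile(F.scaled_dot_product_attention)
# Same call signature as the eager call above; no tuple packing needed.
out = compiled_sdpa(*qkv, is_causal=True)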
This tutorial covers the Context Parallel API released in PyTorch 2.7 under torch.distributed.tensor.experimental._attention.
API: https://pytorch.org/docs/stable/distributed.tensor.html#torch.distributed.tensor.experimental.context_parallel
Tutorial preview: https://docs-preview.pytorch.org/pytorch/tutorials/3319/prototype/context_parallel.html
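For readers who don't open the preview, here is a self-contained sketch of the API being tutorialized, assuming the linked context_parallel signature (a device mesh plus buffers/buffer_seq_dims keyword arguments) and a two-GPU torchrun launch; shapes and variable names are illustrative, not the tutorial's verbatim code.

import torch
import torch.distributed as dist
import torch.nn.functional as F
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor.experimental import context_parallel

# Launched with e.g.: torchrun --nproc-per-node=2 cp_example.py
dist.init_process_group(backend="nccl")
rank, world_size = dist.get_rank(), dist.get_world_size()
torch.cuda.set_device(rank % torch.cuda.device_count())
mesh = init_device_mesh("cuda", (world_size,), mesh_dim_names=("cp",))

# Illustrative shapes: (batch, heads, seq_len, head_dim).
B, H, S, D = 2, 8, 4096, 64
qkv = [
    torch.randn(B, H, S, D, device="cuda", dtype=torch.bfloat16)
    for _ in range(3)
]

# Inside the context, Q/K/V are sharded along the sequence dim (dim 2)
# and scaled_dot_product_attention dispatches to the context-parallel
# ring-attention implementation.
with context_parallel(mesh, buffers=qkv, buffer_seq_dims=[2, 2, 2]):
    out = F.scaled_dot_product_attention(*qkv, is_causal=True)

dist.destroy_process_group()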