Pull requests: pytorch/ao

Pull requests list

Add CUTLASS-based row-wise scaled sparse FP8 kernel CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. float8 sparsity topic: new feature Use this tag if this PR adds a new feature
#1671 opened Feb 5, 2025 by alexsamardzic Draft
Support power of 2 scaling factors in float8 training CLA Signed topic: new feature
#1670 opened Feb 5, 2025 by danielvegamyhre
Feat/blockwise fp8 quant CLA Signed
#1668 opened Feb 5, 2025 by Degnel
Support mixed MX element dtype in mx_mm function and MXLinear. CLA Signed topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories)
#1667 opened Feb 5, 2025 by balancap
Add boiler plate code to Tensor subclass CLA Signed topic: improvement
#1663 opened Feb 4, 2025 by jainapurva
Add mx_fp4_kernel CLA Signed mx topic: new feature
#1661 opened Feb 4, 2025 by drisspg
Tensor parallel support for fpx dtype CLA Signed
#1660 opened Feb 4, 2025 by jainapurva Draft
Attempt to switch everything to cmake CLA Signed
#1659 opened Feb 4, 2025 by drisspg
Tensor parallel support for uintx dtype CLA Signed topic: improvement
#1656 opened Feb 3, 2025 by jainapurva Draft
draft ukernel selection logic CLA Signed
#1652 opened Feb 3, 2025 by metascroy Draft
[Fix]: Fallback to KleidiAI channelwise kernel groupsize isnt suitable CLA Signed
#1647 opened Jan 31, 2025 by ng-05
Change TORCH_LIBRARY to TORCH_LIBRARY_FRAGMENT CLA Signed
#1645 opened Jan 30, 2025 by metascroy
Tests jeanschmidt/NVIDIA_IMEX_CHANNELS changes CLA Signed
#1644 opened Jan 30, 2025 by jeanschmidt
mx: add ceil and RNE rounding modes to the cast from fp32 to e8m0 CLA Signed topic: improvement
#1643 opened Jan 30, 2025 by vkuzo
Add mx_fp8_bf16 kernel CLA Signed mx topic: new feature
#1637 opened Jan 29, 2025 by drisspg
Create separate float8 tensor subclass CLA Signed
#1636 opened Jan 29, 2025 by jainapurva Draft
Add boilerplate code CLA Signed topic: for developers Use this tag if this PR is mainly developer facing topic: improvement
#1635 opened Jan 29, 2025 by jainapurva Draft
Update to cutlass 3.8 | wait for tag to land CLA Signed mx topic: not user facing Use this tag if you don't want this PR to show up in release notes
#1634 opened Jan 28, 2025 by drisspg
Fix broken nav link in README
#1633 opened Jan 28, 2025 by ByronHsu
[not for land] hook up MX to CUDA 12.8 cuBLAS MX gemm CLA Signed topic: not user facing
#1625 opened Jan 27, 2025 by vkuzo
convert dora fusion test from pytest to unittest CLA Signed topic: not user facing
#1622 opened Jan 26, 2025 by osbm
Move tensor_flatten/unflatten from AQT -> TensorAOBaseClass CLA Signed topic: for developers
#1615 opened Jan 24, 2025 by jainapurva Draft
Add H100 to CI for regression CLA Signed topic: for developers topic: not user facing
#1614 opened Jan 24, 2025 by jainapurva Draft
[WIP] speed up CodebookQuantizedTensor inference CLA Signed
#1607 opened Jan 23, 2025 by DerekLiu35
4 tasks
Move Hqq quantization to subclass CLA Signed
#1604 opened Jan 23, 2025 by jainapurva Draft