Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

Appearance settings

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

codeplaysoftware / cutlass-sycl Public

forked from NVIDIA/cutlass

Notifications You must be signed in to change notification settings
Fork 29
Star 21

Code
Issues 7
Pull requests 24
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: codeplaysoftware/cutlass-sycl

Labels 11 Milestones 0

Labels 11 Milestones 0

New pull request New

24 Open 325 Closed

24 Open 325 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Avoid warnings about Int (alias template)

#365 opened May 8, 2025 by joeatodd

Loading…

2

Fix case where matrix size exceeds max uint32

#364 opened May 8, 2025 by joeatodd

Loading…

4

Add benchmark for Flash Attention Decode

#363 opened May 7, 2025 by muhammad-tanvir-1211

Loading…

2

Fix Variable Sequence Length Support for Flash Attention Decode

#362 opened May 7, 2025 by muhammad-tanvir-1211

Loading…

5

Added the second release required FP16 fine-tuned GEMM kernels.

#361 opened May 7, 2025 by tdeng5

Loading…

1

Fix variable length for Flash Attention prefill

#360 opened May 6, 2025 by muhammad-tanvir-1211

Loading…

Fix Variable Sequence Length Support for Flash Attention Prefill + KV Cache

#359 opened May 6, 2025 by muhammad-tanvir-1211

Loading…

4

Reenable BMG examples in CI testing

#358 opened May 6, 2025 by t4c1

Loading…

Support for bf16 C and D in GEMM

#356 opened May 2, 2025 by FMarno

Loading…

5

Rename files and classes for Flash Attention

#354 opened May 2, 2025 by muhammad-tanvir-1211

Loading…

5

Pass IGC options on cmd line

#353 opened May 2, 2025 by rolandschulz

Loading…

2

Enable FP8_E5M2 GEMM and unify FP8 GEMM implementation with xe_mma.hpp

#352 opened May 2, 2025 by sanchitintel

Loading…

1

[Please do not review] FP8 Grouped GEMM CollectiveMma

#351 opened May 1, 2025 by sanchitintel • Draft

1 task

enable splitk for mixed precision gemm

#339 opened Apr 29, 2025 by taozha2

Loading…

5

Avoid the cacheline alignment requirement for batches

#325 opened Apr 22, 2025 by joeatodd • Draft

2

Input alignment

#323 opened Apr 22, 2025 by t4c1

Loading…

add gemm with rmsnorm

#321 opened Apr 22, 2025 by yuankuns

Loading…

add int8/tf32 transpose A copy traits

#319 opened Apr 21, 2025 by taozha2

Loading…

3

Pure FP8 (W8A8) GEMM support (draft)

#306 opened Apr 14, 2025 by jiyang1011

Loading…

9

RFC: test out new syntax for launch with type deduction

#305 opened Apr 12, 2025 by rolandschulz

Loading…

7

Use Collective builder for benchmarks

#302 opened Apr 10, 2025 by FMarno

Loading…

4

Enable SM90 via sycl-cuda-compat

#276 opened Mar 24, 2025 by FMarno

Loading…

Enable batch tests for streamK

#258 opened Mar 12, 2025 by aacostadiaz

Loading…

Switch to SPIRV APIs from internal built-in APIs

#255 opened Mar 12, 2025 by jiyang1011

Loading…

ProTip! Mix and match filters to narrow down what you’re looking for.

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.