Skip to content

Commit eabbf28

Browse files
committed
mamf update
1 parent dd17be1 commit eabbf28

File tree

1 file changed

+2
-3
lines changed

1 file changed

+2
-3
lines changed

compute/accelerator/README.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@ As most of us rent the compute, and we never see what it looks like, here is a h
6262
- HPC: High-performance Computing
6363
- HPU: Habana Gaudi AI Processor Unit
6464
- IPU: Intelligence Processing Unit
65-
- MAMF: Maximum Achievable FLOPS
65+
- MAMF: Maximum Achievable Matmul FLOPS
6666
- MME: Matrix Multiplication Engine
6767
- QPU: Quantum Processing Unit
6868
- RDU: Reconfigurable Dataflow Unit
@@ -182,8 +182,6 @@ General notes:
182182

183183
#### Maximum Achievable FLOPS
184184

185-
MAMF stands for Maximum Achievable FLOPS, which is a term coined by yours truly. This and the following sections explain why this metric is very practical for those who do performance optimization work.
186-
187185
Theoretical peak FLOPS is what gets published on the accelerator's spec. And it's calculated as:
188186

189187
`Theoretical FLOPS = compute_unit_clock_speed * flops_per_clock_cycle_per_compute_unit * num_compute_units`
@@ -201,6 +199,7 @@ To provide a numerical sense to what I'm talking about let's take an A100 with i
201199

202200
You could measure the the actual peak TFLOPS by doing a perfectly aligned max-size matrices `matmul` measured on a single accelerator. You can use [Maximum Achievable Matmul FLOPS Finder](benchmarks#maximum-achievable-matmul-flops-finder) to reproduce the results. But, of course, this will only tell you how well your given accelerator and its software stack do `matmul` - depending on the workload this might be all you need to know, or not.
203201

202+
MAMF stands for [Maximum Achievable Matmul FLOPS](#maximum-achievable-matmul-flops-comparison-table), which is a term coined by yours truly. It is very practical for those who do performance optimization work.
204203

205204
#### Maximum Achievable Matmul FLOPS comparison table
206205

0 commit comments

Comments
 (0)