@@ -44,25 +44,27 @@ Despite all implementation are fast, not all the paths are implemented using SIM
`~` - Partially implemented

- |                 | NEON | SSE | AVX2 | WASM |
- | --------------- | ---- | --- | ---- | ---- |
- | RGBA (8 bit)    | x    | x   | x    | ~    |
- | RGB (8 bit)     | x    | x   | ~    | ~    |
- | Plane (8 bit)   | x    | x   | ~    | ~    |
- | RGBA (8+ bit)   | x    | x   | ~    | -    |
- | RGB (8+ bit)    | x    | x   | ~    | -    |
- | Plane (8+ bit)  | ~    | ~   | ~    | -    |
- | RGBA (f32)      | x    | x   | x    | -    |
- | RGB (f32)       | x    | x   | ~    | -    |
- | Plane (f32)     | x    | x   | ~    | -    |
- | RGBA (f16)      | x    | x   | x    | -    |
- | RGB (f16)       | x    | ~   | ~    | -    |
- | Plane (f16)     | ~    | ~   | ~    | -    |
- | AR30/RA30       | x    | -   | -    | -    |
+ |                 | NEON | SSE | AVX2 | AVX-512 | WASM |
+ | --------------- | ---- | --- | ---- | ------- | ---- |
+ | RGBA (8 bit)    | x    | x   | x    | ~       | ~    |
+ | RGB (8 bit)     | x    | x   | ~    | ~       | ~    |
+ | Plane (8 bit)   | x    | x   | ~    | ~       | ~    |
+ | RGBA (8+ bit)   | x    | x   | ~    | -       | -    |
+ | RGB (8+ bit)    | x    | x   | ~    | -       | -    |
+ | Plane (8+ bit)  | ~    | ~   | ~    | -       | -    |
+ | RGBA (f32)      | x    | x   | x    | -       | -    |
+ | RGB (f32)       | x    | x   | ~    | -       | -    |
+ | Plane (f32)     | x    | x   | ~    | -       | -    |
+ | RGBA (f16)      | x    | x   | x    | -       | -    |
+ | RGB (f16)       | x    | ~   | ~    | -       | -    |
+ | Plane (f16)     | ~    | ~   | ~    | -       | -    |
+ | AR30/RA30       | x    | -   | -    | -       | -    |

#### Features

- To enable support of `f16` the feature `half` should be activated.
+ Features:
+ - To enable support of `f16` the feature `half` should be activated.
+ - `nightly_avx512` activates the AVX-512 feature set and requires the `nightly` compiler channel.

#### Target features with runtime dispatch
@@ -72,6 +74,8 @@ For x86 and aarch64 NEON runtime dispatch is used.
`avx2`, `fma`, `sse4.1`, and `f16c` are detected automatically when available; no additional action is needed, and the best available path is called.

+ `avx512` requires the `nightly_avx512` feature and the `nightly` compiler channel; with both in place, it is detected at runtime and used when available.
+
`fullfp16` NEON target detection is performed at runtime; when it is available, the best paths for *f16* images are used on ARM.

For WASM, activating the `simd128` target feature in the build flags is mandatory.
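
The difference between runtime dispatch on x86/aarch64 and the compile-time requirement on WASM can be illustrated with a minimal sketch. This is not the library's actual dispatcher: `pick_simd_path` and the returned path names are hypothetical, and only the standard-library detection macros and `cfg` attributes are assumed.

```rust
/// Hypothetical helper (not part of this library): names the SIMD path
/// a dispatcher could select on the current machine.
pub fn pick_simd_path() -> &'static str {
    #[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
    {
        // Runtime detection: the CPU is queried when the program runs,
        // widest instruction set first.
        if std::is_x86_feature_detected!("avx2") {
            return "avx2";
        }
        if std::is_x86_feature_detected!("sse4.1") {
            return "sse4.1";
        }
    }

    #[cfg(target_arch = "aarch64")]
    {
        // Runtime detection on ARM.
        if std::arch::is_aarch64_feature_detected!("neon") {
            return "neon";
        }
    }

    #[cfg(all(target_arch = "wasm32", target_feature = "simd128"))]
    {
        // No runtime detection here: this branch is compiled in only when
        // `simd128` was enabled in the build flags.
        return "simd128";
    }

    // Portable fallback when no SIMD path applies.
    "scalar"
}
```

On x86 and aarch64 the choice is made each time the program runs, so one binary adapts to the host CPU. On WASM the `simd128` branch exists only if the target feature was enabled at build time, for example with `RUSTFLAGS="-C target-feature=+simd128"`.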