Commit db10dd8

Fixed _gemm_small() prototype; disabled gemm_small.
Details:
- Fixed a mismatch between the prototype for bli_gemm_small() in bli_gemm_front.h and the actual definition of bli_gemm_small() in kernels/zen/3/bli_gemm_small.c. The former was erroneously declaring the cntl_t* argument as 'const'. Thanks to Jeff Diamond for reporting this issue.
- Commented out BLIS_ENABLE_SMALL_MATRIX, BLIS_ENABLE_SMALL_MATRIX_TRSM macro definitions in config/zen3/bli_family_zen3.h. AMD's small matrix implementation should probably remain disabled in vanilla BLIS, at least for now.
1 parent f0337b7 commit db10dd8

2 files changed, 3 additions and 3 deletions

config/zen3/bli_family_zen3.h

Lines changed: 2 additions & 2 deletions
@@ -52,8 +52,8 @@
 // All zen3 specific code should be included in this macro
 #define BLIS_CONFIG_ZEN3
 
-#define BLIS_ENABLE_SMALL_MATRIX
-#define BLIS_ENABLE_SMALL_MATRIX_TRSM
+//#define BLIS_ENABLE_SMALL_MATRIX
+//#define BLIS_ENABLE_SMALL_MATRIX_TRSM
 
 
 // This will select the threshold below which small matrix code will be called.
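
For readers unfamiliar with this kind of switch, the following is a minimal sketch of how commenting out a feature macro removes a code path at compile time. Only the macro name BLIS_ENABLE_SMALL_MATRIX comes from the diff above; the function `small_path_enabled` is hypothetical and is not part of BLIS, and the way BLIS itself tests the macro is assumed here rather than taken from the commit.

```c
/* Stand-in for the family header: the define is commented out,
   mirroring the state after this commit. */
//#define BLIS_ENABLE_SMALL_MATRIX

/* Hypothetical helper (not BLIS code): reports whether the guarded
   small-matrix branch was compiled into this translation unit. */
static int small_path_enabled(void)
{
#ifdef BLIS_ENABLE_SMALL_MATRIX
    /* This branch exists only when the macro is defined. */
    return 1;
#else
    /* With the define commented out, only this branch is compiled. */
    return 0;
#endif
}
```

Because the decision is made by the preprocessor, code guarded this way adds no runtime cost when disabled; re-enabling it means restoring the define and rebuilding.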

frame/3/gemm/bli_gemm_front.h

Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ err_t bli_gemm_small
 const obj_t* beta,
 const obj_t* c,
 const cntx_t* cntx,
-const cntl_t* cntl
+cntl_t* cntl
 );
 #endif
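
To illustrate why the 'const' on the prototype matters, here is a small sketch using stand-in types. The names `demo_cntl_t`, `demo_err_t`, and `gemm_small_demo` are hypothetical and only mimic the shape of the real declaration; whether the real bli_gemm_small() writes through its cntl argument is not stated in this commit and is not assumed here.

```c
#include <stddef.h>

/* Hypothetical stand-ins for the BLIS types involved. */
typedef struct { int visited; } demo_cntl_t;
typedef int demo_err_t;

/* Prototype, as a header would declare it. It matches the definition
   below exactly, including the absence of 'const' on the pointee. */
demo_err_t gemm_small_demo(demo_cntl_t* cntl);

/* Definition. Had the prototype above read
   'demo_err_t gemm_small_demo(const demo_cntl_t* cntl);', a C compiler
   seeing both in one translation unit would report conflicting types,
   since 'const T*' and 'T*' are not compatible parameter types. */
demo_err_t gemm_small_demo(demo_cntl_t* cntl)
{
    if (cntl == NULL) return -1;
    cntl->visited = 1; /* a write like this requires a non-const pointee */
    return 0;
}
```

If the header is not included where the function is defined, such a mismatch can go unnoticed by the compiler, which is one reason to include a function's own header in its defining source file.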
