@@ -20,51 +20,99 @@ julia> N = 100;
20
20
21
21
julia> A = rand (N,N); B = rand (N,N); C = similar (A);
22
22
23
- julia> @benchmark TriangularSolve. rdiv! ($ C, $ A, UpperTriangular ($ B), Val (false )) # false means single threaded
23
+ julia> @benchmark TriangularSolve. rdiv! (copyto! ( $ C, $ A) , UpperTriangular ($ B), Val (false )) # false means single threaded
24
24
BenchmarkTools. Trial: 10000 samples with 1 evaluation.
25
- Range (min … max): 15.909 μs … 41.524 μs ┊ GC (min … max): 0.00 % … 0.00 %
26
- Time (median): 17.916 μs ┊ GC (median): 0.00 %
27
- Time (mean ± σ): 17.751 μs ± 697.786 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
25
+ Range (min … max): 12.106 μs … 108.141 μs ┊ GC (min … max): 0.00 % … 0.00 %
26
+ Time (median): 12.841 μs ┊ GC (median): 0.00 %
27
+ Time (mean ± σ): 12.862 μs ± 1.585 μs ┊ GC (mean ± σ): 0.00 % ± 0.00 %
28
28
29
- ▃▁ ▁ ▁ ▄▁ ▇▆ ▆█▃ ▂
30
- ██▃▁▁██ ▁▁▁▁█▆▁▁▃▇██▄▃▁███▆▁▄▄███▄▄▅▅▆▇█▇▄▅▆▇██▇█▇▇▆▄▅▄▁▄▁▄▄▇ █
31
- 15.9 μs Histogram: log ( frequency) by time 19.9 μs <
29
+ █
30
+ ▃█▆▂ ▁▁▁▁▂▃▇▄▂▂▇▄▂▂▂▃▃▃▂▂▂▃▃▂▂▂▂▁▁▁▁▂▂▁▁▂▂▂▂▂▂▂▂▂▂▂▂▁▂▂▂▂▂▂▂▂ ▂
31
+ 12.1 μs Histogram: frequency by time 16.6 μs <
32
32
33
33
Memory estimate: 0 bytes, allocs estimate: 0.
34
34
35
35
julia> @benchmark rdiv! (copyto! ($ C, $ A), UpperTriangular ($ B))
36
36
BenchmarkTools. Trial: 10000 samples with 1 evaluation.
37
- Range (min … max): 17.578 μs … 75.835 μs ┊ GC (min … max): 0.00 % … 0.00 %
38
- Time (median): 19.852 μs ┊ GC (median): 0.00 %
39
- Time (mean ± σ): 19.827 μs ± 1.342 μs ┊ GC (mean ± σ): 0.00 % ± 0.00 %
37
+ Range (min … max): 13.483 μs … 31.165 μs ┊ GC (min … max): 0.00 % … 0.00 %
38
+ Time (median): 13.698 μs ┊ GC (median): 0.00 %
39
+ Time (mean ± σ): 13.816 μs ± 791.937 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
40
40
41
- ▄▂ ▂ ▆▅ ▁█▇▂ ▅▃ ▂ ▂
42
- ██ ▁▁▃█▇▁▁▁█▇▄▄▁██▇▄▄▄██▆▅▄████▅▄▆██▆▆▆▆▇██▇▇▆▆▇▆▅▆▄▅▅▆▄▅▄▅▅ █
43
- 17.6 μs Histogram: log (frequency) by time 22.4 μs <
41
+ ▃▆██▅ ▃▄▃ ▂
42
+ ▆██████▄▃ ▁▁▃▁▁▇████▃▃▃▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▅▄▁▄▆▇▆▆ █
43
+ 13.5 μs Histogram: log (frequency) by time 16.8 μs <
44
44
45
45
Memory estimate: 0 bytes, allocs estimate: 0.
46
46
47
- julia> @benchmark ldiv! ( $ C, LowerTriangular ($ B), $ A)
47
+ julia> @benchmark TriangularSolve . rdiv! ( copyto! ( $ C, $ A), LowerTriangular ($ B), Val ( false )) # false means single threaded
48
48
BenchmarkTools. Trial: 10000 samples with 1 evaluation.
49
- Range (min … max): 19.102 μs … 69.966 μs ┊ GC (min … max): 0.00 % … 0.00 %
50
- Time (median): 21.561 μs ┊ GC (median): 0.00 %
51
- Time (mean ± σ): 21.565 μs ± 890.952 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
49
+ Range (min … max): 12.990 μs … 35.762 μs ┊ GC (min … max): 0.00 % … 0.00 %
50
+ Time (median): 13.094 μs ┊ GC (median): 0.00 %
51
+ Time (mean ± σ): 13.170 μs ± 636.637 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
52
52
53
- ▂▂ ▂▃ ▄▄ ▆█▄ ▅▅ ▂
54
- ██▃▁▁▁▇█ ▁▁▁▁▅█ ▁▁▁▁▁██▅ ▁▁▁▅██▆ ▁▁▁▆███▆▅▃▅████▃▄▅██▇▇▅▆▆▇▇█▇▆▆ █
55
- 19.1 μs Histogram: log (frequency) by time 23.4 μs <
53
+ ▄██▅ ▂▂ ▂
54
+ ████▆▄ ▃▁▁▁▁▁▁▄███▃ ▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▃▄▅▄▅▅▇ █
55
+ 13 μs Histogram: log (frequency) by time 16 μs <
56
56
57
57
Memory estimate: 0 bytes, allocs estimate: 0.
58
58
59
- julia> @benchmark TriangularSolve. ldiv! ($ C, LowerTriangular ($ B), $ A, Val (false )) # false means single threaded
59
+ julia> @benchmark rdiv! (copyto! ($ C, $ A), LowerTriangular ($ B))
60
+ BenchmarkTools. Trial: 10000 samples with 1 evaluation.
61
+ Range (min … max): 14.119 μs … 32.767 μs ┊ GC (min … max): 0.00 % … 0.00 %
62
+ Time (median): 14.321 μs ┊ GC (median): 0.00 %
63
+ Time (mean ± σ): 14.372 μs ± 461.912 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
64
+
65
+ ▂█▆
66
+ ▂▄███▇▃▂▂▂▁▁▂▁▂▂▂▂▂▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▂▂▂▂▂▂ ▂
67
+ 14.1 μs Histogram: frequency by time 17.3 μs <
68
+
69
+ Memory estimate: 0 bytes, allocs estimate: 0.
70
+
71
+ julia> @benchmark TriangularSolve. ldiv! (LowerTriangular ($ B), copyto! ($ C, $ A), Val (false )) # false means single threaded
72
+ BenchmarkTools. Trial: 10000 samples with 1 evaluation.
73
+ Range (min … max): 16.579 μs … 38.431 μs ┊ GC (min … max): 0.00 % … 0.00 %
74
+ Time (median): 16.728 μs ┊ GC (median): 0.00 %
75
+ Time (mean ± σ): 16.812 μs ± 640.766 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
76
+
77
+ ▁▅██▅ ▁▂▁ ▂
78
+ █████▇▃▁▁▁▁▁▁▁▁▁▁▁▄███▅▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▅▃▅▃▅▇█ █
79
+ 16.6 μs Histogram: log (frequency) by time 19.6 μs <
80
+
81
+ Memory estimate: 0 bytes, allocs estimate: 0.
82
+
83
+ julia> @benchmark ldiv! (LowerTriangular ($ B), copyto! ($ C, $ A))
84
+ BenchmarkTools. Trial: 10000 samples with 1 evaluation.
85
+ Range (min … max): 15.763 μs … 38.550 μs ┊ GC (min … max): 0.00 % … 0.00 %
86
+ Time (median): 15.936 μs ┊ GC (median): 0.00 %
87
+ Time (mean ± σ): 16.351 μs ± 1.103 μs ┊ GC (mean ± σ): 0.00 % ± 0.00 %
88
+
89
+ ▃▇█▆ ▁▅▆▄ ▂▄▅▃ ▁▂▁ ▂
90
+ █████▄▄▁▁▁▃▁▄████▆▁▅████▆▁▃▆███▅▃▁▁▁▁▁▁▁▁▁▁▁▁▄▃▄▅▆▇█▆▅▆▅▅▆▄ █
91
+ 15.8 μs Histogram: log (frequency) by time 19.5 μs <
92
+
93
+ Memory estimate: 0 bytes, allocs estimate: 0.
94
+
95
+ julia> @benchmark TriangularSolve. ldiv! (UpperTriangular ($ B), copyto! ($ C, $ A), Val (false )) # false means single threaded
96
+ BenchmarkTools. Trial: 10000 samples with 1 evaluation.
97
+ Range (min … max): 15.921 μs … 34.610 μs ┊ GC (min … max): 0.00 % … 0.00 %
98
+ Time (median): 16.180 μs ┊ GC (median): 0.00 %
99
+ Time (mean ± σ): 16.290 μs ± 733.446 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
100
+
101
+ ▅██▃ ▂▃▁ ▂
102
+ ▇█▆████▅▄▁▁▁▁▁▁▁▃▁▅███▄▁▁▁▁▄▇█▄▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▃▆▅▄▃▇█ █
103
+ 15.9 μs Histogram: log (frequency) by time 19.1 μs <
104
+
105
+ Memory estimate: 0 bytes, allocs estimate: 0.
106
+
107
+ julia> @benchmark ldiv! (UpperTriangular ($ B), copyto! ($ C, $ A))
60
108
BenchmarkTools. Trial: 10000 samples with 1 evaluation.
61
- Range (min … max): 19.082 μs … 39.078 μs ┊ GC (min … max): 0.00 % … 0.00 %
62
- Time (median): 19.694 μs ┊ GC (median): 0.00 %
63
- Time (mean ± σ): 19.765 μs ± 774.848 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
109
+ Range (min … max): 15.085 μs … 37.057 μs ┊ GC (min … max): 0.00 % … 0.00 %
110
+ Time (median): 15.274 μs ┊ GC (median): 0.00 %
111
+ Time (mean ± σ): 15.373 μs ± 733.025 ns ┊ GC (mean ± σ): 0.00 % ± 0.00 %
64
112
65
- ▃ ▄█ ▁
66
- ▂▇██▄▂▁▁▂▂▃ ███▃▂▁▂▁▂▂▅█▇▃▂▂▂▁▂▂▂▂▂▂▂▁▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▁▁▂▂▂ ▃
67
- 19 .1 μs Histogram: frequency by time 22.1 μs <
113
+ ▄▇█▇▄ ▃▂ ▂
114
+ ██ ████▇▃▁▃▁▁▃▁▁▅████▄▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▄▆▃▅▅▆█ █
115
+ 15 .1 μs Histogram: log ( frequency) by time 18.3 μs <
68
116
69
117
Memory estimate: 0 bytes, allocs estimate: 0.
70
118
```
0 commit comments