You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+14-2Lines changed: 14 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,7 @@
1
1
# leela-chess-experimental
2
2
based on Leela Chess Zero https://github.com/LeelaChessZero
3
3
4
+
Update 19/06/2018: V2.1, new source, new executable, changes to tree balancing, easy-early-visits
4
5
Update 17/06/2018: New source, new executable, new parameters and new test results.
5
6
6
7
I tried a number of MCTS search ideas in lc0. If you find something interesting here feel free to open an issue and discuss. This is a work in progress and purely experimental - new ideas will be added from time to time. This serves as documentation of both the good as well as the bad tries - so do not expect huge gains - but some ideas yield a measurable elo gain.
@@ -9,7 +10,7 @@ Disclaimer: All changes are completely zero and game agnostic.
9
10
10
11
## Search Modifications
11
12
12
-
### Tree Balancing
13
+
### Tree Balancing - Work in Progress
13
14
The upper confidence bound used in LC0's UCT flavor (and that of A0) assumes that the confidence bound of a child node is not affected by the local branching factor. However, in some games like Draughts or Chess the number of legal moves (branches) can vary greatly even in the same part of the search tree. This modification is based on the idea that we can use the number of individual branches in relation to the average number of branches to adjust the upper bound when selecting child nodes for expansion.
14
15
15
16
Initial testing with Parameters:
@@ -22,7 +23,13 @@ at 800 visits per move in 1000 games yielded these results:
Besides the small gain of elo, this has some additonal nice properties. Leela now finds shallow mates faster and certain winning moves at root can be played regardless of visit counts, which is beneficial in time pressure situations (typically MCTS is slow to revise initial estimates).
37
44
45
+
Update:
46
+
Improvement (v2.1) - if current best move is a certain loss change best move to a non loosing move, even if visits are lower.
47
+
38
48
### Compress low policy move probabilites
39
49
Instead of changing softmax temperature this scheme encourages exploration of low policy priors by compressing low probabilites more than high probabilites in relation to search depths. This does well at tactics (>170/200 WAC Silvertestsuite with standard cpuct=1.2) but suffers somewhat in selfplay, even though results against different opponents (non leela) are better. Might be useful for long analysis as it restores MCTS convergence properties (under some circumstances leela would never find moves no matter how many nodes visited.).
40
50
```
@@ -50,6 +60,8 @@ Tweaks the formula slightly to encourage early visits. The confidence bound is a
50
60
```
51
61
Might help ever so slightly tactically - this is untested but might work well in conjunction with policy compression. Self-play might suffer, but untested against non-leela opponents.
52
62
63
+
Update: This is now float parameter with 0.0 turning this feature off and 1.0 corresponding to old enabled behavior. Now values between 0.0 and 1.0 are also possible.
64
+
53
65
### Q-Moving-Average
54
66
Tested some variants of Gudmundsson and Björnsson and Feldman and Domshlak. For a description see:
0 commit comments