This page contains the resources for the experiments of Pommerman as discussed in the TLeague technical report.
The training yaml in the technical reports can be generated by pommerman.yml.jinja2.
python3 -m tleague.sandbox.run_local_battle_pommerman \
--policy_config="{
'use_xla': False,
'rollout_len': 1,
'test': True,
'rl': False,
'use_loss_type': 'none',
'use_value_head': False,
'use_self_fed_heads': True,
'use_lstm': True,
'nlstm': 64,
'hs_len': 128,
'lstm_duration': 1,
'lstm_dropout_rate': 0.0,
'lstm_cell_type': 'lstm',
'lstm_layer_norm': True,
'weight_decay': 0.00000002,
'n_v': 11,
'merge_pi': False,
}" \
--model=0076:0077_20201029114642.model
The trained model (after 10 day's training) can be downloaded at Google Drive or Tencent Weiyun.
Here are the 100 replay files of our agent against Navocado as discussed in the TLeague technical report: Google Drive or Tencent Weiyun.
Replays can be displayed by drag-and-drop on the page Pommerman Playback.