Pommerman Experiments

This page collects the resources for the Pommerman experiments discussed in the TLeague technical report.

Training Code

The training YAML used in the technical report can be generated from the template pommerman.yml.jinja2.

Evaluation

python3 -m tleague.sandbox.run_local_battle_pommerman \
--policy_config="{
    'use_xla': False,
    'rollout_len': 1,
    'test': True,
    'rl': False,
    'use_loss_type': 'none',
    'use_value_head': False,
    'use_self_fed_heads': True,
    'use_lstm': True,
    'nlstm': 64,
    'hs_len': 128,
    'lstm_duration': 1,
    'lstm_dropout_rate': 0.0,
    'lstm_cell_type': 'lstm',
    'lstm_layer_norm': True,
    'weight_decay': 0.00000002,
    'n_v': 11,
    'merge_pi': False,
  }" \
--model=0076:0077_20201029114642.model
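
The `--policy_config` value above reads as a Python dict literal passed as a single string, which is easy to break with a stray quote when edited by hand. As a minimal sketch, the same command can be assembled in Python so the literal is generated rather than typed. The keys and values are copied from the command above; how TLeague itself parses the string is not shown here, so the `ast.literal_eval` call is only a local sanity check that the string is a valid literal, not a description of TLeague's parser.

```python
import ast
import shlex

# Policy settings copied verbatim from the evaluation command above.
policy_config = {
    'use_xla': False,
    'rollout_len': 1,
    'test': True,
    'rl': False,
    'use_loss_type': 'none',
    'use_value_head': False,
    'use_self_fed_heads': True,
    'use_lstm': True,
    'nlstm': 64,
    'hs_len': 128,
    'lstm_duration': 1,
    'lstm_dropout_rate': 0.0,
    'lstm_cell_type': 'lstm',
    'lstm_layer_norm': True,
    'weight_decay': 0.00000002,
    'n_v': 11,
    'merge_pi': False,
}

# repr() of a dict holding only bools, ints, floats, and strings
# is itself a valid Python literal.
config_arg = repr(policy_config)

# Local sanity check (an assumption about the expected format, not
# TLeague's own parsing): the string round-trips to the same dict.
assert ast.literal_eval(config_arg) == policy_config

cmd = [
    "python3", "-m", "tleague.sandbox.run_local_battle_pommerman",
    f"--policy_config={config_arg}",
    "--model=0076:0077_20201029114642.model",
]

# Print a shell-quoted command line that can be pasted into a terminal.
print(shlex.join(cmd))
```

The printed line is shell-quoted, so it can be pasted into a terminal as is, or the `cmd` list can be handed to `subprocess.run(cmd)` directly without going through a shell.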

Downloads

Trained Model

The trained model (after 10 days of training) can be downloaded from Google Drive or Tencent Weiyun.

Replay Files

Here are the 100 replay files of our agent against Navocado as discussed in the TLeague technical report: Google Drive or Tencent Weiyun.

To view a replay, drag and drop the file onto the Pommerman Playback page.