Pommerman Experiments

This page contains the resources for the experiments of Pommerman as discussed in the TLeague technical report.

Training Code

The training yaml in the technical reports can be generated by pommerman.yml.jinja2.

Evaluation

python3 -m tleague.sandbox.run_local_battle_pommerman \
--policy_config="{
    'use_xla': False,
    'rollout_len': 1,
    'test': True,
    'rl': False,
    'use_loss_type': 'none',
    'use_value_head': False,
    'use_self_fed_heads': True,
    'use_lstm': True,
    'nlstm': 64,
    'hs_len': 128,
    'lstm_duration': 1,
    'lstm_dropout_rate': 0.0,
    'lstm_cell_type': 'lstm',
    'lstm_layer_norm': True,
    'weight_decay': 0.00000002,
    'n_v': 11,
    'merge_pi': False,
  }" \
--model=0076:0077_20201029114642.model

Downloads

Trained Model

The trained model (after 10 day's training) can be downloaded at Google Drive or Tencent Weiyun.

Replay Files

Here are the 100 replay files of our agent against Navocado as discussed in the TLeague technical report: Google Drive or Tencent Weiyun.

Replays can be displayed by drag-and-drop on the page Pommerman Playback.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Pommerman Experiments

Training Code

Evaluation

Downloads

Trained Model

Replay Files

Files

README.md

Latest commit

History

README.md

File metadata and controls

Pommerman Experiments

Training Code

Evaluation

Downloads

Trained Model

Replay Files