Add EvolvedAttention: A transformer-based neural network strategy for Prisoner's Dilemma #1471

moderouin · 2025-02-28T18:37:44Z

Description

This PR introduces the EvolvedAttention strategy, a novel approach that uses a transformer neural network with self-attention mechanisms to make decisions. The strategy analyzes game history through attention patterns to determine optimal moves.

Features

Transformer architecture with self-attention (24 layers, 8 attention heads)
Memory depth of 200 moves with reverse-chronological processing
GPU acceleration when available (with CPU fallback)
Pre-trained model weights from evolutionary self-play

Technical Implementation

Game states encoded as tokens (CC, CD, DC, DD plus special CLS/PAD tokens)
Position embeddings to maintain sequence information
Decision boundary using sigmoid activation (< 0.5 → Cooperate, ≥ 0.5 → Defect)

Performance Considerations

The neural network is relatively complex but runs efficiently on modern hardware. The strategy balances analytical depth with reasonable computational requirements.

marcharper · 2025-03-01T18:40:38Z

Hi @moderouin, thanks for your contribution! Have you run the tests locally? I'm wondering if the new tests are very slow or if the issue is with Github's CI.

moderouin · 2025-03-01T20:53:41Z

The test runs within 20 minutes on my PC with Ubuntu 22.04.5 LTS and also seems to work well on Windows and macOS. However, I’m able to reproduce the CI bug within a Docker container using ubuntu:latest; the tests never seem to end. It appears to be an issue with *.rts. I’m currently trying to fix this, but if you have any insights, that would be great! @marcharper

moderouin · 2025-03-02T14:38:33Z

It seems like the error was caused by the forking of process on Linux. I use spawn instead to avoid any deadlocks @marcharper

marcharper · 2025-03-02T16:55:30Z

How does the strategy perform? Have you run any tournaments?

moderouin · 2025-03-02T18:46:17Z

When running a tournament with 10 repetitions against all the strategies not in long_run_time, it ranked first in median score, with a score close to that of EvolvedLookerUp2_2_2.

marcharper · 2025-03-04T03:57:15Z

Can you tell us more about how you trained it?

moderouin · 2025-03-05T20:00:07Z

This strategy was trained by performing multiple rounds of tournaments against all strategies and the current network of EvolvedAttention. After each tournament, the strategy learns to reproduce the moves of the best-performing strategy from the last tournament.

moderouin · 2025-03-05T22:33:16Z

I’m also currently working on a second version with the same architecture, but incorporating an actor-critic approach with policy and value heads on top of the base network and training it using PPO.

moderouin · 2025-03-07T19:31:10Z

Is there any adjustment I should make to this strategy? @marcharper

marcharper · 2025-03-08T03:11:16Z

No, I just haven't had a chance to review thoroughly, and we require two maintainer review. It's not surprising that it does so well if you trained against all the other strategies (it might be overfit), but that's not a blocker to including it. For your future trainings, try using just the short runtime strategies, which has worked fine for the other ML strategies and saves a lot of computation time.

marcharper

Looks good overall, some minor comments. PTAL and thanks for the contribution!

axelrod/data/all_classifiers.yml

axelrod/strategies/attention.py

axelrod/tests/integration/test_matches.py

axelrod/tournament.py

tox.ini

marcharper · 2025-03-09T21:35:44Z

@jsafyan PTAL, you're more of a transformer expert than I.

moderouin · 2025-03-16T17:27:53Z

@marcharper Do you have an idea why this test fail in the last check? Seems like it's not related to my strategy?

moderouin · 2025-04-27T15:35:18Z

Is there anything else I should change ? @drvinceknight @marcharper

drvinceknight · 2025-05-02T13:34:10Z

Is there anything else I should change ?

Would you be able to add a few more tests (against known sequences of play or strategies) here axelrod/tests/strategies/test_attention.py please?

I see you have the test again two cooperations but it would be good to have more wide ranging ones (even if due to the nature of this strategy it serves more as a regression test).

moderouin · 2025-05-03T21:43:12Z

Let me know what you think @drvinceknight

drvinceknight

This looks good to me.

drvinceknight · 2025-05-26T14:59:00Z

@marcharper I think this is just waiting on you to ok.

marcharper · 2025-05-28T01:33:07Z

The entire test suite is substantially slower now. Is there something we can do to mitigate this, such as loading the weights more lazily?

drvinceknight · 2025-05-28T07:59:49Z

(Sorry I clicked a button by mistake closing the PR!)

drvinceknight · 2025-05-28T08:02:12Z

The entire test suite is substantially slower now. Is there something we can do to mitigate this, such as loading the weights more lazily?

I hadn't realised this. How mush slower is it? If @marcharper's good suggestion doesn't work what about marking some tests as "slow" (pytest has tooling for this: https://pytest-with-eric.com/pytest-best-practices/pytest-markers/)?

moderouin · 2025-06-04T12:49:14Z

The entire test suite is substantially slower now. Is there something we can do to mitigate this, such as loading the weights more lazily?

Looking into this

moderouin · 2025-06-04T17:07:26Z

Lazy loading of the model weights has reduced the total test run time by roughly one hour. Let me know what you think @marcharper

drvinceknight · 2025-06-06T07:40:19Z

Unless I'm not remembering correctly before this PR the test run would be about 20 minutes, and your latest commit is at 60 minutes.

Have I got that right? (not sure who this question is for)

moderouin · 2025-06-06T20:34:19Z

Unless I'm not remembering correctly before this PR the test run would be about 20 minutes, and your latest commit is at 60 minutes.

Have I got that right? (not sure who this question is for)

That's correct. test_meta.py is taking a really long time to run. I'll try to reduce the runtime while still preserving the quality of the tests

marcharper · 2025-06-08T20:55:15Z

@drvinceknight I'm ok with this going in for now, we should think about how to speed up the test suite though, maybe by ensuring that fewer of the long run time strategies end up in test tournaments or meta strategies.

drvinceknight · 2025-06-09T10:14:56Z

That's correct. test_meta.py is taking a really long time to run. I'll try to reduce the runtime while still preserving the quality of the tests

Thanks I think you've done an awesome job, appreciate you working to get the tests under control :)

@drvinceknight I'm ok with this going in for now, we should think about how to speed up the test suite though, maybe by ensuring that fewer of the long run time strategies end up in test tournaments or meta strategies.

100%!

moderouin marked this pull request as ready for review February 28, 2025 19:05

moderouin force-pushed the attention-strategy branch from 8022169 to a2f8e23 Compare March 1, 2025 05:05

Adding attention strategy

927371b

moderouin force-pushed the attention-strategy branch from a2f8e23 to 927371b Compare March 1, 2025 05:08

moderouin added 2 commits March 1, 2025 08:30

Adding deadline=None

7cf83d0

Adding verbosity to see which test is stalling the CI on ubuntu

dfb5cc0

moderouin closed this Mar 1, 2025

moderouin reopened this Mar 1, 2025

moderouin added 2 commits March 1, 2025 22:53

Use spawn instead of fork for new process

9b6ff87

Removing pytest-xdist -n auto

ab9c165

marcharper requested changes Mar 9, 2025

View reviewed changes

marcharper added the ready-for-review label Mar 11, 2025

Long run time

0d15ecc

marcharper requested a review from drvinceknight March 13, 2025 03:36

Adding evolved attentio to long run strategies

4dc44c1

moderouin added 2 commits March 16, 2025 15:44

Adding min_value for s = 0

d680f6c

Changing max_examples=5 to max_examples=2

fcfb6f5

Adding test against strategies

e800b3e

moderouin added 2 commits May 4, 2025 15:30

Removing stochastic strategy in test

a4f275b

Fixing randomize test

b9e837c

moderouin requested a review from marcharper May 23, 2025 12:56

drvinceknight approved these changes May 26, 2025

View reviewed changes

drvinceknight closed this May 28, 2025

drvinceknight reopened this May 28, 2025

moderouin added 2 commits June 4, 2025 09:28

Lazy loading of model weights

9635a07

mypy compliance

6a36621

Shorter run time for test_meta

a7f6a40

moderouin force-pushed the attention-strategy branch from 23c78cd to a7f6a40 Compare June 6, 2025 20:35

Adding coverage to frequency analyzer

be46365

drvinceknight merged commit 4b3369c into Axelrod-Python:dev Jun 9, 2025
7 checks passed

Add EvolvedAttention: A transformer-based neural network strategy for Prisoner's Dilemma #1471

Add EvolvedAttention: A transformer-based neural network strategy for Prisoner's Dilemma #1471

Uh oh!

Conversation

moderouin commented Feb 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Features

Technical Implementation

Performance Considerations

Uh oh!

marcharper commented Mar 1, 2025

Uh oh!

moderouin commented Mar 1, 2025

Uh oh!

moderouin commented Mar 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marcharper commented Mar 2, 2025

Uh oh!

moderouin commented Mar 2, 2025

Uh oh!

marcharper commented Mar 4, 2025

Uh oh!

moderouin commented Mar 5, 2025

Uh oh!

moderouin commented Mar 5, 2025

Uh oh!

moderouin commented Mar 7, 2025

Uh oh!

marcharper commented Mar 8, 2025

Uh oh!

marcharper left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

marcharper commented Mar 9, 2025

Uh oh!

moderouin commented Mar 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

moderouin commented Apr 27, 2025

Uh oh!

drvinceknight commented May 2, 2025

Uh oh!

moderouin commented May 3, 2025

Uh oh!

drvinceknight left a comment

Choose a reason for hiding this comment

Uh oh!

drvinceknight commented May 26, 2025

Uh oh!

marcharper commented May 28, 2025

Uh oh!

drvinceknight commented May 28, 2025

Uh oh!

drvinceknight commented May 28, 2025

Uh oh!

moderouin commented Jun 4, 2025

Uh oh!

moderouin commented Jun 4, 2025

Uh oh!

drvinceknight commented Jun 6, 2025

Uh oh!

moderouin commented Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marcharper commented Jun 8, 2025

Uh oh!

drvinceknight commented Jun 9, 2025

Uh oh!

Uh oh!

Uh oh!

moderouin commented Feb 28, 2025 •

edited

Loading

moderouin commented Mar 2, 2025 •

edited

Loading

moderouin commented Mar 16, 2025 •

edited

Loading

moderouin commented Jun 6, 2025 •

edited

Loading