Hi,
Thanks a lot for your amazing work and for releasing the code. I have been trying to reproduce your Table 4 for some time. I use the code and scripts directly, with no modification.
For example, in that table, BYOL fine-tuning on ImageNet-100 in the 5-task class-incremental setting reaches 66.0. Instead, I measured below 60.0, at least 6 points lower. Please see the full results table below if interested (a 5 x 5 table).
results.pdf
Any idea what may be causing the gap? Are there any nuances in the evaluation method? For example, for average accuracy, I simply take the mean of the table below across all rows and columns (as also suggested by GEM, which you referenced).
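To make sure we are computing the same thing, here is a minimal sketch of my averaging step, next to an alternative convention I could imagine (averaging only the final row, i.e. accuracy on every task after training on the last one). The matrix values here are made up, not my actual results:

```python
import numpy as np

# Hypothetical 5 x 5 accuracy matrix R (values are made up):
# R[i, j] = accuracy on task j after training on task i.
R = np.array([
    [70.0, 60.0, 50.0, 40.0, 30.0],
    [68.0, 72.0, 55.0, 45.0, 35.0],
    [66.0, 70.0, 74.0, 50.0, 40.0],
    [64.0, 68.0, 72.0, 76.0, 45.0],
    [62.0, 66.0, 70.0, 74.0, 78.0],
])

# What I do: mean over all rows and columns of the table.
avg_all = R.mean()            # 60.0 for this made-up matrix

# Alternative convention: mean of the last row only
# (performance on all tasks after the final training stage).
avg_last_row = R[-1].mean()   # 70.0 for this made-up matrix

print(avg_all, avg_last_row)
```

If the paper's numbers use the last-row convention (or any other variant), that alone could explain part of the gap, so it would help to know which one you used.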
Thanks again in advance for your response, and for your eye-opening work.