
Commit a785eee

Merge pull request #104 from EleutherAI/memorization-evals
Add Memorization Evals to repo
2 parents 4f1367b + 4ee17ab commit a785eee

File tree

4 files changed: +13758, -0 lines changed


README.md

Lines changed: 18 additions & 0 deletions
@@ -203,6 +203,24 @@ We also provide benchmark 0-shot and 5-shot results on a variety of NLP datasets

Evaluations were performed in GPT-NeoX using the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness), and are viewable by model and step at `results/json/v1.1-evals/*` in this repository.

## Reproducing Memorization Results
The memorization evaluation script `memorization/eval_memorization.py` assumes that it is run as a distributed process, ideally on Slurm. To reproduce the evaluation, follow these steps.

1. Change the `prefix` and `idx_path` local variables of `generate_function()` to point to the correct document and index paths.

2. If you are not using [Slurm](https://slurm.schedmd.com/documentation.html), you need to change global variables inside the script, such as `RANK` and `NUM_PROCS` (the world size), so that they read the environment variables set by your launcher (see the first sketch after this list).

3. Change the `cache_dir` of the model being loaded (line 172) to point to a locally saved directory of the model. This is necessary because we do **not** want to load the same model multiple times; doing so will lead to errors.

4. The script additionally saves results to AWS S3 buckets (line 205). If you would like to save the results locally instead, you can write `memorization_evals` to a CSV file (see the second sketch after this list).

5. You should now be able to run the script on Slurm (see `memorization/multinode_runner.sbatch` for an example sbatch script).

6. If you are using a different distributed launcher instead, you will need to pass the `MODEL` and `CHECKPOINT` variables appropriately (see `memorization/multinode_runner.sbatch` for an example, and the first sketch after this list).

7. The resulting CSVs can then be combined by simple pandas concatenation (see the last sketch after this list); `memorization/eda.ipynb` shows a full example.

8. You can then generate plots by following `memorization/eda.ipynb`.
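For steps 2 and 6, a minimal sketch of pointing the script's globals at a non-Slurm launcher. The `RANK`/`WORLD_SIZE` names are what `torchrun` exports, and reading `MODEL` and `CHECKPOINT` from the environment is an assumption here; `memorization/multinode_runner.sbatch` shows how they are actually passed under Slurm.

```python
import os

# Slurm exposes these as SLURM_PROCID / SLURM_NTASKS; torchrun and most other
# launchers export RANK / WORLD_SIZE instead. Adapt to whatever your launcher sets.
RANK = int(os.environ.get("RANK", 0))             # index of this process
NUM_PROCS = int(os.environ.get("WORLD_SIZE", 1))  # total number of processes

# Assumed to be provided by the launcher; see memorization/multinode_runner.sbatch
# for how they are passed in the provided sbatch setup.
MODEL = os.environ["MODEL"]                 # e.g. a Pythia model size
CHECKPOINT = int(os.environ["CHECKPOINT"])  # training step of the checkpoint to evaluate
```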
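For step 4, a sketch of writing the results locally instead of uploading to S3, assuming `memorization_evals` is the list of per-sequence records accumulated earlier in the script; the column names are placeholders to be matched against the actual contents.

```python
import pandas as pd

# Placeholder columns: replace them with the fields that memorization_evals
# actually accumulates in eval_memorization.py.
df = pd.DataFrame(memorization_evals, columns=["index", "accuracy"])
df.to_csv(f"memorization_evals_rank{RANK}.csv", index=False)
```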
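For step 7, one way to do the pandas concatenation once every rank has written its CSV; the file-name pattern is illustrative, and `memorization/eda.ipynb` shows the actual workflow used for the published results.

```python
import glob

import pandas as pd

# Stack the per-rank CSVs into a single frame for analysis and plotting.
files = sorted(glob.glob("memorization_evals_rank*.csv"))
combined = pd.concat((pd.read_csv(f) for f in files), ignore_index=True)
combined.to_csv("memorization_evals_all.csv", index=False)
```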


## Citation Details
