This repository contains a reference implementation of SAMI (Self-Supervised Alignment with Mutual Information) using the TL;DR dataset.
- Set up a conda environment (we used `python==3.10.0`) and install the required dependencies by running `pip install -e .`.
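  For example, a minimal setup sketch (the environment name `sami` here is our own choice, not something the repository prescribes):

  ```bash
  # create and activate a fresh environment with the Python version used in the paper
  conda create -n sami python=3.10.0
  conda activate sami

  # install the repository and its dependencies in editable mode (run from the repo root)
  pip install -e .
  ```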
- Adjust the `experiments/tldr/config/generate.yaml` config file to match your directories and desired configurations. Example constitutions using principles written by `mistral-7b` and `claude-opus` are provided in `constitutions_mistral` and `constitutions_opus`.
- Navigate to `experiments/tldr` and run `python generate.py` to generate your own data. By default, the generated data will be stored in `experiments/tldr/data/base`. Note that this directory is already populated with the data used in the paper, so you can skip this step and finetune a model directly if you prefer.
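  For example:

  ```bash
  cd experiments/tldr
  python generate.py  # writes generated data to data/base by default
  ```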
- Select a model configuration (e.g., `mistral-7b`) from the `experiments/tldr/conf/model` directory and update the `cache_dir` accordingly (e.g., `/scr/YOUR_USERNAME/sami/checkpoints`).
- Adjust the `experiments/tldr/conf/train_sami.yaml` config as needed, including optional wandb logging. If you set `log: true`, make sure you have a wandb account and are logged in (e.g., via `wandb login`).
- Navigate to `experiments/tldr` and run training as an interactive job using the command below, or adapt the example slurm script to meet your computing needs and submit it using `sbatch` (or modify the script to be a standard bash script and submit it from, e.g., a `tmux` window).
```bash
python train.py \
    training.beta=0.0 \
    wandb.name="$YOUR_WANDB_NAME" \
    training.checkpoint_dir="$YOUR_CHECKPOINT_DIR" \
    training.lr=5e-7 \
    data_path="data/base" \
    data_file="base_mistral_from_mistral_principles.json" \
    n_examples=2000
```
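The dotted arguments are config overrides: each one replaces the corresponding field in `train_sami.yaml` (e.g., `training.lr=5e-7` overrides the learning rate).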
- Adjust the `experiments/tldr/config/evaluate.yaml` configuration, navigate to `experiments/tldr`, and run `python evaluate.py`. This will write the generated responses into `experiments/tldr/results/responses`.
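  For example:

  ```bash
  cd experiments/tldr
  python evaluate.py
  ls results/responses  # generated responses are written here
  ```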
- Compute win rates by adjusting the `experiments/tldr/config/win_rates.yaml` configuration and running `python win_rates.py` from the same directory. Note that this script currently uses Azure, so if you don't have access to GPT-4 via Azure, you might have to copy `/scr/models/openai_models/azure.py` and create your own `AsyncOpenAI` class. FYI: we used the `gpt-4-0613` snapshot for all evaluations.
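If you need to replace the Azure client, a minimal sketch of a wrapper around the standard `AsyncOpenAI` client from the `openai` v1.x SDK might look like the following (the class name, method, and prompts are illustrative only; you would still need to match whatever interface `win_rates.py` expects):

```python
# Illustrative minimal replacement for the Azure client: a thin wrapper around
# the standard (non-Azure) AsyncOpenAI client from the `openai` v1.x SDK.
# Class and method names here are our own, not part of the repository.
import asyncio
import os

from openai import AsyncOpenAI


class SimpleAsyncChatClient:
    """Bare-bones async chat-completion wrapper (illustrative only)."""

    def __init__(self, model: str = "gpt-4-0613"):
        self.model = model
        self.client = AsyncOpenAI(api_key=os.environ["OPENAI_API_KEY"])

    async def complete(self, system_prompt: str, user_prompt: str) -> str:
        response = await self.client.chat.completions.create(
            model=self.model,
            messages=[
                {"role": "system", "content": system_prompt},
                {"role": "user", "content": user_prompt},
            ],
        )
        return response.choices[0].message.content


async def main() -> None:
    client = SimpleAsyncChatClient()
    answer = await client.complete(
        "You are a helpful judge.",
        "Which of the two summaries below is better? ...",
    )
    print(answer)


if __name__ == "__main__":
    asyncio.run(main())
```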
If you don't have access to GPUs, you can attempt to run training using `experiments/tldr/conf/model/mistral_tiny_base`, which we tested locally on an Apple M2 Pro (2023 MacBook Pro with 16 GB of memory).
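For example (assuming the model config is selected via a `model=` override, which is our reading of the config layout rather than a documented flag):

```bash
cd experiments/tldr
python train.py model=mistral_tiny_base
```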
The `SAMITrainer` and `train.py` use FSDP (`FullyShardedDataParallel`). To learn more about FSDP, you may find the PyTorch FSDP tutorial series and the DDP tutorial series helpful.
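For orientation, here is a minimal, generic FSDP sketch (a toy model and a single training step of our own, not the repository's actual trainer code):

```python
# Generic FSDP sketch (illustrative only). Launch with, e.g.:
#   torchrun --nproc_per_node=2 fsdp_sketch.py
import os

import torch
import torch.distributed as dist
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP


def main() -> None:
    # torchrun sets RANK, WORLD_SIZE, and LOCAL_RANK for each process
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # toy model standing in for a language model
    model = nn.Sequential(
        nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 512)
    ).cuda()

    # shard parameters, gradients, and optimizer state across ranks
    model = FSDP(model)
    optimizer = torch.optim.AdamW(model.parameters(), lr=5e-7)

    # one dummy training step
    batch = torch.randn(8, 512, device="cuda")
    loss = model(batch).pow(2).mean()
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```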
If you found this work useful, please cite:
```bibtex
@article{fränken2024selfsupervised,
      title={Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels},
      author={Jan-Philipp Fränken and Eric Zelikman and Rafael Rafailov and Kanishk Gandhi and Tobias Gerstenberg and Noah D. Goodman},
      year={2024},
      eprint={2404.14313},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```