Open-source, GPU-accelerated molecular dynamics engine for alchemical free-energy simulations. Built on top of Sire and OpenMM. The code is still under active development and is not yet ready for general use.
First create a conda environment using the provided environment file:
conda env create -f environment.yaml
(We recommend using Miniforge.)
Note
On macOS, you will need to use the environment_macos.yaml file instead.
Now install somd2 into the environment:
conda activate somd2
pip install --editable .
You should now have a somd2 executable in your path. To test, run:
somd2 --help
In order to run an alchemical free-energy simulation you will first need to create a stream file containing the perturbable system of interest. This can be created using BioSimSpace, for example by following the tutorial here. Once the system is created, it can be streamed to file using, e.g.:
import BioSimSpace as BSS
BSS.Stream.save(system, "perturbable_system")
You can then run a simulation with:
somd2 perturbable_system.bss
The help message provides information on all of the supported options, along
with their default values. Options can be specified on the command line, or
using a YAML configuration file, passed with the --config option. Any options
explicitly set on the command line will override those set via the config file.
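As a rough sketch, a configuration file might look something like the following (the option names and values shown here are illustrative assumptions; check the somd2 --help output for the exact names and defaults):
num_lambda: 11
runtime: "1ns"
output_directory: output1
This would then be passed as somd2 perturbable_system.bss --config config.yaml.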
An example perturbable system for a methane to ethanol perturbation in solvent
can be found here.
This is a bzip2 compressed file that will need to be extracted before use.
In order to run using GPUs you will first need to set the relevant environment
variable. For example, to run using 4 CUDA-enabled GPUs set CUDA_VISIBLE_DEVICES=0,1,2,3
(for OpenCL and HIP use OPENCL_VISIBLE_DEVICES and HIP_VISIBLE_DEVICES respectively).
By default, SOMD2 will run using the CPU platform. However, if the relevant
environment variable has been set (as above), the new platform will be detected
and used. If this detection fails, or if there are multiple platforms
available, the --platform option can be set explicitly (for example, --platform cuda).
By default, SOMD2 will automatically manage the distribution of lambda windows
across all listed devices. To restrict the number of devices used,
the --max_gpus option can be set; for example, setting max_gpus=2 while
CUDA_VISIBLE_DEVICES is set as above would restrict SOMD2 to using only
GPUs 0 and 1.
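For example, a typical multi-GPU invocation might look like the following (the input file name is taken from the example above; confirm the exact flag spelling via somd2 --help):
export CUDA_VISIBLE_DEVICES=0,1,2,3
somd2 perturbable_system.bss --platform cuda --max_gpus 2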
SOMD2 supports Hamiltonian replica exchange (HREX) simulations, which can be
enabled using the --replica-exchange option. Note that dynamics contexts will
be created up-front for all replicas, so this can be memory intensive. As such,
replica exchange is intended for use on multi-GPU nodes with a large amount of
memory. For optimal performance, it is recommended that the number of replicas
be a multiple of the number of GPUs. It is also possible to oversubscribe the
GPUs, i.e. have more than one replica running on a GPU at a time. This can be
controlled via the --oversubscription-factor option, e.g. a value of 2 would
allow 2 replicas to run on each GPU at a time.
The swap frequency for replica exchange is controlled by the --energy-frequency
option, i.e. we compute the energies for all replicas at this frequency, then
attempt to mix the replicas. A larger value will improve performance, but may
reduce the efficiency of the exchange.
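For example, an HREX run that oversubscribes each GPU with two replicas might look like this (illustrative):
somd2 perturbable_system.bss --replica-exchange --oversubscription-factor 2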
We also support Replica Exchange with Solute Scaling
(REST2) simulations to facilitate sampling for perturbations
involving conformational changes, e.g. ring flips. This can be enabled
using the --rest2-scale option, which specifies the "temperature" of the
REST2 region relative to the rest of the system. By default, the REST2 region
comprises all atoms in perturbable molecules, but can be controlled via the
--rest2-selection option. This should be a Sire selection string that specifies
additional atoms of interest, i.e. those in regular, non-perturbable molecules.
If the selection does contain atoms within perturbable molecules, then only
those atoms within the perturbable molecules will be considered as part of the
REST2 region, i.e. you can select a subset of atoms within a perturbable
molecule to be scaled.
By default, the REST2 schedule is a triangular function that starts and ends
at 1.0, with a peak at the middle of the lambda schedule corresponding to
the value of --rest2-scale. By passing multiple values for --rest2-scale, the
user can fully control the schedule. When doing so, the number of values must
match the number of lambda windows.
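As an illustrative sketch (the selection string here is hypothetical and the scale value arbitrary), a REST2 run that also scales a chosen residue in a non-perturbable molecule might look like:
somd2 perturbable_system.bss --replica-exchange --rest2-scale 1.5 --rest2-selection "resname HIS"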
SOMD2 also supports grand canonical Monte Carlo (GCMC) water sampling using
the loch package. This can be enabled
using the --gcmc option. To define a GCMC region, use the --gcmc-selection
option, which should be a Sire selection string that specifies the atoms
defining the centre of geometry for the GCMC region. The radius of the GCMC
sphere can be controlled using the --gcmc-radius option. To see all GCMC
related options, run:
somd2 --help | grep -A2 '  --gcmc'
Note
GCMC is currently only supported when using the CUDA platform and isn't
available on macOS, since the pycuda package is not available there.
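As a sketch (the selection string, radius value, and unit format are illustrative; see the help output above for details), a GCMC run might look like:
somd2 perturbable_system.bss --gcmc --gcmc-selection "resname LIG" --gcmc-radius "5A"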
Simulation output will be written to the directory specified using the
--output-directory parameter. This will contain a number of files, including
Parquet files for the energy
trajectories of each λ window. These can be processed using
BioSimSpace as follows:
import BioSimSpace as BSS
pmf1, overlap1 = BSS.FreeEnergy.Relative.analyse("output1")
(Here we assume that the output directory is called output1.)
To compute the relative free-energy difference between two legs, e.g. legs 1 and 2, you can use:
pmf2, overlap2 = BSS.FreeEnergy.Relative.analyse("output2")
free_nrg = BSS.FreeEnergy.Relative.difference(pmf1, pmf2)
When running HREX with a large number of replicas it can become computationally
expensive to compute energies. (We need the energies of each replica at each
lambda value.) As a shortcut, it's possible to truncate the neighbourhood of
windows for which we compute energies, then use a large null energy for the
remaining windows. This can be controlled via the --num-energy-neighbours option.
For example, setting this to 2 would compute energies for the current window and
its two neighbours on either side. The value assigned to the remaining windows
can be controlled via the --null-energy option. The number of neighbours should
be chosen as a trade-off between accuracy and computational cost. A value of around
20% of the number of replicas has been found to be a good starting point.
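For example, with 40 replicas a starting point might be something like (illustrative):
somd2 perturbable_system.bss --replica-exchange --num-energy-neighbours 8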
We support modification of ghost atom bonded terms to avoid spurious coupling
to the physical system using the approach described in
this paper.
These modifications are enabled by default, but can be disabled using the --no-ghost-modifications
option. Modifications are implemented using the ghostly
package.
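For example, to turn the modifications off (illustrative):
somd2 perturbable_system.bss --no-ghost-modifications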
For existing users of somd1, it's possible to generate input for somd2 by passing
--somd2 True to the prepareFEP.py setup script. This will write a somd2-compatible
stream file.
Additionally, somd2 can be run in somd1 compatibility mode by passing the
--somd1-compatibility command-line option to the somd2 executable. This ensures
that the perturbation used is consistent with the approach from somd1, i.e.
it uses the same modifications for bonded-terms involving dummy atoms as somd1.
Finally, it is also possible to run somd2 using an existing somd1 perturbation
file. To do so, you will also need to create a stream file representing the
λ = 0 state. For existing input generated by prepareFEP.py, this can be done as
follows. (This assumes that the output has a prefix somd1.)
import BioSimSpace as BSS
# Load the lambda = 0 state from prepareFEP.py
system = BSS.IO.readMolecules(["somd1.prm7", "somd1.rst7"])
# Write a stream file.
BSS.Stream.save(system, "somd1")
(This will write a stream file called somd1.bss.)
This can then be run with somd2 using the following:
somd2 somd1.bss --pert-file somd1.pert --somd1-compatibility
(This shows only the minimal options required. Others will take default values and can be set as needed.)
If you want to load an existing system from a perturbation file and use the
new somd2 ghost atom bonded-term modifications,
then simply omit the --somd1-compatibility option.
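For example:
somd2 somd1.bss --pert-file somd1.pert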
If you have an NVIDIA GPU that supports the Multi-Process Service (MPS), you can oversubscribe the GPU to run multiple OpenMM contexts at once, increasing the throughput of your simulation. To do this, you will first need to enable MPS by running the following command:
nvidia-cuda-mps-control -d
The number of contexts that can be run in parallel is then controlled by the
--oversubscription-factor option, which defaults to 1.
More details on MPS, including tuning options, can be found in the following technical blog.
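With MPS enabled, a run allowing two contexts per GPU might look like this (illustrative):
somd2 perturbable_system.bss --oversubscription-factor 2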
SOMD2 can also be used as a Python API, allowing it to be embedded
within other Python scripts.
If using the regular Runner class via the Python API, you will need to
guard calls to its run() method within an if __name__ == "__main__": block,
since it uses multiprocessing with the spawn start method.
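A minimal sketch of API usage is shown below. The module paths, the Config options, and the Runner constructor signature are assumptions; consult the somd2 source or API documentation for the definitive interface.
import sire as sr
from somd2.config import Config
from somd2.runner import Runner

if __name__ == "__main__":
    # Load the perturbable system from a stream file (assumed workflow).
    system = sr.stream.load("perturbable_system.bss")
    # Configure the simulation (option names are assumptions; see somd2 --help).
    config = Config(output_directory="output1")
    # Guard run() with the __main__ check, since the Runner uses
    # multiprocessing with the spawn start method.
    runner = Runner(system, config)
    runner.run()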
During a checkpoint cycle, trajectory frames are stored in memory before being paged to disk. When running replica exchange simulations with a large number of replicas, this can exceed the temporary file storage limit on some systems, causing the simulation to hang. This can be resolved either by reducing the frequency at which frames are stored, or by checkpointing more frequently. (Frames are written to disk and cleared from memory at each checkpoint.)
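For example, the frame and checkpoint intervals might be adjusted with something like the following (the option names and time values are assumptions; check somd2 --help for the exact spelling and defaults):
somd2 perturbable_system.bss --replica-exchange --frame-frequency "50ps" --checkpoint-frequency "100ps"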