Skip to content

Commit ba513d4

Browse files
committed
Updating main README and introductory README for scripts
1 parent 200da9e commit ba513d4

File tree

2 files changed

+119
-44
lines changed

2 files changed

+119
-44
lines changed

README.md

+57-44
Original file line numberDiff line numberDiff line change
@@ -1,44 +1,57 @@
1-
[![Build Status](https://travis-ci.org/ModelSEED/ModelSEEDDatabase.svg?branch=master)](https://travis-ci.org/ModelSEED/ModelSEEDDatabase)
2-
3-
# ModelSEEDDatabase
4-
This repository contains the definitive copy of the biochemistry and metadata used to construct models using the ModelSEED/ProbAnno approach
5-
6-
You can contribute by submitting revisions to the biochemistry though pull requests
7-
8-
# Python Environment
9-
10-
1) Install Conda:
11-
https://conda.io/projects/conda/en/latest/user-guide/install/index.html
12-
13-
2) Set up a python 3 conda environment:
14-
```
15-
conda create -n msd-env python=3
16-
```
17-
3) Activate:
18-
```
19-
conda activate rdkit-env
20-
```
21-
22-
4) Install rdkit and openbabel for handling biochemical structures:
23-
```
24-
conda install -c rdkit rdkit
25-
conda install -c openbabel openbabel
26-
```
27-
28-
5) Install eQuilibrator for retrieving thermodynamics data:
29-
```
30-
conda install wxPython
31-
pip install quilt
32-
```
33-
34-
6) Export path to local python libraries
35-
```
36-
export PYTHONPATH=$PYTHONPATH:<path-to-repository>/ModelSEEDDatabase/Libs/Python/
37-
```
38-
39-
7) Test it by running:
40-
```
41-
./Scripts/Structures/Print_Structure_Formula_Charge.py
42-
git status -s
43-
```
44-
(there should be no change in the formula and charge in the compounds.tsv file, if there is, our recommendation is that you panic and throw your laptop out of the window.)
1+
<!--[![Build
2+
Status](https://travis-ci.org/ModelSEED/ModelSEEDDatabase.svg?branch=master)](https://travis-ci.org/ModelSEED/ModelSEEDDatabase)-->
3+
# ModelSEED Biochemistry Database
4+
5+
<a href="https://www.biorxiv.org/content/10.1101/2020.03.31.018663v2">The
6+
ModelSEED Database for the integration of metabolic annotations and
7+
the reconstruction, comparison, and analysis of metabolic models for
8+
plants, fungi, and microbes.</a>
9+
10+
## Abstract
11+
12+
For over ten years, ModelSEED has been a primary resource for the
13+
construction of draft genome-scale metabolic models based on annotated
14+
microbial or plant genomes. Now being released, the biochemistry
15+
database serves as the foundation of biochemical data underlying
16+
ModelSEED and KBase. The biochemistry database embodies several
17+
properties that, taken together, distinguish it from other published
18+
biochemistry resources by: (i) including compartmentalization,
19+
transport reactions, charged molecules and proton balancing on
20+
reactions;; (ii) being extensible by the user community, with all data
21+
stored in GitHub; and (iii) design as a biochemical “Rosetta Stone” to
22+
facilitate comparison and integration of annotations from many
23+
different tools and databases. The database was constructed by
24+
combining chemical data from many resources, applying standard
25+
transformations, identifying redundancies, and computing thermodynamic
26+
properties. The ModelSEED biochemistry is continually tested using
27+
flux balance analysis to ensure the biochemical network is
28+
modeling-ready and capable of simulating diverse
29+
phenotypes. Ontologies can be designed to aid in comparing and
30+
reconciling metabolic reconstructions that differ in how they
31+
represent various metabolic pathways. ModelSEED now includes 33,978
32+
compounds and 36,645 reactions, available as a set of extensible files
33+
on GitHub, and available to search at https://modelseed.org and KBase.
34+
35+
## Related Links
36+
37+
* <a href="https://modelseed.org">ModelSEED</a>
38+
* <a href="https://github.com/ModelSEED/ModelSEEDDatabase">(this) Git repository</a>
39+
* <a href="https://kbase.us">KBase</a>
40+
41+
## Documentation
42+
43+
We provide some documentation for the database and everything required
44+
in this repository to build and maintain it in these directories:
45+
46+
### [Biochemistry](Biochemistry/README.md)
47+
48+
This holds the main database files.
49+
50+
### [Scripts](Scripts/README.md)
51+
52+
This holds the scripts that we use to build and maintain the database.
53+
54+
### [Solr](Solr/README.md)
55+
56+
This holds the Solr schema and examples on how to access the
57+
biochemistry at various endpoints.

Scripts/README.md

+62
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,62 @@
1+
# ModelSEED Biochemistry Database Script Environment
2+
3+
We've developed a small python library for loading and manipulating
4+
the biochemistry database according to our needs and we written a set
5+
of scripts that uses the library for our needs.
6+
7+
The main set of scripts that we used for the most recent publication of the ModelSEED Biochemistry Database are:
8+
9+
* [Biochemistry](Biochemistry/README.md)
10+
11+
* [Structures](Structures/README.md)
12+
13+
* [Thermodynamics](Thermodynamics/README.md)
14+
15+
* [Curation](Curation/README.md)
16+
17+
* [Statistics](Statistics/README.md)
18+
19+
In the last case, the scripts were used to generate some of the figures and tables in our latest publication.
20+
21+
### Python Environment
22+
23+
All these scripts run in a Python 3 environment, and some require additional packages which we explain here:
24+
25+
1) Install Conda
26+
https://conda.io/projects/conda/en/latest/user-guide/install/index.html
27+
28+
2) Set up a python 3 conda environment
29+
```
30+
conda create -n msd-env python=3
31+
```
32+
3) Activate
33+
```
34+
conda activate msd-env
35+
```
36+
37+
4) Install rdkit and openbabel for handling biochemical structures
38+
```
39+
conda install -c rdkit rdkit
40+
conda install -c openbabel openbabel
41+
```
42+
43+
5) Install eQuilibrator for retrieving thermodynamics data
44+
```
45+
conda install wxPython
46+
pip install quilt
47+
pip install equilibrator_api
48+
```
49+
_(you might not need to install `wxPython`, I had trouble with dependencies on a mac)_
50+
51+
6) Export path to local python libraries
52+
```
53+
export PYTHONPATH=$PYTHONPATH:<path-to-repository>/ModelSEEDDatabase/Libs/Python/
54+
```
55+
56+
7) Test
57+
```
58+
./Biochemistry/Reprint_Biochemistry.py
59+
git status -s
60+
```
61+
(The script should run without throwing any errors and there should be
62+
no change in the biochemistry data)

0 commit comments

Comments
 (0)