GitHub - hivdb/sgsdb: Scripts to generate SGS database

Steps for updating the SGS database

Install dependencies with pipenv install. Python 3.9 is required.
Update data/SGS.sequences.fact.csv spreadsheet with latest data. Follow Steps for adding studies for creating intermediate files.
Run command make fasta; wait until the command finished.
Run command make sierra; wait until the command finished.
Run command make build stat; wait until the command finished.
Run command git add data/upload, commit and push.

Steps for adding studies

pipenv run python scripts/add_study.py multiple <PMID1> <PMID2> <PMID3> ...

# or

pipenv run python scripts/add_study.py single <PMID> [ACCESSION1, ACCESSION2, ...]

Alternative way:

Find Genbank IDs for this study.
In Mangabey data entry program, check "No reference" checkbox in reference entry page then "Continue".
Click "Nucleotide Sequences".
Click "Add Genbank Sequence(s)" then type the Genbank IDs. Wait until all sequences loaded.
"Download" the output file.
Merge the downloaded TSV file with data/SGS.sequences.fact.csv. Remember to delete all non-pol sequences.

Fields

MedlineID
Accession
CollectionDate: Format YYYY-MM-DD
Source: Plasma, PBMC, etc
PtIdentifier: Format '{InternalRefID}-{SourcePtID}'
CometSubtype: Use COMET to determine subtype; can be empty
Rx: ART or None
DateAdded: Format YYYY-MM-DD
_Include: Always False (only used in research)
_Reservoir: Is this sample collected from virus reservoir?

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
data		data
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Steps for updating the SGS database

Steps for adding studies

Fields

About

Releases

Packages

Languages

License

hivdb/sgsdb

Folders and files

Latest commit

History

Repository files navigation

Steps for updating the SGS database

Steps for adding studies

Fields

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages