-
Notifications
You must be signed in to change notification settings - Fork 14
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Squashed 'lib/foldseek/' changes from e00a3dc1..15c0516f
15c0516f Remove --tar-include and --tar-exclude from createdb as they were never used e1394aac Rework createdb to correctly allow for only one directory or (new) tsv input, in additonal to loose files 0c3b7f23 scorecomplex minor update c4a4b6a4 scorecomplex redundant complex alignment re-solve e77b6431 scorecomplex complex alignment redundancy solved d725f0e3 scorecomplex rbh filtering with 0.7 as the margin d785457c change names for tm based rbh filtering 405e64c2 scorecomplex: rbh filtering with query tm score e5c52dda scorecomplex commit for benchmark test 537c6160 scorecomplex commit for benchmark test 4f3bc395 Merge branch 'master' of https://github.com/steineggerlab/foldseek 5b789cfd scorecomplex rollback to DBSCAN 493cefe7 Add mode to compute exact (slow) tmscore. 75a50f7c Fix steineggerlab/foldseek#214 38e5e93f Merge pull request #244 from steineggerlab/test 6b1dd706 fix typo c388d483 confilcts solved ecf85daf final update scorecomplex with nearest neighbors 802235db backtrace related issue detour f629bbe1 Merge commit '8faebba3f96210242892943d37e4fe9e8a5eed8d' 8faebba3 Squashed 'lib/mmseqs/' changes from 22a77eeb..950342d9 dc272d56 complexsearch with DBSCAN commit for benchmark a7fefa22 bitscore margine change a6b1928e DBSCAN update retry 25031967 foldseek DBSCAN with RBH filter and NN rescue ac2b1dcf nearest neighbors update 41c7f9c9 nearest neighbors update e4079c49 new nearest neighbors 2742f469 new nearest neighbors 4d426356 nearest neighbors new 6cb3ea6c revoke nearest neighbors only 093af914 test nearest neighbors only2 1dbaac36 test nearest neighbors only 3bf3cdf5 Fix this in a different way f4a1a527 Fix compile on older compilers d3fca9e8 Update citations for databases 39ade546 Update README.md 87caae8e rbh filter margine improvement 6e4184a1 implement getting neareast neighbors in sorecomplex 6e632c30 eps related update revoke 65550247 scorecomplex learningRate=0 issue solved 095102ff infinite loop bug fix scorecomplex 9ca20244 Merge branch 'master' of https://github.com/steineggerlab/foldseek 096613dc commit for pull 0eff0231 Update scorecomplex.cpp: eps related update rollback f690b9d5 Merge branch 'master' of https://github.com/steineggerlab/foldseek da825d55 rbh filtering with bitscore & clustering eps update 852434a4 Add --input-format to createdb to force an input structure format 00ab450f scorecomplex rbh filtering implement 6893dcc5 Add CATH50 steineggerlab/foldseek#232 bb090174 [expandcomplex] eased e-value for the 2nd alignment e9f76df6 update scorecomplex dbscan error fixed 1cb3a80d scorecomplex alignment clustering algorithm update 5433d6db Update README.md 1bc8d2e5 update to latest MMseqs2-App master c6f4f2a6 fix regression fail 10289c64 scorecomplex DBSCAN impropvement 6816a641 skip single chained complex for scorecomplex 2a187342 complexsearch initial search parameter adjustment 49dabe0d scorecomplex many against many bug fixed real final 1c4fdfa6 scorecomplex many against many bug fix final 9f8a2ef9 scorecomplex many against many bug fixed 3fe1f9e4 Merge branch 'master' of https://github.com/steineggerlab/foldseek c28e7938 fix wrong parameters in easy-complexsearch 035edc18 expandcomplex should now work correctly with both cluster and non-cluster dbs e396ca4d Carry extended dbtype for complexsearch to work with clustered dbs f05703dd Remove std::cout in structurerescorediagonal 7b68363d Merge branch 'master' of https://github.com/steineggerlab/foldseek 886021d2 Fix issue steineggerlab/foldseek#205 592ffa80 fix explanation of complex related tools a6a712c1 fix easycomplexsearch.sh wrong param 799d42ca update EasyComplexSearch; improve expandcomplex stability 258be0fc Update README.md c382b8fa Update README.md d3f4980d Update README.md a417633d Update README.md b156e065 implement complexsearch and sanitizing expandcomplex b220b5a9 Update README.md ec32bee1 update expandcomplex 76ffa031 update expandcomplex 10ba8f53 Fix easy-complexsearch workflow shell scripting issues 75cc763a fix regression failed 206f600a fix regression test failed 9cc02eb7 expandcomplex a695211e expandcomplex dffdf788 Increase buffer size 79f865d6 Add --complex-report-mode to allow disabling report in easy-scorecomplex 38290cf7 Cleanup easy-complexsearch 08f1db5e Expose --db-output for createcomplexreport bdeb0024 Update README.md 629de617 Merge branch 'master' of https://github.com/steineggerlab/foldseek f7857793 Fix sameDB (clusterDB) issue in structurealign 7180ed43 Merge branch 'master' of https://github.com/steineggerlab/foldseek 28a4a7f5 update createcomplexreport with multithreading issue fixed git-subtree-dir: lib/foldseek git-subtree-split: 15c0516fbae0d7e0903ee80f14cb927782b394d0
- Loading branch information
Showing
41 changed files
with
1,439 additions
and
523 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,46 @@ | ||
#!/bin/sh -e | ||
fail() { | ||
echo "Error: $1" | ||
exit 1 | ||
} | ||
|
||
notExists() { | ||
[ ! -f "$1" ] | ||
} | ||
|
||
if notExists "${TMP_PATH}/result.dbtype"; then | ||
# shellcheck disable=SC2086 | ||
"$MMSEQS" search "${QUERYDB}" "${TARGETDB}" "${TMP_PATH}/result" "${TMP_PATH}/search_tmp" ${SEARCH_PAR} \ | ||
|| fail "Search died" | ||
fi | ||
|
||
RESULT="${TMP_PATH}/result" | ||
if [ "$PREFMODE" != "EXHAUSTIVE" ]; then | ||
if notExists "${TMP_PATH}/result_expand_pref.dbtype"; then | ||
# shellcheck disable=SC2086 | ||
"$MMSEQS" expandcomplex "${QUERYDB}" "${TARGETDB}" "${RESULT}" "${TMP_PATH}/result_expand_pref" ${THREADS_PAR} \ | ||
|| fail "Expandcomplex died" | ||
fi | ||
if notExists "${TMP_PATH}/result_expand_aligned.dbtype"; then | ||
# shellcheck disable=SC2086 | ||
"$MMSEQS" $COMPLEX_ALIGNMENT_ALGO "${QUERYDB}" "${TARGETDB}" "${TMP_PATH}/result_expand_pref" "${TMP_PATH}/result_expand_aligned" ${COMPLEX_ALIGN_PAR} \ | ||
|| fail $COMPLEX_ALIGNMENT_ALGO "died" | ||
fi | ||
RESULT="${TMP_PATH}/result_expand_aligned" | ||
fi | ||
if notExists "${TMP_PATH}/complex_result.dbtype"; then | ||
# shellcheck disable=SC2086 | ||
$MMSEQS scorecomplex "${QUERYDB}" "${TARGETDB}" "${RESULT}" "${OUTPUT}" ${SCORECOMPLEX_PAR} \ | ||
|| fail "ScoreComplex died" | ||
fi | ||
|
||
if [ -n "${REMOVE_TMP}" ]; then | ||
# shellcheck disable=SC2086 | ||
"$MMSEQS" rmdb "${TMP_PATH}/result" ${VERBOSITY} | ||
if [ "$PREFMODE" != "EXHAUSTIVE" ]; then | ||
# shellcheck disable=SC2086 | ||
"$MMSEQS" rmdb "${TMP_PATH}/result_expand_aligned" ${VERBOSITY} | ||
fi | ||
rm -rf "${TMP_PATH}/search_tmp" | ||
rm -f "${TMP_PATH}/complexsearch.sh" | ||
fi |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -39,9 +39,9 @@ std::vector<Command> baseCommands = { | |
"Slower, sensitive clustering", | ||
"mmseqs easy-cluster examples/DB.fasta result tmp\n" | ||
"# Cluster output\n" | ||
"# - result_rep_seq.fasta: Representatives\n" | ||
"# - result_all_seq.fasta: FASTA-like per cluster\n" | ||
"# - result_cluster.tsv: Adjacency list\n\n" | ||
"# - result_rep_seq.fasta: Representatives\n" | ||
"# - result_all_seqs.fasta: FASTA-like per cluster\n" | ||
"# - result_cluster.tsv: Adjacency list\n\n" | ||
"# Important parameter: --min-seq-id, --cov-mode and -c \n" | ||
"# --cov-mode \n" | ||
"# 0 1 2\n" | ||
|
@@ -62,9 +62,9 @@ std::vector<Command> baseCommands = { | |
"Fast linear time cluster, less sensitive clustering", | ||
"mmseqs easy-linclust examples/DB.fasta result tmp\n\n" | ||
"# Linclust output\n" | ||
"# - result_rep_seq.fasta: Representatives\n" | ||
"# - result_all_seq.fasta: FASTA-like per cluster\n" | ||
"# - result_cluster.tsv: Adjecency list\n\n" | ||
"# - result_rep_seq.fasta: Representatives\n" | ||
"# - result_all_seqs.fasta: FASTA-like per cluster\n" | ||
"# - result_cluster.tsv: Adjecency list\n\n" | ||
"# Important parameter: --min-seq-id, --cov-mode and -c \n" | ||
"# --cov-mode \n" | ||
"# 0 1 2\n" | ||
|
@@ -130,14 +130,21 @@ std::vector<Command> baseCommands = { | |
"<i:fastaFile1[.gz|.bz2]> ... <i:fastaFileN[.gz|.bz2]>|<i:stdin> <o:sequenceDB>", | ||
CITATION_MMSEQS2, {{"fast[a|q]File[.gz|bz2]|stdin", DbType::ACCESS_MODE_INPUT, DbType::NEED_DATA | DbType::VARIADIC, &DbValidator::flatfileStdinAndGeneric }, | ||
{"sequenceDB", DbType::ACCESS_MODE_OUTPUT, DbType::NEED_DATA, &DbValidator::flatfile }}}, | ||
{"makepaddedseqdb", makepaddedseqdb, &par.onlyverbosity, COMMAND_HIDDEN, | ||
"Generate a padded sequence DB", | ||
"Generate a padded sequence DB", | ||
"Martin Steinegger <[email protected]>", | ||
"<i:sequenceDB> <o:sequenceDB>", | ||
CITATION_MMSEQS2, {{"sequenceDB", DbType::ACCESS_MODE_INPUT, DbType::NEED_DATA|DbType::NEED_HEADER, &DbValidator::sequenceDb }, | ||
{"sequenceIndexDB", DbType::ACCESS_MODE_OUTPUT, DbType::NEED_DATA, &DbValidator::sequenceDb }}}, | ||
{"appenddbtoindex", appenddbtoindex, &par.appenddbtoindex, COMMAND_HIDDEN, | ||
NULL, | ||
NULL, | ||
"Milot Mirdita <[email protected]>", | ||
"<i:DB1> ... <i:DBN> <o:DB>", | ||
CITATION_MMSEQS2, {{"DB", DbType::ACCESS_MODE_INPUT, DbType::NEED_DATA | DbType::VARIADIC, &DbValidator::allDb }, | ||
{"DB", DbType::ACCESS_MODE_INPUT, DbType::NEED_DATA, &DbValidator::allDb }}}, | ||
{"indexdb", indexdb, &par.indexdb, COMMAND_HIDDEN, | ||
{"indexdb", indexdb, &par.indexdb, COMMAND_HIDDEN, | ||
NULL, | ||
NULL, | ||
"Martin Steinegger <[email protected]>", | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.