OpenMS
2.8.0
|
Helper class for calculations on decoy proteins. More...
#include <OpenMS/DATASTRUCTURES/FASTAContainer.h>
Classes | |
struct | Result |
Static Public Member Functions | |
template<typename T > | |
static Result | findDecoyString (FASTAContainer< T > &proteins) |
Heuristic to determine the decoy string given a set of protein names. More... | |
Static Public Attributes | |
static const std::vector< std::string > | affixes = { "decoy", "dec", "reverse", "rev", "reversed", "__id_decoy", "xxx", "shuffled", "shuffle", "pseudo", "random" } |
static const std::string | regexstr_prefix = std::string("^(") + ListUtils::concatenate<std::string>(affixes, "_*|") + "_*)" |
static const std::string | regexstr_suffix = std::string("(_") + ListUtils::concatenate<std::string>(affixes, "*|_") + ")$" |
Private Types | |
using | DecoyStringToAffixCount = std::unordered_map< std::string, std::pair< Size, Size > > |
using | CaseInsensitiveToCaseSensitiveDecoy = std::unordered_map< std::string, std::string > |
Helper class for calculations on decoy proteins.
struct OpenMS::DecoyHelper::Result |
|
private |
|
private |
|
inlinestatic |
Heuristic to determine the decoy string given a set of protein names.
Tested decoy strings are "decoy", "dec", "reverse", "rev", "__id_decoy", "xxx", "shuffled", "shuffle", "pseudo" and "random". Both prefix and suffix is tested and if one of the candidates above is found in at least 40% of all proteins, it is returned as the winner (see DecoyHelper::Result).
References OPENMS_LOG_DEBUG, OPENMS_LOG_ERROR, OPENMS_LOG_WARN, OpenMS::StringUtils::prefix(), DecoyHelper::regexstr_prefix, DecoyHelper::regexstr_suffix, OpenMS::StringUtils::suffix(), and String::toLower().
|
inlinestatic |
|
inlinestatic |
Referenced by DecoyHelper::findDecoyString().
|
inlinestatic |
Referenced by DecoyHelper::findDecoyString().