EnzymaticDigestion.h
1 // --------------------------------------------------------------------------
2 // OpenMS -- Open-Source Mass Spectrometry
3 // --------------------------------------------------------------------------
4 // Copyright The OpenMS Team -- Eberhard Karls University Tuebingen,
5 // ETH Zurich, and Freie Universitaet Berlin 2002-2017.
6 //
7 // This software is released under a three-clause BSD license:
8 // * Redistributions of source code must retain the above copyright
9 // notice, this list of conditions and the following disclaimer.
10 // * Redistributions in binary form must reproduce the above copyright
11 // notice, this list of conditions and the following disclaimer in the
12 // documentation and/or other materials provided with the distribution.
13 // * Neither the name of any author or any participating institution
14 // may be used to endorse or promote products derived from this software
15 // without specific prior written permission.
16 // For a full list of authors, refer to the file AUTHORS.
17 // --------------------------------------------------------------------------
18 // THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
19 // AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
20 // IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
21 // ARE DISCLAIMED. IN NO EVENT SHALL ANY OF THE AUTHORS OR THE CONTRIBUTING
22 // INSTITUTIONS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
23 // EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
24 // PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS;
25 // OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY,
26 // WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR
27 // OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF
28 // ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
29 //
30 // --------------------------------------------------------------------------
31 // $Maintainer: Chris Bielow, Xiao Liang $
32 // $Authors: Marc Sturm, Chris Bielow $
33 // --------------------------------------------------------------------------
34 
35 #ifndef OPENMS_CHEMISTRY_ENZYMATICDIGESTION_H
36 #define OPENMS_CHEMISTRY_ENZYMATICDIGESTION_H
37 
38 #include <OpenMS/CONCEPT/Types.h>
41 
42 #include <string>
43 #include <vector>
44 
45 namespace OpenMS
46 {
61  class OPENMS_DLLAPI EnzymaticDigestion
62  {
63 public:
65  enum Specificity
66  {
67  SPEC_FULL, //< fully enzyme specific, e.g., tryptic (ends with KR, AA-before is KR), or peptide is at protein terminal ends
68  SPEC_SEMI, //< semi specific, i.e., one of the two cleavage sites must fulfill requirements
69  SPEC_NONE, //< no requirements on start / end
70  SIZE_OF_SPECIFICITY
71  };
73  static const std::string NamesOfSpecificity[SIZE_OF_SPECIFICITY];
74 
76  static const std::string UnspecificCleavage;
77 
80 
83 
85  EnzymaticDigestion& operator=(const EnzymaticDigestion& rhs);
86 
88  Size getMissedCleavages() const;
89 
91  void setMissedCleavages(Size missed_cleavages);
92 
94  String getEnzymeName() const;
95 
97  void setEnzyme(const String name);
98 
100  Specificity getSpecificity() const;
101 
103  void setSpecificity(Specificity spec);
104 
107  static Specificity getSpecificityByName(const String& name);
108 
110  void digest(const AASequence& protein, std::vector<AASequence>& output) const;
111 
113  void digestUnmodifiedString(const StringView sequence, std::vector<StringView>& output, Size min_length = 1, Size max_length = 0) const;
114 
116  Size peptideCount(const AASequence& protein);
117 
119  bool isValidProduct(const AASequence& protein, Size pep_pos, Size pep_length, bool methionine_cleavage = false, bool ignore_missed_cleavages = true) const;
120 
123  bool isValidProduct(const String& protein, Size pep_pos, Size pep_length, bool methionine_cleavage = false, bool ignore_missed_cleavages = true) const;
124 
125 protected:
127  std::vector<Size> tokenize_(const String& protein) const;
128 
135  inline Size countMissedCleavages_(const std::vector<Size>& cleavage_positions, Size pep_start, Size pep_end) const;
136 
138  Size missed_cleavages_;
139 
141  Enzyme enzyme_;
142 
144  Specificity specificity_;
145  };
146 
147 } // namespace OpenMS
148 
149 #endif // OPENMS_CHEMISTRY_ENZYMATICDIGESTION_H
150 
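The listing above only declares the interface (`digest`, `digestUnmodifiedString`, `setMissedCleavages`, `tokenize_`), so the following is a minimal, self-contained sketch of how cleavage-site tokenization and a missed-cleavage limit can interact. It is not the OpenMS implementation. The helper names `cleavageStarts` and `digestSketch` are hypothetical, the cleavage rule is a simplified trypsin rule (cleave after K or R unless followed by P), and treating `max_length == 0` as "no upper limit" is an assumption suggested by the default argument of `digestUnmodifiedString`. The order of the returned peptides may also differ from what OpenMS produces.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not part of OpenMS): returns the start index of every
// fragment under a simplified trypsin rule: cleave after K or R, except
// when the following residue is P.
std::vector<std::size_t> cleavageStarts(const std::string& protein)
{
  std::vector<std::size_t> starts;
  if (protein.empty()) return starts;
  starts.push_back(0); // the first fragment always starts at the N-terminus
  for (std::size_t i = 0; i + 1 < protein.size(); ++i)
  {
    const bool after_kr = (protein[i] == 'K' || protein[i] == 'R');
    if (after_kr && protein[i + 1] != 'P') starts.push_back(i + 1);
  }
  return starts;
}

// Hypothetical helper (not part of OpenMS): enumerates every peptide that
// spans at most `missed_cleavages` internal cleavage sites.
// Assumption: max_length == 0 means "no upper length limit".
std::vector<std::string> digestSketch(const std::string& protein,
                                      std::size_t missed_cleavages,
                                      std::size_t min_length = 1,
                                      std::size_t max_length = 0)
{
  std::vector<std::string> peptides;
  const std::vector<std::size_t> starts = cleavageStarts(protein);
  for (std::size_t i = 0; i < starts.size(); ++i)
  {
    // mc is the number of cleavage sites skipped inside the peptide
    for (std::size_t mc = 0; mc <= missed_cleavages && i + mc < starts.size(); ++mc)
    {
      const std::size_t end =
        (i + mc + 1 < starts.size()) ? starts[i + mc + 1] : protein.size();
      const std::size_t len = end - starts[i];
      assert(len > 0); // fragment starts are strictly increasing
      if (len < min_length) continue;
      if (max_length != 0 && len > max_length) continue;
      peptides.push_back(protein.substr(starts[i], len));
    }
  }
  return peptides;
}
```

For the input `ACKDEFRGHK`, this sketch finds fragment starts at 0, 3, and 7. With zero missed cleavages it yields `ACK`, `DEFR`, and `GHK`; allowing one missed cleavage adds `ACKDEFR` and `DEFRGHK`.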
String
A more convenient string class.
Definition: String.h:57

SPEC_NONE
Definition: EnzymaticDigestion.h:69

EnzymaticDigestion
Class for the enzymatic digestion of proteins.
Definition: EnzymaticDigestion.h:61

AASequence
Representation of a peptide/protein sequence.
Definition: AASequence.h:108

OpenMS
Main OpenMS namespace.
Definition: FeatureDeconvolution.h:47

Specificity
when querying for valid digestion products, this determines if the specificity of the two peptide end...
Definition: EnzymaticDigestion.h:65

SPEC_FULL
Definition: EnzymaticDigestion.h:67

SPEC_SEMI
Definition: EnzymaticDigestion.h:68

Enzyme enzyme_
Used enzyme.
Definition: EnzymaticDigestion.h:141

Specificity specificity_
Specificity of the enzyme.
Definition: EnzymaticDigestion.h:144

Size missed_cleavages_
Number of missed cleavages.
Definition: EnzymaticDigestion.h:138

size_t Size
Size type, e.g. used for variables that hold the result of size().
Definition: Types.h:128

static const std::string UnspecificCleavage
Name for unspecific cleavage.
Definition: EnzymaticDigestion.h:76

Enzyme
Representation of an enzyme.
Definition: Enzyme.h:56

StringView
StringView provides a non-owning view on an existing string.
Definition: String.h:480
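The `Specificity` values (`SPEC_FULL`, `SPEC_SEMI`, `SPEC_NONE`) control how strictly the two ends of a candidate peptide are checked when `isValidProduct` is queried. The following is a rough, self-contained sketch of that idea only, not the OpenMS implementation. The names `SpecificitySketch`, `isBoundary`, and `isValidProductSketch` are hypothetical, the cleavage rule is a simplified trypsin rule (after K or R, not before P), and the `methionine_cleavage` and `ignore_missed_cleavages` parameters of the real method are deliberately left out.

```cpp
#include <cstddef>
#include <string>

// Hypothetical stand-in for EnzymaticDigestion::Specificity.
enum SpecificitySketch { SKETCH_FULL, SKETCH_SEMI, SKETCH_NONE };

// Hypothetical helper: true if `pos` is a valid peptide boundary, i.e. a
// protein terminus or a cleavage site under the simplified trypsin rule.
bool isBoundary(const std::string& protein, std::size_t pos)
{
  if (pos > protein.size()) return false;
  if (pos == 0 || pos == protein.size()) return true; // protein termini
  const char before = protein[pos - 1];
  return (before == 'K' || before == 'R') && protein[pos] != 'P';
}

// Hypothetical helper: checks the peptide protein[pep_pos, pep_pos + pep_length)
// against the requested specificity. Missed cleavages are not counted here.
bool isValidProductSketch(const std::string& protein,
                          std::size_t pep_pos,
                          std::size_t pep_length,
                          SpecificitySketch spec)
{
  // reject empty peptides and peptides that do not fit into the protein
  if (pep_length == 0 || pep_pos >= protein.size()) return false;
  if (pep_length > protein.size() - pep_pos) return false;

  if (spec == SKETCH_NONE) return true; // no requirements on start / end

  const bool start_ok = isBoundary(protein, pep_pos);
  const bool end_ok = isBoundary(protein, pep_pos + pep_length);
  if (spec == SKETCH_FULL) return start_ok && end_ok; // both ends must match
  return start_ok || end_ok;                          // SKETCH_SEMI: one end suffices
}
```

In the protein `ACKDEFRGHK`, the peptide `DEFR` (position 3, length 4) passes the full check, `EFR` (position 4, length 3) passes only the semi-specific check because just its C-terminal end is a cleavage site, and `EF` (position 4, length 2) is accepted only when no specificity is required.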

OpenMS / TOPP release 2.3.0 Documentation generated on Tue Jan 9 2018 18:22:00 using doxygen 1.8.13