ProteomicsLFQ performs label-free quantification of peptides and proteins.
Input:
- Spectra in mzML format
- Identifications in idXML or mzIdentML format with posterior error probabilities as score type. To generate those we suggest to run: 1. PeptideIndexer to annotate target and decoy information. 2. PSMFeatureExtractor to annotate percolator features. 3. PercolatorAdapter tool (score_type = 'q-value', -post-processing-tdc) 4. IDFilter (pep:score = 0.01) to filter PSMs at 1% FDR
- An experimental design file:
(see ExperimentalDesign for details)
- A protein database in with appended decoy sequences in FASTA format
(e.g., generated by the OpenMS DecoyDatabase tool)
Processing:
ProteomicsLFQ has different methods to extract features: ID-based (targeted only), or both ID-based and untargeted. 1. The first method uses targeted feature dectection using RT and m/z information derived from identification data to extract features. Note: only identifications found in a particular MS run are used to extract features in the same run. No transfer of IDs (match between runs) is performed. 2. The second method adds untargeted feature detection to obtain quantities from unidentified features. Transfer of Ids (match between runs) is performed by transfering feature identifications to coeluting, unidentified features with similar mass and RT in other runs.
Requantification: 2. Optionally, a requantification step is performed that tries to fill NA values. If a peptide has been quantified in more than half of all maps, the peptide is selected for requantification. In that case, the mean observed RT (and theoretical m/z) of the peptide is used to perform a second round of targeted extraction. Output:
- mzTab file with analysis results
- MSstats file with analysis results for statistical downstream analysis in MSstats
- ConsensusXML file for visualization and further processing in OpenMS
experiments TODO: