OpenMS
2.6.0
|
De novo metabolite identification.
CSI:FingerID (Compound Structure Identification: FingerID) is a method for searching a tandem mass spectrum of a small molecule (metabolite) in a database of molecular structures.
To use this feature, the Sirius command line tool as well as a java installation is needed.
Sirius can be found on https://bio.informatik.uni-jena.de/software/sirius/
Please use Sirius Version 4.0.1
If you want to use the software with the Gurobi solver or CPLEX instead of GLPK, please follow the instructions in the sirius manual.
Internal procedure in SiriusAdpater
1. Input mzML (and optional featureXML)
2. Preprocessing (see below)
3. Parsed by SiriusMSConverter into (sirius internal) .ms format
4. Submission of .ms and additional parameters to wrapped SIRIUS.jar
5. Sirius output saved in interal temporary folder structure
6. Sirius output is parsed (SiriusMzTabWriter/CsiFingerIDMzTabWriter)
7. Merge corresponding output in one mzTab (out_sirius/out_fingerid)
Preprocessing (featureXML): By providing a featureXML, the feature information can be used for feature mapping. Sirius will then process the internally merged MS2 spectra allocated to one feature (instead of all available MS2). To reduce the feature space even further a masstrace filter can be set. Additional adduct information can be provided using a featureXML from the MetaboliteAdductDecharger or AccurateMassSearch.
The command line parameters of this tool are:
SiriusAdapter -- Tool for metabolite identification using single and tandem mass spectrometry Full documentation: http://www.openms.de/documentation/UTILS_SiriusAdapter.html Version: 2.6.0 Sep 30 2020, 12:54:34, Revision: c26f752 To cite OpenMS: Rost HL, Sachsenberg T, Aiche S, Bielow C et al.. OpenMS: a flexible open-source software platform for mass spectrometry data analysis. Nat Meth. 2016; 13, 9: 741-748. doi:10.1038/nmeth.3959. To cite SiriusAdapter: Kai Dührkop and Sebastian Böcker. Fragmentation trees reloaded. J Cheminform; 2016. doi:10.1186/s13321-016-0116-8. Kai Dührkop, Huibin Shen, Marvin Meusel, Juho Rousu, and Sebastian Böcker. Searching molecular structure databases with tandem mass spectra using CSI:FingerID. Proceedings of the National Academy of Sciences; 2015. doi:10.1073/pnas.1509788112. Usage: SiriusAdapter <options> Options (mandatory options marked with '*'): -executable <executable> The Sirius executable. Provide a full or relative path , or make sure it can be found in your PATH environmen t. -in <file>* MzML Input file (valid formats: 'mzML') -in_featureinfo <file> FeatureXML input with feature and adduct information (valid formats: 'featureXML') -out_sirius <file> MzTab Output file for SiriusAdapter results (valid formats: 'mzTab') -out_fingerid <file> MzTab output file for CSI:FingerID, if this parameter is given, SIRIUS will search for a molecular structure using CSI:FingerID after determining the sum formula (valid formats: 'mzTab') -out_ms <file> Internal SIRIUS .ms format after OpenMS preprocessing (valid formats: 'ms') -out_workspace_directory <directory> Output directory for SIRIUS workspace Preprocessing: -preprocessing:filter_by_num_masstraces <number> Features have to have at least x MassTraces. To use this parameter feature_only is neccessary (default: '1' min: '1') -preprocessing:precursor_mz_tolerance <value> Tolerance window for precursor selection (Feature sele ction in regard to the precursor) (default: '5.0e-03') -preprocessing:precursor_mz_tolerance_unit <choice> Unit of the precursor_mz_tolerance (default: 'Da' vali d: 'Da', 'ppm') -preprocessing:precursor_rt_tolerance <number> Tolerance window (left and right) for precursor select ion [seconds] (default: '5') -preprocessing:feature_only Uses the feature information from in_featureinfo to reduce the search space to MS2 associated with a featu re. Parameters for SIRIUS and CSI:FingerID: -sirius:profile <choice> Specify the used analysis profile (default: 'qtof' valid: 'qtof', 'orbitrap', 'fticr') -sirius:candidates <number> The number of candidates in the SIRIUS output. (defaul t: '5' min: '1') -sirius:database <choice> Search formulas in given database (default: 'all' vali d: 'all', 'chebi', 'custom', 'kegg', 'bio', 'natural products', 'pubmed', 'hmdb', 'biocyc', 'hsdb', 'knapsa ck', 'biological', 'zinc bio', 'gnps', 'pubchem', 'mes h', 'maconda') -sirius:noise <number> Median intensity of noise peaks (default: '0' min: '0') -sirius:ppm_max <number> Allowed ppm for decomposing masses (default: '10') -sirius:isotope <choice> How to handle isotope pattern data. Use 'score' to use them for ranking or 'filter' if you just want to remove candidates with bad isotope pattern. With 'both ' you can use isotopes for filtering and scoring. Use 'omit' to ignore isotope pattern. (default: 'both' valid: 'score', 'filter', 'both', 'omit') -sirius:elements <text> The allowed elements. Write CHNOPSCl to allow the elem ents C, H, N, O, P, S and Cl. Add numbers in brackets to restrict the maximal allowed occurrence of these elements: CHNOP[5]S[8]Cl[1]. (default: 'CHNOP[5]S[8]Cl [1]') -sirius:compound_timeout <number> Time out in seconds per compound. To disable the timeo ut set the value to 0 (default: '10' min: '0') -sirius:tree_timeout <number> Time out in seconds per fragmentation tree computation . (default: '0' min: '0') -sirius:top_n_hits <number> The number of top hits for each compound written to the CSI:FingerID output (default: '10' min: '1') -sirius:cores <number> The number of cores SIRIUS is allowed to use on the system (default: '1' min: '1') -sirius:auto_charge Use this option if the charge of your compounds is unknown and you do not want to assume [M+H]+ as defaul t. With the auto charge option SIRIUS will not care about charges and allow arbitrary adducts for the prec ursor peak. Common UTIL options: -ini <file> Use the given TOPP INI file -threads <n> Sets the number of threads allowed to be used by the TOPP tool (default: '1') -write_ini <file> Writes the default configuration file --help Shows options --helphelp Shows all options (including advanced)
INI file documentation of this tool: