OpenMS
|
PercolatorAdapter facilitates the input to, the call of and output integration of Percolator. Percolator (http://percolator.ms/) is a tool to apply semi-supervised learning for peptide identification from shotgun proteomics datasets.
pot. predecessor tools | → PercolatorAdapter → | pot. successor tools |
---|---|---|
PSMFeatureExtractor | IDFilter |
Percolator is search engine sensitive, i.e. it's input features vary, depending on the search engine. Must be prepared beforehand. If you do not want to use the specific features, use the generic_feature_set flag. Will incorporate the score attribute of a PSM, so be sure, the score you want is set as main score with IDScoreSwitcher . Be aware, that you might very well experience a performance loss compared to the search engine specific features. You can also perform protein inference with percolator when you activate the protein fdr parameter. Additionally you need to set the enzyme setting. We only read the q-value for protein groups since Percolator has a more elaborate FDR estimation. For proteins we add q-value as main score and PEP as metavalue. For PSMs you can choose the main score. Peptide level FDRs cannot be parsed and used yet.
Multithreading: The thread parameter is passed to percolator. Note: By default, a minimum of 3 threads is used (default of percolator) even if the number of threads is set to e.g. 1 for backwards compatibility reasons. You can still force the usage of less than 3 threads by setting the force flag.
The command line parameters of this tool are:
PercolatorAdapter -- Facilitate input to Percolator and reintegrate. Full documentation: http://www.openms.de/doxygen/release/3.2.0/html/TOPP_PercolatorAdapter.html Version: 3.2.0 Sep 18 2024, 16:00:56, Revision: e231942 To cite OpenMS: + Pfeuffer, J., Bielow, C., Wein, S. et al.. OpenMS 3 enables reproducible analysis of large-scale mass spec trometry data. Nat Methods (2024). doi:10.1038/s41592-024-02197-7. Usage: PercolatorAdapter <options> Options (mandatory options marked with '*'): -in <files> Input file(s) (valid formats: 'mzid', 'idXML') -in_decoy <files> Input decoy file(s) in case of separate searches (valid formats: 'mzi d', 'idXML') -in_osw <file> Input file in OSW format (valid formats: 'OSW') -out <file>* Output file (valid formats: 'idXML', 'mzid', 'osw') -out_type <type> Output file type -- default: determined from file extension or conten t. (valid: 'mzid', 'idXML', 'osw') -enzyme <enzyme> Type of enzyme: no_enzyme,elastase,pepsin,proteinasek,thermolysin,chy motrypsin,lys-n,lys-c,arg-c,asp-n,glu-c,trypsin,trypsinp (default: 'trypsin') (valid: 'no_enzyme', 'elastase', 'pepsin', 'proteinasek', 'thermolysin', 'chymotrypsin', 'lys-n', 'lys-c', 'arg-c', 'asp-n', 'glu-c', 'trypsin', 'trypsinp') -percolator_executable <executable>* The Percolator executable. Provide a full or relative path, or make sure it can be found in your PATH environment. -peptide_level_fdrs Calculate peptide-level FDRs instead of PSM-level FDRs. -protein_level_fdrs Use the picked protein-level FDR to infer protein probabilities. Use the -fasta option and -decoy_pattern to set the Fasta file and decoy pattern. -osw_level <osw_level> OSW: the data level selected for scoring. (default: 'ms2') (valid: 'ms1', 'ms2', 'transition') -score_type <type> Type of the peptide main score (default: 'q-value') (valid: 'q-value' , 'pep', 'svm') Common TOPP options: -ini <file> Use the given TOPP INI file -threads <n> Sets the number of threads allowed to be used by the TOPP tool (defau lt: '1') -write_ini <file> Writes the default configuration file --help Shows options --helphelp Shows all options (including advanced)
INI file documentation of this tool:
Percolator is written by Lukas Käll (http://per-colator.com/ Copyright Lukas Käll lukas) .kal l@sci life lab.s e