![]() |
OpenMS
|
The DatabaseFilter tool filters a protein database in fasta format according to one or multiple filtering criteria.
The resulting database is written as output. Depending on the reporting method (method="whitelist" or "blacklist") only entries are retained that passed all filters ("whitelist) or failed at least one filter ("blacklist").
Implemented filter criteria:
accession: Filter database according to the set of protein accessions contained in an identification file (idXML, mzIdentML)
The command line parameters of this tool are:
DatabaseFilter -- Filters a protein database (FASTA format) based on identified proteins
Full documentation: http://www.openms.de/doxygen/release/3.4.1/html/TOPP_DatabaseFilter.html
Version: 3.4.1 May 19 2025, 14:24:34, Revision: 8aec8ec
To cite OpenMS:
+ Pfeuffer, J., Bielow, C., Wein, S. et al.. OpenMS 3 enables reproducible analysis of large-scale mass spec
trometry data. Nat Methods (2024). doi:10.1038/s41592-024-02197-7.
Usage:
DatabaseFilter <options>
Options (mandatory options marked with '*'):
-in <file>* Input FASTA file, containing a protein database. (valid formats: 'fasta')
-id <file>* Input file containing identified peptides and proteins. (valid formats: 'idXML', 'mzid')
-method <choice> Switch between white-/blacklisting of protein IDs (default: 'whitelist') (valid: 'whitel
ist', 'blacklist')
-out <file>* Output FASTA file where the reduced database will be written to. (valid formats: 'fasta'
)
Common TOPP options:
-ini <file> Use the given TOPP INI file
-threads <n> Sets the number of threads allowed to be used by the TOPP tool (default: '1')
-write_ini <file> Writes the default configuration file
--help Shows options
--helphelp Shows all options (including advanced)
INI file documentation of this tool: