-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changing "report_psms" to a super high value (eg >> 500) seems necessary to find peptides of very high rank (eg < 5) #143
Comments
So with the above in mind, I'm not concerned with interactions between report_psms and spectrum_q - they are behaving as they should. However, I did verify the missing the peptide you mentioned, and that is strange behavior. I'll check it out and get back to you. |
ok, cool -- I'm looking forward to hearing your thoughts regarding the alleged missing peptide etc. (One "ugly" workaround I'm thinking about, if a fix is time consuming etc, is to output an arbitrarily super high number of matches per spectra, eg, 5000, and then later filter out anything beyond say the top 50, since that "seems" to find the missing peptide, but a) I have no idea whether even 5000 is sufficiently large and b) I'm unsure if there are other hidden issues that this 'missing' peptide may be shedding light on, even if those issues are ultimately minor etc....) |
ok, silly follow-up question: how do we set database.min_ion_index to 0? I couldn't find that specific setting in the documented config.json description (or in the automatically outputted configuration json file etc). (Also, I'm assuming that if I set it to 0, I could revert the |
See below for an example. With {
"version": "0.14.7",
"database": {
"bucket_size": 8192,
"enzyme": {
"missed_cleavages": 2,
"min_len": null,
"max_len": null,
"cleave_at": "KR",
"restrict": null,
"c_terminal": null,
"semi_enzymatic": null
},
"fragment_min_mz": 150.0,
"fragment_max_mz": 1500.0,
"peptide_min_mass": 500.0,
"peptide_max_mass": 5000.0,
"ion_kinds": [
"b",
"y"
],
"min_ion_index": 0,
},
} |
d1280a7 removes the |
Sage 0.14.7 has been working great for me so far. However, I found a few circumstances when I couldn't explain why it didn't at least list a possible peptide within the top 500 or so. For example, in the attached tar file, I have provided a small mgf (of one entry) along with the config.json file, fasta file, and resulst.sage.tsv file. When I ran the sage0.14.7 search, I could not find the following entry of
(which should have matched at least 5 MS2 m/z fragments), even though the search output listed peptides with fewer than 5 fragments matching it. However, when I change the
report_psms
parameter in the config.json file to 5000, as in:sage0.14.7 finds (correctly) this peptide as ranks it as number 3 with 7 fragments matched. (I've noticed this for many other scenarios as well, which is why I've arbitrarily set the "report_psms" to a high number like 500, even though I only really care about the first 10 to 50 possible matches). Thoughts?
(Also, this may or may not be related, but I've noticed that (a) changing the "report_psms" parameter impacts the spectrum_q score, but I can't figure out why that should be the case since I would have thought that "report_psms" is simply a parameter to determine what gets reported out but it should not impact internal calculations; also, (b) I've noticed that setting "precursor_tol" to much much higher values than I technically need (e.g.,:
(instead of +- 1.5Da), seems to substantially impact the spectrum_q scores. (In particular, when I experimented, it seemed like a value of +- 96Da would not only decrease the spectrum_q values substantially, but seemed to rank the possible peptides a bit more accurately. Is that the 'expected' behaviour and if so, could you help me/us understand why this is? (My guess is that it has to do with having sufficient false matches to model the spectrum_q score accurately). (And super minor note: but when I run sage -V, I think it prints out the wrong version of 0.14.6 instead of 0.14.7)
Thanks in advance!
proofOfMissingPeptide.tar.gz
The text was updated successfully, but these errors were encountered: