Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

consistency checks #17

Open
amelie-stein opened this issue Jun 30, 2020 · 6 comments
Open

consistency checks #17

amelie-stein opened this issue Jun 30, 2020 · 6 comments
Assignees
Labels
enhancement New feature or request medium priority

Comments

@amelie-stein
Copy link
Contributor

What consistency checks are currently implemented? And to what extent are they automatically controlled, vs. which need a human to look at, and who does that? Would we spot / be notified if a relaxed structure had >0 REU, or if there suddenly was a large fraction of high-scoring ddGs?

@j0kaso
Copy link
Collaborator

j0kaso commented May 21, 2021

I added 3 sets of tests (https://github.com/KULL-Centre/PRISM/tree/dev_v0.1/software/rosetta_ddG_pipeline/test):

  • create mode checked by unittest as those should provide the exact same files (testing 4 different inputs)
  • mp & sp "scientific" tests: these check if the ddG values can be recalculated and raise an error if corr < 0.85. The values of each run are stored in a comparison file (e.g. for soluble example. For the mp test, those values are also compared to exp ddGs.

The test for the soluble protein (sp) could be expanded.
So far, relaxes or their structures are not tested. Would you @amelie-stein have an idea how or what to test?

@amelie-stein
Copy link
Contributor Author

Hm, the tests I was thinking of (above) are for individual runs, as each protein and structure are different. They were to set conditions in which a warning should be issued that a structure may not be a good starting point for ddG calculations (high energy, clashes). The tests you list are for reproducibility, which is a separate issue. I don't think we need to test plain Rosetta relax for that, as that is done elsewhere.

@j0kaso
Copy link
Collaborator

j0kaso commented May 21, 2021

Ah, got it! Indeed, that something different and I agree, a quality check of a structure would be good to have! Currently, nothing what you mention is implemented

@j0kaso
Copy link
Collaborator

j0kaso commented Mar 10, 2022

Regarding the original question: We do not automatically check for large fraction of high-scoring ddGs as this should be noted by the user and can be due to several reasons. Also I would not know how to set such values.

With respect to the Eres of the relaxed structure, this can be calculated using ../software/scripts/pdb_to_prism - rosetta_energy_to_prism function with the input parameters: relaxed_pdb_file (output from the pipeline), temp_dir (for calculations and information for the prism parser: pdbID, pdbchain, uniprot_id; and outputs: pdb_prism_file
It extracts the energies from the final output pdb of the relaxation (summed up as comments at the bottom of the file).

I noted both things in the README and would therefore close this issue if that is sufficient @amelie-stein.

@j0kaso
Copy link
Collaborator

j0kaso commented Mar 11, 2022

I added now the automatic calculation of the E_res file (a275fe5). I am not sure if we want a checking of large fractions of ddGs but if so, please elaborate about the criteria @amelie-stein

@amelie-stein
Copy link
Contributor Author

amelie-stein commented Mar 11, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request medium priority
Projects
None yet
Development

No branches or pull requests

3 participants