Hi ProteinGym team!
We would like to contribute to the zero-shot substitution benchmark with our model ESCOTT, presented in "PRESCOTT: a population aware, epistatic and structural model accurately predicts missense effect". ESCOTT is an updated version of GEMME that introduces structural information into the model.
ESCOTT achieves a weighted average Spearman of 0.473 (computed with `scripts/scoring_DMS_zero_shot/performance_subsitutions.sh`) and a simple average of 0.497. Results for the individual assays can be found on Zenodo.

Because of the many dependencies inherited from GEMME and the additional installation of the DSSP program, we provide a dedicated Docker image with a properly configured environment. It can be pulled from Docker Hub with
To reproduce the results, run `scripts/scoring_DMS_zero_shot/scoring_ESCOTT_substitutions.sh`. The script executes `proteingym/baselines/ESCOTT/compute_fitness.py` inside the Docker container, with the data volumes mounted. To run the script without `root` permissions, the user has to be added to the `docker` group.

Please let us know if anything is missing, and whether the provided Docker-based solution matches your requirements.
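
For reviewers who want a picture of what running a script inside a container with mounted data volumes involves, here is a rough Python sketch that only assembles a `docker run` command line. It is not the content of the scoring script: the image name, the host and container directories, and the use of `python` as the in-container interpreter are all placeholder assumptions chosen for illustration.

```python
import os
import shlex

# Hypothetical placeholders: NOT the real image name or paths used in this PR.
IMAGE = "example/escott:latest"
SCRIPT = "proteingym/baselines/ESCOTT/compute_fitness.py"


def build_docker_command(image, mounts, script, script_args=()):
    """Build a `docker run` argv that bind-mounts host dirs and runs a script.

    `mounts` maps host directories to container directories.
    """
    cmd = ["docker", "run", "--rm"]  # --rm removes the container on exit
    for host_dir, container_dir in mounts.items():
        # -v HOST:CONTAINER bind-mounts a host directory into the container;
        # Docker expects an absolute host path for bind mounts.
        cmd += ["-v", f"{os.path.abspath(host_dir)}:{container_dir}"]
    # Assumption: the script is started with `python` inside the container.
    cmd += [image, "python", script, *script_args]
    return cmd


if __name__ == "__main__":
    example = build_docker_command(
        IMAGE,
        {"./data": "/data", "./output": "/output"},  # hypothetical volumes
        SCRIPT,
    )
    # Print the command instead of running it; pass `example` to
    # subprocess.run(example, check=True) to execute it for real.
    print(shlex.join(example))
```

Building the command as a list rather than a single string avoids shell-quoting problems when paths contain spaces.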