This script retrieves gene name of BRCA1, its sequence, isoelectric point, 20 interacting proteins with quality score higher than 900 and PDB IDs of proteins with more than 10 structures available. It also provides the user with a column chart of amino acid composition and frequency, a network graph of its interaction partners and a bar chart of proteins with more than 10 structures available.
- User must input the protein of interest "Q3B891" when prompted. This initiates the pipeline to retrieve the information.
- The pipeline requires Perl 5.24.3 or later and Bio Perl installed.
- It also requires libraries Vis, LWP and the file LWP.pm (provided within this repository). Make sure these are available from the running directory.
- Program uses necessary modules from Bio Perl such as "pICalculator". Therefore,installing these modules is essential.
- Script can be run from Command line, Perl command line interpreter or on Linux with Perl version 5.24.3 or later installed including all related modules of Bio Perl.
- The HTML files generated from this running this script can be opened using any browser.
The user can retrieve information such as gene name, sequence, isoelectric point, 20 interacting proteins with quality score higher than 900 and PDB IDs of proteins with more than 10 structures available on the terminal as standard output. It also produces three HTML scripts to visualise the amino acid composition and frequency in column chart, a network graph of BRCA1 and its interacting proteins and a bar chart of proteins with more than 10 structures available. These HTML scripts are combined together (Combined_File.html) and available within the repository. This HTML file must be downloaded in order to open using browsers.