Retroelement Protein Expression Atlas

Project Information

This web-portal hosts LINE-1 protein identfications from re-analysed PRIDE datasets. Several visualisations and exploratory options are available to allow users to browse this information.

Website Endpoints

Endpoint	Description
/home	Homepage containing Interactive Human Protein Exp Data
/browse	Interactive Table containing analysed data by RetroMiner
/visualise	ProtVista Protein Viewer
/ideogram	Expression data for analysed datasets
/loci_ideogram	All Loci possible points of interest
/sequence	Displays sequence data for selected protein
/download	Download data from the database
/api	Use the built in API resolver to retrieve data
/info	Information on what powers the server
/contact	Members involved in creating the website / getting the data / knowledge

Breakdown of sections

Homepage

The homepage contains a SVG interactive human image that allows a user to explore the summary metrics of the Retroelement Protein Expression Atlas for the selected tissue / organ.

Hovering over the tissues allows for the graph to change to represent the summary metrics, clicking on an organ allows the graph to pause the data on the selected tissue.

The page also contains an explanation of RTPEA & RetroMiner, the tool which was used to generate the data.

On initial load, the API server and MongoDB server are checked to see if they are online.

Browse

This page contains a super table which displays all the data that has currently been analysed by RetroMiner.

The Table displays data by PXD (by default) which is a dataset identifier set by PRIDE (ProteomeExchange consortium)

Within each section (PXD) The Sample Number, Tissue, as well as the ORF1/2p Identifications.

Clicking on the Sample Number or anywhere on the table is another table which displays the variants as well as the confidence level and a link to the sequence. This sub-table for the samples displays the variants that may occur.

Filtering & Sorting is available on the table to sort by the PXD, Study, No. of Samples & Disease. The table can be sorted by any column header simply by clicking on it.

Sorting is also available on the sub-table which allows a user to find expression proteins.

The confidence levels are given as a percentage as well as colour coded.

Green = High Confidence
Orange = Med Confidence
Red = Low COnfidence

The NUMBER represents the confidence score calculated by PeptideShaker. This number ranges between 0 to 100 (highest) and indicates how confident we are that this protein is expressed in the given sample. The rate of false positive results is less than 1 in 1000 (FDR < 0.01).

We have given different variants of the same family a different name by simply adding an underscore followed by a number. (e.g: ORF1p_HS_10). The last number has no biological meaning.

There could be protein results by three LINE-1 protein families: HS, PA2 and PA3. And 3 different proteins by HERVs (the gag, pol and env groups)

Visualise

This section contains 3 sub-sections to visualise the data

Protein Centric

To visualise the data, ProtVista was used and heavily altered to produce results in a way that would convey the data appropriately.

By Selecting a Tissue type, State and Minimum Score, Variant expression data is generated along with the position, Description, Family, Tissue Type, Disease State and Confidence. If a dataset is Diseased, then an additional section displays the type of disease: Disease State e.g. METASTIC MELANOMA for Diseased Blood Serum.

An option exists to map the variant onto a chromosome which is related to the next section.

Chromosome Centric

The Chromosome Centric Mapper allows a user to view the highest expression variants on the human chromosome.

Clicking on the chromosome allows for a larger zoomed in version. The higher the bar, the higher expressing it is.

A user can sort by familes HS, or PA2, or can search specifically for a variant e.g HS_94.

To find the SEQUENCE, hover over the annotation and click the link.

All Loci

The All Loci Mapper allows a user to view all POSSIBLE expressing variants on the human chromosome. Clicking on the chromosome allows for a larger zoomed in version. This section operates the same was as in the Chromosome Centric Mapper.

Download / API

This section contains the option for downloading various bits of data that is stored in the database.

An API section also exists to allow a user to execute PXD numbers without the need for a commandline or programming tool.

Author Information

Created by Nayam Chowdhury (as part of an MSc project)

Guidance / Help:

Nazrath Nawaz (PhD)
Conrad Bessant (Professor)

Website Link: https://rtpea.com/

This project was made possible and bootstrapped with Create React App.

Development Ongoing	Development Version	Production Version
✅	v1.4	v1.4

Name		Name	Last commit message	Last commit date
Latest commit History 314 Commits
public		public
src		src
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
Fix_Json_ORF.py		Fix_Json_ORF.py
README - Copy.md		README - Copy.md
README.md		README.md
convert_results_to_json_working.R		convert_results_to_json_working.R
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Retroelement Protein Expression Atlas

Project Information

Website Endpoints

Breakdown of sections

Homepage

Browse

Visualise

Protein Centric

Chromosome Centric

All Loci

Download / API

Author Information

About

Releases 2

Packages

Contributors 2

Languages

NCHlab/RTPEA

Folders and files

Latest commit

History

Repository files navigation

Retroelement Protein Expression Atlas

Project Information

Website Endpoints

Breakdown of sections

Homepage

Browse

Visualise

Protein Centric

Chromosome Centric

All Loci

Download / API

Author Information

About

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages