Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TCGA subtype problem #18

Open
eacho111 opened this issue Nov 9, 2023 · 2 comments
Open

TCGA subtype problem #18

eacho111 opened this issue Nov 9, 2023 · 2 comments

Comments

@eacho111
Copy link

eacho111 commented Nov 9, 2023

Hi,
it is very useful package! I download fpkm gene expression data from TCGA by TCGAbiolinks package, and i try to get the immune subtypes of the LUAD samples by using this package.when i compare this subtypes result with the subtypes of this paper(five_signature_mclust_ensemble_results.tsv.gz),their subtypes are different.
But when i use the gene expression of paper(ebppSubset.tsv.bz2) to classify these patients,the subtype result can match with the paper result(five_signature_mclust_ensemble_results.tsv.gz).So i want to know if there is any problem to classify immune subtype by using fpkm gene expression data from TCGA by TCGAbiolinks.if i need to do any thing with the gene expression download from TCGA?

Thanks!
here are some codes and results!
code and result .pptx

@eacho111 eacho111 changed the title TCGA subtype issun TCGA subtype problem Nov 9, 2023
@Gibbsdavidl
Copy link
Collaborator

Hey there!

Sorry I was traveling.

So I'm not sure, but something I've seen happen is that data is occasionally already normalized; where each gene is normalized across samples. That completely breaks the classification. It's important to make sure the data is unnormalized across samples. Otherwise, fpkm is fine as all other summarizations (tpm, rpkm, etc.)

Hopefully that helps!
-dave

@KaWingLee9
Copy link

@Gibbsdavidl Hi. As you mentioned, FPKM is suitable for the algorithm. So how to define the length of the genes? Exon lengths or maximum transcript length?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants