All data (including example data) producing gene count matrix with 0s #122

IzzyJRussell · 2019-07-10T13:50:06Z

Hi, we have been integrating scPipe into a scRNA-seq analysis and have been unable to produce a gene count matrix with any expressed genes. To check whether the problem lies with scPipe rather than our input data, we ran the example data, copying the code directly from the tutorial in the documentation.

We found that when the example code is run exactly as written, gene_count.csv contains a table full of 0s, implying that none of the detected genes are expressed. Is this the intended result for the example data? If not, what could be causing it?

IzzyJRussell · 2019-07-10T15:04:47Z

I believe it may be linked to this issue from 2018?

Shians · 2019-07-12T06:13:44Z

Hello Izzy,

Thanks for your issue report. The example isn't meant to produce a table full of 0s. It's unclear what kind of bug is causing this and I am looking into it.

Kind regards,
Shian

Shians · 2019-07-29T02:24:40Z

Sorry @IzzyJRussell for the delayed response, I've been travelling for the past couple of weeks. I've narrowed the issue down to the has_UMI=FALSE argument in the tutorial call sc_demultiplex(file.path(data_dir, "out.map.bam"), data_dir, barcode_annotation_fn, has_UMI=FALSE).

@LuyiTian will have to confirm, but I think setting has_UMI=FALSE uses the read names as UMIs, so each UMI has a count of only 1 and is then filtered out by UMI_correct1. Setting has_UMI=TRUE produces counts for the example, but I'm not sure why it was set to FALSE in the first place.
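A minimal sketch of that change, assuming scPipe is installed and the earlier tutorial steps have already written out.map.bam into data_dir. The sc_demultiplex call matches the one quoted above with only has_UMI flipped. The wrapper rerun_counts_with_umi and the helper total_counts are hypothetical names introduced here for illustration, and the assumed layout of gene_count.csv (gene IDs in the first column, one column per cell) should be checked against the actual file.

```r
# Re-run demultiplexing and gene counting with has_UMI = TRUE.
# Assumption: data_dir already contains out.map.bam from the tutorial steps.
rerun_counts_with_umi <- function(data_dir, barcode_annotation_fn) {
  scPipe::sc_demultiplex(file.path(data_dir, "out.map.bam"), data_dir,
                         barcode_annotation_fn, has_UMI = TRUE)
  scPipe::sc_gene_counting(data_dir, barcode_annotation_fn)
  # Path of the count table mentioned in the original report.
  file.path(data_dir, "gene_count.csv")
}

# Hypothetical helper: sum every entry of a count table.
# Assumption: first column holds gene IDs, remaining columns hold per-cell counts.
total_counts <- function(csv_path) {
  counts <- read.csv(csv_path, row.names = 1, check.names = FALSE)
  sum(as.matrix(counts))
}
```

If the diagnosis is right, total_counts(rerun_counts_with_umi(data_dir, barcode_annotation_fn)) should be greater than 0, whereas the all-zero table from the original report would give exactly 0.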

IzzyJRussell · 2019-07-30T12:49:49Z

Thanks for getting back to me. We determined that in this instance the Rsubread featureCounts function would be more suitable for our reanalysis. I wish you luck resolving the issue.

jdumdie · 2019-09-19T17:43:56Z

The tutorial recommends setting has_UMI=FALSE in the sc_demultiplex function for full-length protocols like SMART-seq. What do you recommend for SMART-seq protocols that have used UMIs? Is there a workaround to get the read counts out? I am just getting a matrix of zeros, as above.
