Account for mismatches between old cell typing results and current processed object and handle missing UMAPs #596

allyhawkins · 2023-11-27T20:03:11Z

Closes #591

Based on discussion in #591, we decided to handle any potential mismatches in barcodes between the processed object and the existing CellAssign or SingleR results by labeling any cells not present in the results as Unclassified cells. This PR makes that adjustment and fixes some other smaller errors I discovered during test runs.

I accounted for potential fails in density() following the suggestion in Figure out what to do when the numbers of cells don't match up between CellAssign results and processed SCE object #591 (comment).
All of the missing cells have been annotated as Unclassified cells in the processed SCE object. This avoids any cells being labeled as NA accidentally. This happens as part of add_celltypes_to_sce.R.
In the QC report, I check for any missing cells and output a warning that some cells may be missing from the cell type results. I also removed any Unclassified cells before continuing with plotting.
I also had some issues where UMAP was not being calculated, causing a failure when rendering the cell type section of the report. The UMAP issues have been resolved by updating scpcaTools (Update renv & python scpcaTools#242). However, I realized we probably want to account for potentially having no UMAP results and still having cell type results. We already have handling for missing UMAPs in the main report, but we don't have that for the cell type report, so I added that here. This involved updating the function for creating celltype_df to account for potentially missing UMAP results and then using has_umap throughout the report to ensure no UMAPs are printed if UMAP is missing.

Here's a copy of a rendered main and supplemental report with these changes:
SCPCL000495_qc.html.zip

SCPCL000495_celltype-report.html.zip

jashapiro

I think this all looks good. I had a number of small suggestions, but I don't think anything that should require another look.

bin/add_celltypes_to_sce.R

templates/qc_report/celltypes_qc.rmd

Co-authored-by: Joshua Shapiro <[email protected]>

allyhawkins and others added 5 commits November 22, 2023 16:01

account for NA's when creating density plot

4d3318f

add unclassified cells to objects

f98dc6a

remove unclassified cells and print out warning

69514c8

account for no missing barcodes

445eab3

account for missing umaps

3b893ca

allyhawkins requested a review from jashapiro November 27, 2023 20:03

jashapiro approved these changes Nov 27, 2023

View reviewed changes

Apply suggestions from code review

7415240

Co-authored-by: Joshua Shapiro <[email protected]>

allyhawkins merged commit 01a45ca into development Nov 27, 2023
3 checks passed

allyhawkins deleted the allyhawkins/celltype-NAs branch November 27, 2023 22:06

This was referenced Nov 28, 2023

Update versions to 0.7.0 in docs #598

Merged

Figure out what to do when the numbers of cells don't match up between CellAssign results and processed SCE object #591

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Account for mismatches between old cell typing results and current processed object and handle missing UMAPs #596

Account for mismatches between old cell typing results and current processed object and handle missing UMAPs #596

allyhawkins commented Nov 27, 2023

jashapiro left a comment

Account for mismatches between old cell typing results and current processed object and handle missing UMAPs #596

Account for mismatches between old cell typing results and current processed object and handle missing UMAPs #596

Conversation

allyhawkins commented Nov 27, 2023

jashapiro left a comment

Choose a reason for hiding this comment