Add process for converting merged objects to AnnData #613

allyhawkins · 2023-12-08T18:33:14Z

Closes #604
⚠️ Stacked on #610 (Because this is slightly dependent on decisions made in #610, I'm going to wait to request review)

This PR adds a process to convert the merged SCE objects to an AnnData object. When converting to AnnData, I chose to use the same scripts that we use in the main workflow to convert the object and then move the counts. The reason for this is I think it's probably best to be consistent in how we format our objects. In particular, with the AnnData objects, we add the sample metadata as columns in the colData and do a little bit of renaming. I don't think there's any reason we shouldn't be doing that here.

This process doesn't differ much from the conversion process in the main workflow, except here, we know that the only time we will have an altExp is if the has_adt value is true. I use that value to determine if the feature file should be created.

Also, all of the objects are considered processed objects here, so I run move_counts_anndata.py for all HDF5 files. Again, I'm doing this so that how we name things is consistent with the other AnnData objects.

The last idea I had was maybe we also want to add the sample metadata to the colData of the merged objects. That way, things like diagnosis, age, etc., would be easier to work with instead of living in a separate data frame in the metadata. What do we think?

…rged-sce-to-anndata

allyhawkins · 2023-12-14T16:23:46Z

I've gone ahead and tested this and this is ready for review!

jashapiro

LGTM!

sjspielman

LGTM! The only thing I am thinking is I know I saw some comment go by somewhere (github? slack?) recently from @jashapiro about updating the extension we use for hdf5 files... Should we do this? @jashapiro please weigh in!

allyhawkins · 2023-12-14T16:40:55Z

LGTM! The only thing I am thinking is I know I saw some comment go by somewhere (github? slack?) recently from @jashapiro about updating the extension we use for hdf5 files... Should we do this? @jashapiro please weigh in!

I think if we did that then we would need to change all the individual AnnData file extensions and then dev would also have to update their code. They can have .hdf5, .h5ad, or .h5, so I don't think it's worth the headache personally.

jashapiro · 2023-12-14T16:41:50Z

It is in #616, which seems like the place to do it (but AlexsLemonade/scpcaTools#244 has to happen first, as we do checks in scpcaTools)

add process for converting to anndata

8fccf4a

allyhawkins marked this pull request as draft December 8, 2023 18:33

Base automatically changed from allyhawkins/altexp-merged-objects to development December 11, 2023 15:17

allyhawkins and others added 5 commits December 11, 2023 09:17

Merge remote-tracking branch 'origin/development' into allyhawkins/me…

6e04a58

…rged-sce-to-anndata

Merge branch 'development' into allyhawkins/merged-sce-to-anndata

b2cdc19

use correct include_altexp arg

9b745e6

use scpcaTools:edge

ba9ae7b

some missed syntax and merge_groups

b93c5d0

allyhawkins marked this pull request as ready for review December 14, 2023 15:47

allyhawkins requested a review from sjspielman December 14, 2023 16:23

jashapiro approved these changes Dec 14, 2023

View reviewed changes

sjspielman approved these changes Dec 14, 2023

View reviewed changes

allyhawkins merged commit 8a4ac59 into development Dec 14, 2023
3 checks passed

allyhawkins deleted the allyhawkins/merged-sce-to-anndata branch December 14, 2023 16:45

allyhawkins mentioned this pull request Dec 14, 2023

Process for converting merged SCE to merged AnnData #604

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add process for converting merged objects to AnnData #613

Add process for converting merged objects to AnnData #613

allyhawkins commented Dec 8, 2023

allyhawkins commented Dec 14, 2023

jashapiro left a comment

sjspielman left a comment

allyhawkins commented Dec 14, 2023

jashapiro commented Dec 14, 2023

Add process for converting merged objects to AnnData #613

Add process for converting merged objects to AnnData #613

Conversation

allyhawkins commented Dec 8, 2023

allyhawkins commented Dec 14, 2023

jashapiro left a comment

Choose a reason for hiding this comment

sjspielman left a comment

Choose a reason for hiding this comment

allyhawkins commented Dec 14, 2023

jashapiro commented Dec 14, 2023