Allow for returning a dataframe from gget.mutate, w/ more context #169

austinv11 · 2024-12-11T18:18:36Z

Retained two additional columns for the dataframe: mutation_start and mutation_end. Which avoids the need to manually re-parse mutation strings if needed later on.

Additionally, all the extra info in the dataframe is lost in the python API since there are operations done on the dataframe that are copy-on-write which means that inputted dataframes with the update_df flag set to True only gain two potential columns: mutation_type and wt_sequence_full, whereas I would like all the additional context in my dataframe.

To combat this, I made it so that when out=None and update_df=True, the function will return a dataframe. Otherwise it will revert to the previous behavior.

Let me know if you have any feedback!

Add mutation start and end ranges to the final dataframe

austinv11 added 5 commits December 11, 2024 12:54

Add additional context columns

fbb9f0d

Add mutation start and end ranges to the final dataframe

Return the updated mutation df

adf606d

Only return a relevant subset of columns

faca7b3

Don't emit a warning when update_df is True but update_df_out is None

3d7d93b

Somehow an ending quote was missing

e6ca97e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow for returning a dataframe from gget.mutate, w/ more context #169

Allow for returning a dataframe from gget.mutate, w/ more context #169

austinv11 commented Dec 11, 2024

Allow for returning a dataframe from gget.mutate, w/ more context #169

Are you sure you want to change the base?

Allow for returning a dataframe from gget.mutate, w/ more context #169

Conversation

austinv11 commented Dec 11, 2024