-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
.ComputeClusterStats update; add pattern method to outer functions. #6
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…ode of computeclusterstats; removed duplicated code - computeclusterstatsdualchain
Hello @brianpatrickneal, let's discuss the bolded points I made in the PR description |
Hi Daniil,
I agree with your suggestion to issue a warning when encountering NAs in
the count column.
I will be happy to take care of updating the documentation (it will be a
good way for me to review and familiarize myself with the changes). As a
heads up, it is almost certain that I will be slow to do so as the next two
weeks are going to be especially hellish for me. It is unlikely that I will
get it done before the weekend of 3/22. If this delay interferes with your
ongoing work, let me know and I'll do what I can to expedite, though I
can't make any credible promises, much as I would like to. Thanks for your
patience with me!
Best,
Brian
…On Wed, Mar 6, 2024 at 11:49 AM aidanil ***@***.***> wrote:
Hello, Brian, let's discuss the bolded points I made in the PR description
—
Reply to this email directly, view it on GitHub
<#6 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AWMANGELRRNOYYE52MZ7LSDYW5XNPAVCNFSM6AAAAABEJVXWF6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOBRGY3DSMBTGE>
.
You are receiving this because your review was requested.Message ID:
<mlizhangx/Network-Analysis-for-Repertoire-Sequencing-/pull/6/c1981669031@
github.com>
|
Hello Brian,
No worries, I think we are not in a rush, let's discuss it when you will have more time!
Also, I think at that time I will have more analysis of different adjacency matrix building methods, and we will add this info to docs too (and/or decide how to automatize the choice).
Sincerely,
Daniil Matveev
________________________________
From: brianpatrickneal ***@***.***>
Sent: Friday, March 8, 2024 5:19 PM
To: mlizhangx/Network-Analysis-for-Repertoire-Sequencing- ***@***.***>
Cc: Daniil Mikhailovich Matveev ***@***.***>; Author ***@***.***>
Subject: Re: [mlizhangx/Network-Analysis-for-Repertoire-Sequencing-] .ComputeClusterStats update; add pattern method to outer functions. (PR #6)
Hi Daniil,
I agree with your suggestion to issue a warning when encountering NAs in
the count column.
I will be happy to take care of updating the documentation (it will be a
good way for me to review and familiarize myself with the changes). As a
heads up, it is almost certain that I will be slow to do so as the next two
weeks are going to be especially hellish for me. It is unlikely that I will
get it done before the weekend of 3/22. If this delay interferes with your
ongoing work, let me know and I'll do what I can to expedite, though I
can't make any credible promises, much as I would like to. Thanks for your
patience with me!
Best,
Brian
On Wed, Mar 6, 2024 at 11:49 AM aidanil ***@***.***> wrote:
Hello, Brian, let's discuss the bolded points I made in the PR description
—
Reply to this email directly, view it on GitHub
<#6 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AWMANGELRRNOYYE52MZ7LSDYW5XNPAVCNFSM6AAAAABEJVXWF6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOBRGY3DSMBTGE>
.
You are receiving this because your review was requested.Message ID:
<mlizhangx/Network-Analysis-for-Repertoire-Sequencing-/pull/6/c1981669031@
github.com>
—
Reply to this email directly, view it on GitHub<#6 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ARB34CKV3476JHXUNXUDXRDYXJPSDAVCNFSM6AAAAABEJVXWF6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOBWGY3TIMRQGI>.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Documentation has been updated. Further thoughts on 1.2 (handling NAs in count data):
I'll bring the question up for discussion during tomorrow's meeting. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
1.1. Speed improvement - ~300% for small toy data with 200 rows. ~300-700% for real CMV data with 10^5 strings.
1.2. Open question - how should we handle NA values in the count column? I don't think it's the best idea to use na.rm=TRUE. Maybe we should put some warning to the user, that the count column has NA values.
2.1. This function was duplicating .computeClusterStats in everything except it was computing string statistics for two columns.
3.1. We should rewrite the documentation for them, I need help with that.
Besides automatic tests, I tested that output for the new .computeClusterStats is the same as it was before on the toy data:
(code should be in the project root directory)