Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HATCHet2 RDR/BAF locally-aware clustering error with sparse bins #206

Open
aganezov opened this issue Feb 8, 2024 · 0 comments
Open

HATCHet2 RDR/BAF locally-aware clustering error with sparse bins #206

aganezov opened this issue Feb 8, 2024 · 0 comments

Comments

@aganezov
Copy link

aganezov commented Feb 8, 2024

First of all, thank you to all on HATCHet2 team for the recent release and bringing on exciting development of locally-aware RDR/BAF bin clustering (vs global clustering in HATCHet1).

With that mentioned, when trying to run on HATCHet2 on some data that contains occasional gaps between bins on a given chromosomal arm (e.g., long stretches of homozygosity and/or identity by descent in non-human organisms, where BAF values are impossible to obtain; or cases of WES where gaps define intergenic regions) I get an error mentioning that there is more than 1 gap per chromosome (coming from this code: https://github.com/raphael-group/hatchet/blob/master/src/hatchet/utils/cluster_bins.py#L158C1-L159C1 presumably).

What would be the best way to utilize HATCHet2 locally-aware RDR/BAF bin clustering while allowing for non-contiguous bin runs within a given chromosomal arm?
I'd imagine that currently every chromosomal arm is treated independently as a contiguous stretch of boundary-touching bins, while, in principal, this can be generalized for for a case of having more than 1 "bin-run" along a chromosomal arm, where every such stretch of contiguous bins is treated independently as is currently the case for chromosomal arms.

Is it something that can be readily tried with the current HATCHet2 code base? Are there any workarounds for it?

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant