The +1 problem - adding new samples incrementally #95

Open
boboppie opened this issue Aug 24, 2024 · 0 comments
@boboppie

Dear @zhengxwen,

I hope you are keeping well.

Could you recommend a best practice for the +1 problem with GDS, i.e. adding new samples incrementally in small batches? For example, our starting batch could be a biobank-scale dataset such as UKB, for which we can convert a pVCF to a GDS file. To add new samples later, the seqMerge function works, but it has to write a new output file, so storage use at least doubles (temporarily). Is there a way to avoid this by appending the new samples directly to the existing GDS file? Any advice or guidance would be much appreciated.

Best wishes,
Fengyuan

@zhengxwen zhengxwen self-assigned this Sep 2, 2024