-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge multiple KEMET results #11
Comments
Dear Leandro, I'd personally do that using a combination of bash commands to extract the columns of interest from the .tsv table files. # move to the KEMET report folder
cd KEMET/reports_tsv
# create first column of summary file
echo samples > modules.start
# add modules ID in summary file
# replace [NAME] w/ any single .tsv filename
cut -f1 [NAME] >> modules.start
# extract module compleness per each genome as a tmp file
for f in *.tsv; do echo ${f:10:-4} > $f.tmp; cut -f3 $f >> $f.tmp; done
# create new folder for result
mkdir summary
# unite modules ID and result per each genome
paste modules.start *.tmp > summary/summarized_table.tsv
# clean from tmp files
rm *.tmp modules.start Do you have anything specific in mind? Best, |
Dear Matteo, thank you for your attention and help, your script worked perfectly. It was exactly what I needed. I ended up (parser) doing something similar in R, I'll post it below in case anyone needs a second solution: rm(list=ls()) setwd ("D:/ITV/KEMET_resultados/reports_tsv_KASS") path: To specify directory contain KEMET results: modules_id <- data_join$Module_id # colname: module_id My filenames pattern of KEMET results: reportKMC_Ga0541012_bin.tsv Best regards, |
Dear,
I performed kemet against several samples, can you give me some tips on how to merge these tables into one?
Best regards,
Leandro.
The text was updated successfully, but these errors were encountered: