
v1.1.1

@rdgroussman rdgroussman released this 23 Jan 07:38
830b3fa

MarFERReT protein sequence reference library change log

v1.1.1

Improvements to MarFERReT assembly scripts

  • Incorporated inclusion/exclusion flag into build procedure to integrate QC process
  • Added check to see if translated protein sequence already exists
  • Changed scripts to pull database version from input metadata file
  • Streamlined the internal file naming process
    • Important output files are now generated with the same version ID as the input metadata.csv
  • The /data/source_seqs/ directory for input source sequences is now pre-generated
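The script changes listed above can be illustrated with a minimal Python sketch. This is not the code in the MarFERReT build scripts. It assumes that the version ID is embedded in the metadata file name (as in MarFERReT.v1_demo.metadata.csv), and the column name "include", the accepted flag values, and all function names are hypothetical.

```python
import csv
import io
import re
from pathlib import Path


def version_from_metadata_name(filename: str) -> str:
    """Return the version ID embedded in a name like 'MarFERReT.v1_demo.metadata.csv'.

    Assumption: the version sits between 'MarFERReT.' and '.metadata.csv'.
    """
    match = re.fullmatch(r"MarFERReT\.(?P<version>.+)\.metadata\.csv", Path(filename).name)
    if match is None:
        raise ValueError(f"unexpected metadata file name: {filename}")
    return match.group("version")


def entries_to_build(metadata_csv_text: str, flag_column: str = "include") -> list:
    """Keep only the metadata rows whose inclusion flag is set.

    'include' and the accepted values are hypothetical; the real column may differ.
    """
    accepted = {"y", "yes", "true", "1"}
    reader = csv.DictReader(io.StringIO(metadata_csv_text))
    return [row for row in reader if row[flag_column].strip().lower() in accepted]


def needs_translation(protein_path: Path) -> bool:
    """Report whether translation is still needed, i.e. no non-empty output file exists."""
    return not (protein_path.exists() and protein_path.stat().st_size > 0)


def versioned_output_name(metadata_filename: str, suffix: str) -> str:
    """Name an output file with the same version ID as the input metadata file."""
    return f"MarFERReT.{version_from_metadata_name(metadata_filename)}.{suffix}"
```

Deriving every output name from the metadata file name keeps the outputs of one build tied to the same version ID without a separately maintained version variable.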

Added test data set and demo tutorial

  • Added a small data set to test the MarFERReT build process under /versions/v1_demo/
    • Seven-entry metadata.csv file (MarFERReT.v1_demo.metadata.csv)
    • Compressed tarball with small set of FASTA sequences (demo_mini_entries.tar.gz)
    • Demo tutorial script (marferret.v1_demo.tutorial.sh)
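Before running the demo tutorial, the demo inputs listed above can be sanity-checked with a short Python sketch. It is an illustration only, not part of the MarFERReT repository: the function names are hypothetical, and it assumes the metadata file has one header row followed by one row per entry.

```python
import csv
import io
import tarfile


def count_metadata_entries(csv_text: str) -> int:
    """Count data rows in a metadata CSV (header row excluded)."""
    return sum(1 for _ in csv.DictReader(io.StringIO(csv_text)))


def list_archive_files(fileobj) -> list:
    """Return the sorted names of regular files in a gzip-compressed tarball."""
    with tarfile.open(fileobj=fileobj, mode="r:gz") as archive:
        return sorted(member.name for member in archive.getmembers() if member.isfile())
```

For example, opening MarFERReT.v1_demo.metadata.csv and demo_mini_entries.tar.gz from /versions/v1_demo/ and passing them to these two functions should report seven metadata entries and list the FASTA files in the tarball, assuming the file layout described above.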

Zenodo repository release

  • Input metadata files and output files generated by the assemble_marferret.sh, pfam_annotate.sh, and build_diamond_db.sh MarFERReT build scripts have been deposited in a Zenodo repository linked to the v1.1.1 codebase: https://doi.org/10.5281/zenodo.10553848