Skip to content
Change the repository type filter

All

    Repositories list

    • A set of native implementation of common bioinformatics algorithms to be used as Arrow-Datafusion or SeQuiLa (Apache Spark) extensions.
      Rust
      Apache License 2.0
      00192Updated Jan 6, 2025Jan 6, 2025
    • Blazing fast bioinformatic operations on Python DataFrames
      Python
      Apache License 2.0
      02152Updated Jan 6, 2025Jan 6, 2025
    • Self service for Data Science labs
      HCL
      Apache License 2.0
      46021Updated Dec 21, 2024Dec 21, 2024
    • Jupyter Notebook
      Apache License 2.0
      45531Updated Dec 21, 2024Dec 21, 2024
    • Apache DataFusion Comet Spark Accelerator
      Rust
      Apache License 2.0
      169001Updated Nov 24, 2024Nov 24, 2024
    • TeX
      MIT License
      0000Updated Nov 11, 2024Nov 11, 2024
    • Jupyter Notebook
      1100Updated Oct 27, 2024Oct 27, 2024
    • phenodb

      Public
      Serverless vector database for deep phenotyping
      Apache License 2.0
      0000Updated Aug 29, 2024Aug 29, 2024
    • Fine-tuning LLaMA 2 for rare disease concept normalization
      Jupyter Notebook
      2000Updated Aug 9, 2024Aug 9, 2024
    • sequila

      Public
      SeQuiLa: Distributed analytics for genomics based on Apache Spark!
      HTML
      Apache License 2.0
      79238Updated Aug 2, 2024Aug 2, 2024
    • PhenoGPT

      Public
      Jupyter Notebook
      MIT License
      6000Updated May 16, 2024May 16, 2024
    • Launcher shortcuts for classic Jupyter Notebook & JupyterLab
      Python
      BSD 3-Clause "New" or "Revised" License
      11000Updated Feb 26, 2024Feb 26, 2024
    • PhenoTagger
      GAP
      MIT License
      16000Updated Jan 24, 2024Jan 24, 2024
    • rnafusion

      Public
      RNA-seq analysis pipeline for detection gene-fusions
      Nextflow
      MIT License
      98000Updated Dec 1, 2023Dec 1, 2023
    • rust-bio

      Public
      This library provides implementations of many algorithms and data structures that are useful for bioinformatics. All provided implementations are rigorously tested via continuous integration.
      Rust
      MIT License
      206000Updated Nov 18, 2023Nov 18, 2023
    • coitrees

      Public
      A very fast interval tree data structure
      Rust
      MIT License
      8000Updated Nov 10, 2023Nov 10, 2023
    • iitii

      Public
      Implicit Interval Tree with Interpolation Index
      Jupyter Notebook
      Apache License 2.0
      4000Updated Nov 9, 2023Nov 9, 2023
    • A little benchmarking tool for Python
      Python
      MIT License
      6000Updated Oct 15, 2023Oct 15, 2023
    • Python
      2000Updated Aug 24, 2023Aug 24, 2023
    • ds-images

      Public
      Shell
      Apache License 2.0
      00154Updated May 25, 2023May 25, 2023
    • sparkseq

      Public
      Scala
      Apache License 2.0
      0000Updated Mar 5, 2023Mar 5, 2023
    • popgen

      Public
      Scala
      0000Updated Mar 5, 2023Mar 5, 2023
    • Scala
      Apache License 2.0
      0000Updated Mar 5, 2023Mar 5, 2023
    • disq

      Public
      A library for manipulating bioinformatics sequencing formats in Apache Spark
      Java
      MIT License
      11100Updated Jan 29, 2023Jan 29, 2023
    • cannoli

      Public
      Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
      Scala
      Apache License 2.0
      17000Updated Jan 29, 2023Jan 29, 2023
    • Python
      2000Updated Jan 17, 2023Jan 17, 2023
    • SeQuiLa recipes, examples and other cloud-related content
      HCL
      Apache License 2.0
      1300Updated Nov 15, 2022Nov 15, 2022
    • pysequila

      Public
      Python wrapper for SeQuiLa: Distributed analytics for genomics based on Apache Spark!
      HTML
      Apache License 2.0
      1233Updated Nov 4, 2022Nov 4, 2022
    • notebooks

      Public
      Jupyter Notebook
      Apache License 2.0
      1001Updated Oct 31, 2022Oct 31, 2022
    • Python
      0000Updated Jan 23, 2022Jan 23, 2022