Big Data Genomics

Popular repositories Loading

  1. ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

    Scala 1k 314

  2. A scalable genome browser. Apache 2 licensed.

    Scala 127 31

  3. A Variant Caller, Distributed. Apache 2 licensed.

    Scala 71 41

  4. Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.

    Shell 41 35

  5. Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.

    Scala 41 17

  6. Ready-to-go Parquet-formatted public 'omics datasets

    Python 30 8