Big Data Genomics
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Scala 1k 314
A scalable genome browser. Apache 2 licensed.
Scala 127 31
A Variant Caller, Distributed. Apache 2 licensed.
Scala 71 41
Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
Shell 41 35
Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
Scala 41 17
Ready-to-go Parquet-formatted public 'omics datasets
Python 30 8