felixcheung - Overview
Apache Spark - A unified analytics engine for large-scale data processing
Scala 43.1k 29.1k
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Java 6.6k 2.8k
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
Scala 1.2k 266
Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course
Python 346 310
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
Shell 34 21
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
C++ 16.6k 4.1k