felixcheung - Overview

Felix Cheung felixcheung

Pinned

  1. Apache Spark - A unified analytics engine for large-scale data processing

    Scala · 43.1k stars · 29.1k forks

  2. Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

    Java · 6.6k stars · 2.8k forks

  3. GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

    Scala · 1.2k stars · 266 forks

  4. Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course

    Python · 346 stars · 310 forks

  5. Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR

    Shell · 34 stars · 21 forks

  6. Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

    C++ · 16.6k stars · 4.1k forks