rgruener - Overview

Robert Gruener rgruener

  • Pittsburgh

Organizations

@create-at-cooper

Pinned

  1. The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch…

    Python 1.9k 285

  2. Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

    C++ 16.7k 4.1k

  3. A Scala API for Apache Beam and Google Cloud Dataflow.

    Scala 2.6k 526

  4. Apache Beam is a unified programming model for Batch and Streaming data processing.

    Java 8.5k 4.5k