Cloudera Inc
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user …
Java 361 233
Ansible playbooks for deploying Hortonworks Data Platform and DataFlow using Ambari Blueprints
Python 249 249
StreamLine - Streaming Analytics
Java 167 95
Vagrant files creating multi-node virtual Hadoop clusters with or without security.
HTML 67 51
A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.
Java 42 32