oamazonasgabriel - Overview
This project demonstrate how to process data stored in a data lake fashion, transforming it into an OLAP optimized structure by using PySpark. The PySpark Job runs on AWS EMR, and the Data Pipelineā¦
Python 8 4
This is a quick start guide for the Delta Lake (delta.io) Python Spark connector, running on AWS Glue.
Python 5
Sample Cloud Formation Template YAML configurations
Python 1
Flame š„ Opinionated Flask & MongoDB backend boilerplate.
Python 4
AWS Glue PySpark - Apache Hudi Quick Start Guide
Repository for the code demoed in the talk
Jupyter Notebook 4 1