harrydevforlife - Overview

Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.

Python 39 8

Real-time data processing using Delta Pipeline Architecture, use Databricks Lakehouse to store Delta tables.

1 2

Forked from ashvardanian/NumKong

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …

C

Starlink – an educational SQL query engine in Python, built on Arrow.

Python 1