PDGGK - Overview

Hi, I'm Jason (Zihan Dai) ๐Ÿ‘‹

Bachelor's student @ University of Melbourne | CS & AI & Data Science

Currently focusing on LLM alignment, Text-to-SQL, and AI systems engineering.


๐Ÿค– LLM & AI Research

Project Description Tech Stack
FusionSQL Text-to-SQL solution with BGE-M3 fusion retrieval (table-level โˆช column-level), achieving 95.7% recall rate Python, BGE-M3, Qwen, FlagEmbedding
text2sql_research Text-to-SQL with graph structure & RAG Python, LangChain, Graph DB
LLM-Research-Internship-2025 Technical notes on LLM alignment & prompt engineering Markdown
2024_ai_intership_log AI system development: multimodal ad review with ReAct architecture Python, FastAPI, LangChain, Docker

๐Ÿ”ง Engineering Projects

Project Description Tech Stack
MarsCode_Winter_project Heimdallr: Full-stack frontend monitoring system with SDK, backend & dashboard TypeScript, Vue, MySQL

๐Ÿ“Š Data Science

Project Description Tech Stack
EODP-project Crime rate prediction in Victoria using socio-economic factors Python, Jupyter, Scikit-learn

๐Ÿ“š Course Projects

Project Description Tech Stack
OOSD-project Shadow Donkey Kong: 2D arcade game with enemy system Java, Bagel Engine
ads-project2 Algorithm & data structure implementations C

๐ŸŒ Open Source Contributions

Project Description Status
apache/beam Distributed data processing framework 3 PRs merged
apache/shardingsphere Database middleware 2 PRs merged
apache/iceberg Open table format 1 PR merged
apache/iotdb Time-series database PRs in review
apache/seatunnel Data integration platform PRs in review
opencv/opencv Computer vision library 1 PR merged

Tech Stack

Languages: Python ยท TypeScript ยท Java ยท C ยท SQL

AI/ML: PyTorch ยท LangChain ยท Transformers ยท RAG ยท ReAct

Backend: FastAPI ยท Node.js ยท MySQL

Tools: Docker ยท Git ยท Jupyter