Hi, I'm Jason (Zihan Dai) ๐
Bachelor's student @ University of Melbourne | CS & AI & Data Science
Currently focusing on LLM alignment, Text-to-SQL, and AI systems engineering.
๐ค LLM & AI Research
| Project | Description | Tech Stack |
|---|---|---|
| FusionSQL | Text-to-SQL solution with BGE-M3 fusion retrieval (table-level โช column-level), achieving 95.7% recall rate | Python, BGE-M3, Qwen, FlagEmbedding |
| text2sql_research | Text-to-SQL with graph structure & RAG | Python, LangChain, Graph DB |
| LLM-Research-Internship-2025 | Technical notes on LLM alignment & prompt engineering | Markdown |
| 2024_ai_intership_log | AI system development: multimodal ad review with ReAct architecture | Python, FastAPI, LangChain, Docker |
๐ง Engineering Projects
| Project | Description | Tech Stack |
|---|---|---|
| MarsCode_Winter_project | Heimdallr: Full-stack frontend monitoring system with SDK, backend & dashboard | TypeScript, Vue, MySQL |
๐ Data Science
| Project | Description | Tech Stack |
|---|---|---|
| EODP-project | Crime rate prediction in Victoria using socio-economic factors | Python, Jupyter, Scikit-learn |
๐ Course Projects
| Project | Description | Tech Stack |
|---|---|---|
| OOSD-project | Shadow Donkey Kong: 2D arcade game with enemy system | Java, Bagel Engine |
| ads-project2 | Algorithm & data structure implementations | C |
๐ Open Source Contributions
| Project | Description | Status |
|---|---|---|
| apache/beam | Distributed data processing framework | 3 PRs merged |
| apache/shardingsphere | Database middleware | 2 PRs merged |
| apache/iceberg | Open table format | 1 PR merged |
| apache/iotdb | Time-series database | PRs in review |
| apache/seatunnel | Data integration platform | PRs in review |
| opencv/opencv | Computer vision library | 1 PR merged |
Tech Stack
Languages: Python ยท TypeScript ยท Java ยท C ยท SQL
AI/ML: PyTorch ยท LangChain ยท Transformers ยท RAG ยท ReAct
Backend: FastAPI ยท Node.js ยท MySQL
Tools: Docker ยท Git ยท Jupyter