kokolerk - Overview
Hi there 👋 I'm Jiaqi Wang 🌱 I did my undergraduate studies at HIT. I'm now enjoying the third year of my Ph.D. at CUHK.
[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
Python 55 4
A repository curated by the MLNLP community to help everyone avoid small mistakes in paper submissions. Paper Writing Tips
4.4k 526
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).
Python 579 59
[NeurIPS 2025] A Signed Graph Approach to Understanding and Mitigating Oversmoothing in GNNs
Python 2
[TMLR 2025] DivIL: Unveiling and Addressing Over-Invariance for Out-of-Distribution Generalization
Python 4