dingn42 - Overview

View dingn42's full-sized avatar

:electron:

StingNing dingn42

:electron:

Build something that works.

  • Tsinghua University

  • Earth

Block or report dingn42

Pinned Loading

  1. [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python 1.6k 102

  2. Scalable RL solution for advanced reasoning of language models

    Python 1.8k 111

  3. [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python 1.1k 81

  4. Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

    Python 2.8k 137

  5. MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

    Jupyter Notebook 8.8k 569

  6. An Open-Source Framework for Prompt-Learning.

    Python 4.9k 485