Blog — Ling Yang

LLM Reasoning

LLM Reasoning via Thought Templates

Exploring how thought-augmented reasoning and template distillation can unlock stronger problem-solving capabilities in large language models — from Buffer of Thoughts to ReasonFlux.

BoT, SuperCorrect, ReasonFlux


Self-Evolving AI

Building AI systems that co-evolve environments, policies, and reward models — from code generation with reinforcement learning to personalized agent fine-tuning through natural conversation.

CURE, RLAnything, OpenClaw-RL


Diffusion Language Models

Advancing the frontier of diffusion-based language modeling — from reinforcement learning frameworks for discrete diffusion to multimodal generation and efficient parallel decoding.

TraceRL, MMaDA, MMaDA-Parallel
