Kwai-Klear

Popular repositories Loading

  1. RL with Experience Replay

    Python 56 2

  2. Training Autoregressive Image Generation models via Reinforcement Learning

    Python 52 5

  3. Forked from SWE-agent/mini-swe-agent

    mini-swe-agent-plus: a tiny (~100 LOC) GitHub issue fixer—now with a robust multi-line text edit tool.

    Python 19 6

  4. Forked from suu990901/KlearReasoner

    CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

    Python 17

Repositories

Type
Select type

Language
Select language

Sort
Select order

Showing 10 of 14 repositories

  • Kwai-Klear/DeepSynth-Eval’s past year of commit activity

    Python

    1

    MIT 0

    0 0

    Updated Apr 4, 2026

  • CE-GPPO Public Forked from suu990901/KlearReasoner

    CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

    Kwai-Klear/CE-GPPO’s past year of commit activity

    Python

    17

    Apache-2.0

    10 0 0

    Updated Jan 23, 2026

  • Kwai-Klear/mini-swe-agent-plus’s past year of commit activity

    Python

    19

    MIT

    519 1 0

    Updated Jan 20, 2026

  • ERC Public

    Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

    Kwai-Klear/ERC’s past year of commit activity

    Python

    4

    Apache-2.0 0

    0 0

    Updated Jan 5, 2026

  • AR-GRPO Public

    Training Autoregressive Image Generation models via Reinforcement Learning

    Kwai-Klear/AR-GRPO’s past year of commit activity

    Python

    52 5 0 0

    Updated Nov 26, 2025

  • Kwai-Klear/Klear-AgentForge’s past year of commit activity

    12 1 1 0

    Updated Nov 13, 2025

  • Kwai-Klear/Klear1.0’s past year of commit activity

    19

    Apache-2.0 0

    1 0

    Updated Sep 7, 2025

  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Kwai-Klear/vllm’s past year of commit activity

    Python

    1

    Apache-2.0

    15,538 0 0

    Updated Sep 4, 2025

  • Kwai-Klear/Leanabell-Prover-V2’s past year of commit activity

    1 1 0 0

    Updated Jul 29, 2025

  • Klear-Qwen3-Thinking-Preview Public

    A practical tutorial on how to effectively use RL to enhance reasoning capabilities on the Qwen3-8B model.

    Kwai-Klear/Klear-Qwen3-Thinking-Preview’s past year of commit activity

    9

    0

    0 0

    Updated Jul 29, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…