PGCodeLLM

Popular repositories Loading

  1. trl trl Public

    Forked from huggingface/trl

    [Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.

    Python

  2. [Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

    Python

  3. Forked from HKUNLP/critic-rl

    Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

    Python

  4. Forked from rllm-org/rllm

    Democratizing Reinforcement Learning for LLMs

    Python

  5. Forked from hiyouga/LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Python