PGCodeLLM

Popular repositories

trl (Public)

Forked from huggingface/trl

[Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.

Python
[Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python
Forked from HKUNLP/critic-rl

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python
Forked from rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

Python
Forked from hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python