PGCodeLLM
Popular repositories Loading
-
trl trl Public
Forked from huggingface/trl
[Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.
Python
-
[Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python
-
Forked from HKUNLP/critic-rl
Code for Paper: Teaching Language Models to Critique via Reinforcement Learning
Python
-
Forked from hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python