openrlhf

Here are 4 public repositories matching this topic...

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.

A list of uv environments templates for LLM development.

python environment deep-learning conda pytorch venv uv llm flash-attn verl openrlhf

🌐 Streamline LLM development with ready-to-use environment templates for efficient setup and deployment.

python environment deep-learning conda pytorch venv uv llm flash-attn verl openrlhf

Add a description, image, and links to the openrlhf topic page so that developers can more easily learn about it.

To associate your repository with the openrlhf topic, visit your repo's landing page and select "manage topics."