OpenClaw-RL: Train any agent simply by talking
Updated Mar 27, 2026 - Python
A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.
A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation of large language models.
🛠️ Apply on-policy distillation to enhance Qwen3-0.6B's performance on GSM8K by training on its own sampled outputs, reducing exposure bias at inference time.
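In on-policy distillation, the student samples its own sequences and is then scored against the teacher's next-token distribution at each position, typically with a reverse KL divergence, KL(student || teacher). A minimal toy sketch of that per-token loss (plain Python with made-up logits; the function names and shapes are illustrative, not any repo's actual API):

```python
import math

def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def reverse_kl(student_logits, teacher_logits):
    """KL(student || teacher) for one token position.

    Mode-seeking: the student is penalized for putting probability
    mass where the teacher assigns little, which is why on-policy
    distillation tends to avoid over-smoothed student behavior."""
    p = softmax(student_logits)   # student distribution (the sampling policy)
    q = softmax(teacher_logits)   # teacher distribution (the target)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def sequence_loss(student_token_logits, teacher_token_logits):
    """Mean per-token reverse KL over a student-sampled sequence."""
    kls = [reverse_kl(s, t)
           for s, t in zip(student_token_logits, teacher_token_logits)]
    return sum(kls) / len(kls)

# Toy example: a two-token sequence over a vocabulary of size 3.
student = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3]]
teacher = [[2.2, 0.4, 0.0], [0.1, 1.7, 0.2]]
print(round(sequence_loss(student, teacher), 4))
```

In a real training loop this loss would be computed on logits from the student's own generations and backpropagated through the student only; the teacher is frozen.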
Train and customize OpenClaw agents using reinforcement learning with simple language feedback and fully asynchronous optimization.