-
Notifications
You must be signed in to change notification settings - Fork 526
Pull requests: allenai/open-instruct
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Stabilize GRPO LLM judge calls by routing them through the guarded LiteLLM helper
#1587
opened Apr 3, 2026 by
taivu1998
Loading…
grpo_fast: harden single-node startup resource checks and diagnostics
#1586
opened Apr 3, 2026 by
taivu1998
Loading…
Changes
DataPreparationActor so that we can configure it into a replay buffer
#1583
opened Apr 2, 2026 by
finbarrtimbers
Loading…
Wire evolving rubric config flags into GRPO training loop
#1581
opened Mar 31, 2026 by
RulinShao
Loading…
3 of 4 tasks
Rename num_unique_prompts_rollout and num_samples_per_prompt_rollout
#1538
opened Mar 19, 2026 by
finbarrtimbers
Loading…
Add DeepSpeed universal checkpoint (UCP) support for GRPO
#1517
opened Mar 7, 2026 by
MohdElgaar
Loading…
Migrate to vLLM 0.16.0 native weight transfer API
#1515
opened Mar 6, 2026 by
finbarrtimbers
Loading…
Add SLR-Bench (Scalable Logical Reasoning) verifier and dataset support for RLVR
#1511
opened Mar 6, 2026 by
lukashelff
Loading…
Rename TIS ratio cap, add low bound and hard filter flag
#1503
opened Mar 2, 2026 by
finbarrtimbers
Loading…
Add AppWorld environment integration for GRPO
#1501
opened Feb 27, 2026 by
hamishivi
Loading…
3 tasks done
Fix dataset mixer split validation in combined datasets
#1494
opened Feb 24, 2026 by
MohdElgaar
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.