Highlights
- Pro
Pinned Loading
-
thunlp/OpenBackdoor
thunlp/OpenBackdoor PublicAn open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
-
OpenBMB/UltraFeedback
OpenBMB/UltraFeedback PublicA large-scale, fine-grained, diverse preference dataset (and models).
-
-
PRIME-RL/PRIME
PRIME-RL/PRIME PublicScalable RL solution for advanced reasoning of language models
-
PRIME-RL/ImplicitPRM
PRIME-RL/ImplicitPRM PublicRepo of paper "Free Process Rewards without Process Labels"
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.