Back to Glossary
RLVR
What is RLVR?
RLVR, or Reinforcement Learning with Verifiable Rewards, is a post-training method for AI models particularly useful in arenas with clear ground-truth answers like coding or math.