Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
miniyeti's picture
1

miniyeti

miniyeti
·

AI & ML interests

None yet

Organizations

None yet

Collections 1

PRM
  • Let's reward step by step: Step-Level reward model as the Navigators for Reasoning

    Paper • 2310.10080 • Published Oct 16, 2023 • 1
  • The Lessons of Developing Process Reward Models in Mathematical Reasoning

    Paper • 2501.07301 • Published Jan 13, 2025 • 99
PRM
  • Let's reward step by step: Step-Level reward model as the Navigators for Reasoning

    Paper • 2310.10080 • Published Oct 16, 2023 • 1
  • The Lessons of Developing Process Reward Models in Mathematical Reasoning

    Paper • 2501.07301 • Published Jan 13, 2025 • 99

models 3

miniyeti/Reinforce-CartPole-v1

Reinforcement Learning • Updated Jul 9, 2025

miniyeti/q-Taxi-v3

Reinforcement Learning • Updated Jul 5, 2025

miniyeti/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jul 5, 2025

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs