Hugging Face
Hao Peng

Wesleythu
  • h-peng17

AI & ML interests

None yet

Organizations

Knowledge Engineer Group @ Tsinghua University

New activity in huggingface/InferenceSupport 10 months ago

THU-KEG/TULU3-VerIF

#3578 opened 10 months ago by Wesleythu
commented a paper 12 months ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published Jun 11, 2025 • 5 • 2
New activity in huggingface/HuggingDiscussions 12 months ago

[FEEDBACK] Daily Papers

🔥❤️ 21
203
#32 opened almost 2 years ago by kramp
commented 2 papers over 1 year ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26, 2025 • 23 • 2

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Paper • 2410.24175 • Published Oct 31, 2024 • 18 • 2