Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Penghui Qi's picture
5 29 8

Penghui Qi

QPHutu
exoplanet's profile picture Stars321123's profile picture dreamerdeo's profile picture
ยท
  • QPHutu

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Experiential Reinforcement Learning
authored a paper 17 days ago
Rethinking the Trust Region in LLM Reinforcement Learning
upvoted a paper 17 days ago
Rethinking the Trust Region in LLM Reinforcement Learning
View all activity

Organizations

Sea AI Lab's profile picture

liked 3 datasets 3 months ago

LLM360/guru-RL-92k

Viewer โ€ข Updated Aug 20, 2025 โ€ข 91.9k โ€ข 936 โ€ข 45

zwhe99/DeepMath-103K

Viewer โ€ข Updated May 29, 2025 โ€ข 103k โ€ข 9.22k โ€ข 351

sail/Sanity-Test-R1D-1.5B

Viewer โ€ข Updated Nov 15, 2025 โ€ข 1.52k โ€ข 42 โ€ข 7
liked a model 3 months ago

zz1358m/SofT-GRPO-master

Updated Nov 13, 2025 โ€ข 8
liked a dataset 5 months ago

SynthLabsAI/Big-Math-RL-Verified

Viewer โ€ข Updated Mar 25, 2025 โ€ข 251k โ€ข 5.97k โ€ข 222
liked a Space over 1 year ago
Sleeping
4

Pipeline Parallellism with Controllable Memory

๐Ÿ†
4

Calculate and visualize pipeline schedules

liked a Space almost 2 years ago
Runtime error
150

EditAnything

๐Ÿฆ€
150

liked a Space about 2 years ago
Running
21

Zero Bubble Pipeline Parallellism

๐Ÿ†
21

Calculate and visualize pipeline schedules

Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs