Jon Peng
jonp07
AI & ML interests
None yet
Recent Activity
authored a paper about 2 months ago
HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents
updated a model about 2 months ago
jonp07/GRPO-ALFWorld
published a model about 2 months ago
jonp07/GRPO-ALFWorld