Ruiyi Wang

ruiyiwang
https://ruiyiw.github.io
  • RuiyiWang153
  • ruiyiw

AI & ML interests

social agents, LLM reasoning, reinforcement learning

Recent Activity

updated a dataset 7 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits
published a dataset 7 days ago
ruiyiwang/grpo-qwen1.5b-textworld-policy-logits
updated a model 25 days ago
ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3

Organizations

None yet

models 7

ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-3

Updated 25 days ago

ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4-param-2

Updated 25 days ago

ruiyiwang/grpo-qwen-1.5b-textworld-w2-o3-q4

Updated 26 days ago

ruiyiwang/alfworld-qwen-7b-sft-admissible

Updated Nov 26, 2025

ruiyiwang/SFT-alfworld-text-only-Qwen2.5-VL-7B-Instruct

Updated Nov 20, 2025

ruiyiwang/SFT-alfworld-visual-text-Qwen2.5-VL-7B-Instruct

Updated Nov 20, 2025

ruiyiwang/SFT-alfworld-visual-only-Qwen2.5-VL-7B-Instruct

Updated Nov 20, 2025

datasets 3

ruiyiwang/grpo-qwen1.5b-textworld-policy-logits

Viewer • Updated 7 days ago • 8.9k • 38

ruiyiwang/meow-tea-oolong-dataset

Viewer • Updated Nov 21, 2025 • 13.1k • 2

ruiyiwang/ALFRED

Viewer • Updated Nov 4, 2025 • 6.83k • 3