Hugging Face

james

jamesjunyuguo · https://jamesjunyuguo.github.io/

AI & ML interests

None yet

Organizations

UC Berkeley

Collections 1

reward modelling
  • Inference-Time Scaling for Generalist Reward Modeling

    Paper • 2504.02495 • Published Apr 3, 2025 • 57

models 11

jamesjunyuguo/llama-3-3b-math-orca-qlora-10k-ep1

Updated Jun 13, 2025

jamesjunyuguo/dpo-llama-3-1-8b-math

Text Generation • 8B • Updated Apr 23, 2025 • 4

jamesjunyuguo/llama-3-1-8b-math-orca-qlora-10k-ep1

Updated Apr 23, 2025

jamesjunyuguo/llama-3-1-8b-sft

Updated Apr 16, 2025

jamesjunyuguo/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Apr 10, 2025 • 14

jamesjunyuguo/qwen-2.5-3b-r1-distort-4.0

3B • Updated Mar 13, 2025 • 5

jamesjunyuguo/qwen-2.5-3b-r1-distort-1.0

Text Generation • 3B • Updated Mar 13, 2025 • 8

jamesjunyuguo/qwen-2.5-3b-r1-distort-3.0

Text Generation • 3B • Updated Mar 13, 2025 • 5

jamesjunyuguo/qwen-2.5-3b-r1-distort

3B • Updated Mar 13, 2025 • 5

jamesjunyuguo/llama-3-1-8b-math-orca-qlora-10k-ep1-merged

8B • Updated Feb 28, 2025 • 4

datasets 1

jamesjunyuguo/philschmid-llama-3-1-8b-math-orca-spectr-philschmid-DMath-candidates

Viewer • Updated Jul 24, 2025 • 1.96k • 9