Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
qihoo360
/
Light-R1-14B-DS
like
37
Follow
北京奇虎科技有限公司
281
Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
arxiv:
2503.10460
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
5
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (2)
Sort: Recently created
GRPO dataset?
#5 opened 12 months ago by
Armaan11
Update README.md
#3 opened 12 months ago by
dyc-sh
Can you release online RL prompts ?
#2 opened 12 months ago by
sparsh35