Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback Text Classification • 7B • Updated Feb 5, 2025 • 327 • 11
Ray2333/gpt2-large-helpful-reward_model Text Classification • 0.8B • Updated Jun 2, 2024 • 56.3k • • 13
Ray2333/gpt2-large-harmless-reward_model Text Classification • 0.8B • Updated Jun 2, 2024 • 63.5k • 4