oceanpty/TOA-Ultrafeedback-SFT-Rand-lla3.1-8b-inst
Viewer
•
Updated
•
59.9k
•
17
oceanpty/TOA-Ultrafeedback-SFT-Rand-qwen2-7b-inst
Viewer
•
Updated
•
59.9k
•
33
oceanpty/TOA-Ultrafeedback-SFT-PRS-lla3.1-8b-inst
Viewer
•
Updated
•
59.9k
•
18
oceanpty/TOA-Ultrafeedback-SFT-PRS-qwen2-7b-inst
Viewer
•
Updated
•
59.9k
•
15
oceanpty/TOA-Ultrafeedback-SFT-Ensemble-model-num-4
Viewer
•
Updated
•
59.9k
•
8
oceanpty/TOA-Ultrafeedback-SFT-SeqRefine-model-num-4
Viewer
•
Updated
•
59.9k
•
21
oceanpty/TOA-Ultrafeedback-SFT-MoA-model-num-4
Viewer
•
Updated
•
59.4k
•
16
oceanpty/TOA-Ultrafeedback-SFT-TOA-model-num-4
Viewer
•
Updated
•
59.8k
•
23
oceanpty/TOA-Ultrafeedback-DPO-TOA-model-num-4
Viewer
•
Updated
•
57.1k
•
8
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-Rand-lla31-8b-inst
8B
•
Updated
•
13
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-PRS-lla31-8b-inst
8B
•
Updated
•
13
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-ensemble
8B
•
Updated
•
10
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-SeqRefine
8B
•
Updated
•
5
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-MoA
8B
•
Updated
•
13
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-TOA
8B
•
Updated
•
15
oceanpty/TOA-ultrafeedback-lla3-8b-inst-dpo-data-small-scale-mcts-n-40-pi-0-ni-30
8B
•
Updated
•
9