AI & ML interests
None defined yet.
models
27
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft_prefix_kl0.005
0.4B
•
Updated
•
6
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl
0.4B
•
Updated
•
4
AdversarialRLHF/ppo_pythia410m_tldr6.9b_rm410mdata_mergedsft_prefix_nokl
0.4B
•
Updated
•
4
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_propsft_propprefix_nokl
0.4B
•
Updated
•
4
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft_prefix_nokl
0.4B
•
Updated
•
4
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft_propprefix_nokl
Updated
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_allprefixsft_prefix
0.4B
•
Updated
•
6
AdversarialRLHF/pythia410m-rm-tldr6.9b_prefix_in_chosen
Text Classification
•
0.4B
•
Updated
•
12
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata
0.4B
•
Updated
•
15
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft_prefix
0.4B
•
Updated
•
3
datasets
43
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl_checkpoint-184_eval-dataset
Viewer
•
Updated
•
6.45k
•
7
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl_checkpoint-26_eval-dataset
Viewer
•
Updated
•
6.45k
•
7
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl_checkpoint-78_eval-dataset
Viewer
•
Updated
•
6.45k
•
5
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl_checkpoint-52_eval-dataset
Viewer
•
Updated
•
6.45k
•
8
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl_checkpoint-104_eval-dataset
Viewer
•
Updated
•
6.45k
•
8
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft0.3_prefix_nokl_checkpoint-255_eval-dataset
Viewer
•
Updated
•
6.45k
•
6
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_mergedsft_prefix_nokl_checkpoint-255_eval-dataset
Viewer
•
Updated
•
6.45k
•
8
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_checkpoint-255_eval-dataset
Viewer
•
Updated
•
6.45k
•
7
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_checkpoint-104_eval-dataset
Viewer
•
Updated
•
6.45k
•
7
AdversarialRLHF/rloo_pythia410m_tldr6.9b_rm410mdata_checkpoint-52_eval-dataset
Viewer
•
Updated
•
6.45k
•
11