Lichang-Chen (Lichang Chen)

arxiv:2502.19613

arxiv:2410.12219

arxiv:2409.13156

spaces 2

Reward Decomposition

🌖

DEFT

👁

models 64

datasets 18

Lichang-Chen/omnixR-data

Viewer • Updated Nov 26, 2024 • 1.4k • 5

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3

Viewer • Updated Sep 18, 2024 • 800 • 11

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2

Viewer • Updated Sep 18, 2024 • 800 • 1

Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1

Viewer • Updated Sep 18, 2024 • 800 • 3

Lichang-Chen/dpo_it_attack_list_and_bold

Viewer • Updated Sep 18, 2024 • 800 • 4

Lichang-Chen/llama3_it_dpo_attack_list_2epoch

Viewer • Updated Sep 18, 2024 • 800 • 2

Lichang-Chen/llama3_it_dpo_attack_bold_2epoch

Viewer • Updated Sep 18, 2024 • 800 • 4

Lichang-Chen/dpo_it_unbiased_ver3

Viewer • Updated Sep 18, 2024 • 800 • 4

Lichang-Chen/list_training_pairs

Viewer • Updated Sep 18, 2024 • 1k • 4

Lichang-Chen/bold_training_pairs

Viewer • Updated Sep 18, 2024 • 745 • 3

View 18 datasets

Lichang Chen

AI & ML interests

Organizations

Collections 1

Lichang-Chen/ODIN_L1_O1

Lichang-Chen/ODIN_L1

Lichang-Chen/ODIN-ReMax-L230-best

Lichang-Chen/ODIN-ReMax-L255-best

Lichang-Chen/ODIN_L1_O1

Lichang-Chen/ODIN_L1

Lichang-Chen/ODIN-ReMax-L230-best

Lichang-Chen/ODIN-ReMax-L255-best

Papers 13

spaces 2

Reward Decomposition

DEFT

models 64

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-2

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-1

Lichang-Chen/game-play-point25-50

Lichang-Chen/multi-attempts-multi-examples-Jan9

Lichang-Chen/multi-turn-Jan5

Lichang-Chen/multi-turn-Jan4

Lichang-Chen/llama3-dpo-single-turn-point2247-dec15

Lichang-Chen/llama-8b-gemini-point60-100-wo-cot

Lichang-Chen/llama-8b-gemini-point21-60-wo-cot

datasets 18

Lichang-Chen/omnixR-data

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3

Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2

Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1

Lichang-Chen/dpo_it_attack_list_and_bold

Lichang-Chen/llama3_it_dpo_attack_list_2epoch

Lichang-Chen/llama3_it_dpo_attack_bold_2epoch

Lichang-Chen/dpo_it_unbiased_ver3

Lichang-Chen/list_training_pairs

Lichang-Chen/bold_training_pairs

Lichang Chen

AI & ML interests

Organizations

Collections 1

Papers 13

spaces 2 Sort: Recently updated

Reward Decomposition

DEFT

models 64 Sort: Recently updated

datasets 18 Sort: Recently updated

spaces 2

models 64

datasets 18