nabeelshan/rlhf-gpt2-pipeline

Text Generation

reinforcement-learning

instruction-tuning



Commit History

initial commit

07f58df
verified

nabeelshan committed on Sep 23