GOVINDFROM/MindGamesCodeNames
Reinforcement Learning
Safetensors
game-theory
codenames
neurips-2025
graph-neural-networks
preference-learning
llm-distillation
License: mit
Commit History (branch: main)
Update README.md (5c80055, verified), GOVINDFROM committed 15 days ago
Update README.md (e6db4f9, verified), GOVINDFROM committed 16 days ago
Upload model card (2890d84, verified), GOVINDFROM committed 16 days ago
Upload battleground_eval.json (e91ffab, verified), GOVINDFROM committed 16 days ago
Upload master_config.json (1f81885, verified), GOVINDFROM committed 16 days ago
Upload SFT model (43b7674, verified), GOVINDFROM committed 16 days ago
Upload policy_after_ppo.pt (f0ef1c3, verified), GOVINDFROM committed 16 days ago
Upload policy_after_distill.pt (cd470a3, verified), GOVINDFROM committed 16 days ago
Upload policy_final.pt (edb9110, verified), GOVINDFROM committed 16 days ago
initial commit (12f043b, verified), GOVINDFROM committed 16 days ago