arxiv:2505.19731
Daniil Tiapkin
dtiapkin
AI & ML interests
Reinforcement learning enjoyer
Recent Activity
updated
a dataset about 2 months ago
dtiapkin/prompt-collection-rlhflow published
a dataset about 2 months ago
dtiapkin/prompt-collection-rlhflow upvoted a paper 3 months ago
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Organizations
None yet