Infinity-Parser Collection Reinforcement Learning Document Parser and High-Quality Synthetic Dataset. • 4 items • Updated Oct 27
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning Paper • 2505.24850 • Published May 30 • 8
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published Apr 10 • 43
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values Paper • 2504.05535 • Published Apr 7 • 44
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 127
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 127
CostFormer:Cost Transformer for Cost Aggregation in Multi-view Stereo Paper • 2305.10320 • Published May 17, 2023 • 1