8 9 2

Baohao Liao

baohao

https://baohaoliao.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper 11 days ago

3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability

updated a dataset 21 days ago

baohao/Fineweb-Edu-1BT-len2048

published a dataset 21 days ago

baohao/Fineweb-Edu-1BT-len2048

View all activity

Organizations

upvoted a paper 11 days ago

3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability

Paper • 2409.00119 • Published Aug 28, 2024 • 1

upvoted a collection 2 months ago

Reinforce-Ada

Collection

Training & test sets and finetuned models • 19 items • Updated Oct 26 • 3

upvoted a paper 2 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published Oct 6 • 15

upvoted an article 2 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

Sep 22

•

120

upvoted a paper 5 months ago

Lost at the Beginning of Reasoning

Paper • 2506.22058 • Published Jun 27 • 1

upvoted 2 papers 7 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 23

Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation

Paper • 2505.06027 • Published May 9 • 18

upvoted an article 9 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11

•

upvoted a paper 10 months ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 39