RL RAG

Team

Recent Activity

akariasai

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

JingmingZ

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

shannons

authored 2 papers 11 days ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Paper • 2406.07835 • Published Jun 10, 2024 • 1

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10 • 14

hamishivi

authored 2 papers 11 days ago

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published 26 days ago • 13

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

shannons

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

rulins

updated a dataset about 1 month ago

rl-rag/1_sample_toy_rag_survey

Viewer • Updated Oct 24 • 8 • 34

rulins

published a dataset about 1 month ago

rl-rag/1_sample_toy_rag_survey

Viewer • Updated Oct 24 • 8 • 34

rulins

updated a dataset about 2 months ago

rl-rag/1_sample_toy

Viewer • Updated Oct 22 • 30 • 26

rulins

published a dataset about 2 months ago

rl-rag/1_sample_toy

Viewer • Updated Oct 22 • 30 • 26

rulins

updated a model about 2 months ago

rl-rag/rar_cb_bs_16_rollout_811759453746_checkpoints_step_100

333k • Updated Oct 11 • 2

rulins

published a model about 2 months ago

rl-rag/rar_cb_bs_16_rollout_811759453746_checkpoints_step_100

333k • Updated Oct 11 • 2

rulins

updated a dataset 2 months ago

rl-rag/rl-rag-RaR-Medicine-3k-o3-mini-converted

Viewer • Updated Oct 6 • 3k • 16

rulins

published a dataset 2 months ago

rl-rag/rl-rag-RaR-Medicine-3k-o3-mini-converted

Viewer • Updated Oct 6 • 3k • 16

rulins

updated a model 2 months ago

rl-rag/qwen3-8B-sft-mix-v20250921-plus-v20251001-onpolicy-rs-longform_0921

Text Generation • 8B • Updated Oct 6 • 146

rulins

published a model 2 months ago

rl-rag/qwen3-8B-sft-mix-v20250921-plus-v20251001-onpolicy-rs-longform_0921

Text Generation • 8B • Updated Oct 6 • 146

akariasai

updated a dataset 2 months ago

rl-rag/dpo_lf_sft0921_rubric_citation

Viewer • Updated Oct 3 • 1.32k • 18

akariasai

published a dataset 2 months ago

rl-rag/dpo_lf_sft0921_rubric_citation

Viewer • Updated Oct 3 • 1.32k • 18

akariasai

updated a dataset 2 months ago

rl-rag/sft_rejection_sampled_on_policy_long-_form_sft_0921

Viewer • Updated Oct 3 • 2.22k • 21

Team members 7