What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
Updated a dataset 3 days ago: JackBAI/jack-latest-vllm-stack
Published a dataset 3 days ago: JackBAI/jack-latest-vllm-stack
Authored a paper 9 days ago: InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning