7 19

Wenxuan Luo

MateoAdams2

AI & ML interests

Research on LLM agents and evaluation. Mostly focused on experiments.

Recent Activity

liked a model 3 days ago

BeadiestStar64/yata_jupyter

liked a model 4 days ago

Qwen/Qwen2.5-7B-Instruct

upvoted a paper 5 days ago

Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents

View all activity

Organizations

None yet

liked a model 3 days ago

BeadiestStar64/yata_jupyter

Updated 3 days ago • 1

liked a model 4 days ago

Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 12.5M • • 1.33k

upvoted a paper 5 days ago

Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents

Paper • 2605.29447 • Published 10 days ago • 20

liked a dataset 5 days ago

hyunnluna/nova5-dataset

Updated 5 days ago • 35 • 1

liked a Space 7 days ago

ProtectBirds

🏃

346

Protect Birds

liked a dataset 7 days ago

Data-Gouv-FR/caracteristiques-et-localisation-des-stations-de-recharge-supercharger-tesla

Viewer • Updated 7 days ago • 117 • 55 • 1

liked a dataset 10 days ago

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 692k • 545

liked a dataset 16 days ago

QingyiSi/Alpaca-CoT

Preview • Updated Sep 14, 2023 • 11.4k • 771

liked a dataset 17 days ago

quintelamanuel/political-leaderboard-results

Viewer • Updated 2 days ago • 1 • 2.61k • 2

liked 2 models 20 days ago

openbmb/MiniCPM-V-4.6

Image-Text-to-Text • 1B • Updated 3 days ago • 596k • 1.1k

intfloat/multilingual-e5-small

liked a dataset 24 days ago

hariimoto/gt

Updated 11 days ago • 6.94k • 2

liked a model 27 days ago

newtalent001/co-h3

Updated 27 days ago • 1

upvoted a paper about 1 month ago

How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum

Paper • 2604.25907 • Published Apr 28 • 4

liked a model about 1 month ago

amazon/chronos-2

Time Series Forecasting • 0.1B • Updated 2 days ago • 13.5M • 311

liked a model about 2 months ago

inclusionAI/LLaDA2.0-Uni

Any-to-Any • 16B • Updated 11 days ago • 7.47k • 247

liked a dataset about 2 months ago

rkmr07/tourism-prediction

Preview • Updated Apr 12 • 1 • 1

upvoted 3 papers about 2 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 327

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

Paper • 2604.05643 • Published Apr 7 • 13

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 632

Wenxuan Luo

AI & ML interests

Recent Activity

Organizations

MateoAdams2's activity

ProtectBirds