Xuankun Rong's picture

2 15 2

Xuankun Rong

XuankunRong

·

https://xuankunrong.github.io/

XuankunRong

AI & ML interests

AI Safety

Recent Activity

upvoted a paper 9 days ago

Latent Collaboration in Multi-Agent Systems

authored a paper 15 days ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

upvoted a paper 18 days ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 11 days ago • 110

upvoted a paper 18 days ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Paper • 2511.12982 • Published 19 days ago • 3

upvoted a paper 2 months ago

MAPO: Mixed Advantage Policy Optimization

Paper • 2509.18849 • Published Sep 23 • 26

upvoted a paper 3 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

upvoted 2 papers 7 months ago

Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Paper • 2505.16916 • Published May 22 • 17

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Paper • 2505.11049 • Published May 16 • 60

upvoted 3 papers 8 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21 • 47

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Paper • 2504.00502 • Published Apr 1 • 25

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29 • 46

upvoted 2 papers 9 months ago

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Paper • 2503.19622 • Published Mar 25 • 31

A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations

Paper • 2502.14881 • Published Feb 14 • 2

upvoted 3 papers 10 months ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published Feb 17 • 32

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 24

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Paper • 2502.09621 • Published Feb 13 • 28

upvoted a paper about 1 year ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 70