neil yu

yxl66666

AI & ML interests

None yet

Recent Activity

published a model 3 days ago

yxl66666/VisMem

upvoted a paper 7 days ago

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

upvoted a paper 7 days ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Organizations

upvoted 2 papers 7 days ago

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Paper • 2602.05847 • Published 12 days ago • 12

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published 15 days ago • 133

upvoted a paper 10 days ago

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Paper • 2602.02474 • Published 15 days ago • 54

upvoted 2 papers 22 days ago

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published 27 days ago • 74

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published 26 days ago • 90

upvoted a paper about 2 months ago

MemEvolve: Meta-Evolution of Agent Memory Systems

Paper • 2512.18746 • Published Dec 21, 2025 • 31

upvoted 2 papers 2 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 151

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 155

upvoted 2 papers 3 months ago

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Paper • 2511.11007 • Published Nov 14, 2025 • 15

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 77

upvoted 2 papers 4 months ago

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Paper • 2510.18632 • Published Oct 21, 2025 • 22

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow

Paper • 2509.21789 • Published Sep 26, 2025 • 9

upvoted a collection 4 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 628

upvoted a paper 4 months ago

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Paper • 2509.23768 • Published Sep 28, 2025 • 49

upvoted a paper 5 months ago

DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing

Paper • 2510.02253 • Published Oct 2, 2025 • 15

upvoted a paper 8 months ago

CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for Multi-Organ Segmentation

Paper • 2506.23121 • Published Jun 29, 2025 • 2
