1 71 5

james curry

ainbo

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Cosmos 3: Omnimodal World Models for Physical AI

upvoted a paper 11 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

upvoted a paper 11 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

View all activity

Organizations

upvoted a paper 4 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 8 days ago • 101

upvoted 3 papers 11 days ago

upvoted a paper 12 days ago

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Paper • 2605.25979 • Published 15 days ago • 27

upvoted a paper 14 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published 20 days ago • 109

upvoted a paper 23 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 26 days ago • 111

upvoted a paper 24 days ago

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published 26 days ago • 62

upvoted 2 papers 25 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 27 days ago • 60

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published 27 days ago • 101

upvoted a paper 26 days ago

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Paper • 2605.12496 • Published 28 days ago • 29

upvoted a paper 27 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 28 days ago • 191

upvoted a paper 29 days ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published May 7 • 52

upvoted 5 papers about 1 month ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 80

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 90

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published Apr 29 • 108

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 118

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Paper • 2604.23781 • Published Apr 26 • 33

upvoted a paper about 2 months ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published Apr 20 • 46

upvoted an article about 2 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 164

james curry

AI & ML interests

Recent Activity

Organizations

ainbo's activity

NEO-unify: Building Native Multimodal Unified Models End to End