Jaeyoon Jung PRO

lastdefiance20

AI & ML interests

multimodal

Recent Activity

liked a model 9 days ago

Overworld/Waypoint-1-Medium

upvoted a paper 9 days ago

Causal World Modeling for Robot Control

upvoted a paper 9 days ago

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

upvoted 2 papers 9 days ago

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published 20 days ago • 30

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Paper • 2602.06949 • Published 12 days ago • 30

upvoted a paper 10 days ago

Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

Paper • 2602.06291 • Published 13 days ago • 23

upvoted a collection 30 days ago

Physical AI

Collection of open, commercial-grade datasets for physical AI developers • 28 items • Updated 6 days ago • 123

upvoted a paper about 1 month ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

upvoted 4 papers 4 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 86

Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech data

Paper • 2509.15389 • Published Sep 18, 2025 • 3

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 143

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 508

upvoted a paper 7 months ago

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Paper • 2507.07990 • Published Jul 10, 2025 • 46

upvoted 3 papers 9 months ago

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24, 2025 • 36

Let's Predict Sentence by Sentence

Paper • 2505.22202 • Published May 28, 2025 • 19

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16, 2025 • 57

upvoted 2 papers 10 months ago

TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published Apr 29, 2025 • 22

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21, 2025 • 78

upvoted a collection 10 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.67k

upvoted 4 papers 11 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7, 2025 • 57

KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language

Paper • 2503.23730 • Published Mar 31, 2025 • 3

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25, 2025 • 55