Zhiding Yu

Zhiding

https://research.nvidia.com/person/zhiding-yu

Chrisding

AI & ML interests

None yet

Recent Activity

updated a Space about 13 hours ago

nvidia/LocateAnything

liked a Space about 18 hours ago

akhaliq/LocateAnything

commented on a paper 2 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Organizations

updated a Space about 13 hours ago

LocateAnything

💬

Detect and label objects in images and videos

liked a Space about 18 hours ago

LocateAnything

💬

Detect and annotate objects in images or videos

commented on a paper 2 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 5 days ago • 122

authored a paper 3 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 5 days ago • 122

upvoted a paper 3 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 5 days ago • 122

liked a model 4 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 3 days ago • 18.3k • 456

liked a Space 4 days ago

LocateAnything

💬

Detect and label objects in images and videos

updated a model 5 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 3 days ago • 18.3k • 456

updated a collection 2 months ago

Eagle

Collection

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input. • 17 items • Updated about 20 hours ago • 45

updated a collection 3 months ago

Eagle

Collection

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input. • 17 items • Updated about 20 hours ago • 45

authored a paper 4 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

upvoted a paper 4 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

updated a collection 4 months ago

Eagle

Collection

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input. • 17 items • Updated about 20 hours ago • 45

upvoted a paper 5 months ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 55

liked a model 5 months ago

nvidia/llama-nemotron-embed-vl-1b-v2

upvoted a paper 6 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

New activity in nvidia/Eagle2.5-8B 6 months ago

Fix KeyError Bug to support in SGLang

#13 opened 7 months ago by

jonahbernard

upvoted an article 7 months ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

nvidia

•

Jun 27, 2025

• 31
