25 13

吴晨

dibrimatter14

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

aixk/vlite3.6-nano-60M

upvoted a paper 3 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

upvoted a paper 4 days ago

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

View all activity

Organizations

None yet

liked a model 2 days ago

aixk/vlite3.6-nano-60M

Updated about 21 hours ago • 1

upvoted a paper 3 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 13 days ago • 191

upvoted a paper 4 days ago

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

Paper • 2605.20682 • Published 5 days ago • 81

liked a model 7 days ago

mistralai/Mistral-7B-Instruct-v0.3

7B • Updated Dec 3, 2025 • 4.36M • 2.59k

liked a model 11 days ago

fpadovani/eng_100mb_baseline

Text Generation • 0.1B • Updated 10 days ago • 251 • 1

liked a dataset 14 days ago

Tony15246/OPENUAV_DATASET

Preview • Updated about 15 hours ago • 547 • 1

upvoted a paper 17 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 18 days ago • 110

upvoted a paper 24 days ago

BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate

Paper • 2604.25203 • Published 27 days ago • 8

upvoted 2 papers about 1 month ago

Micro Language Models Enable Instant Responses

Paper • 2604.19642 • Published Apr 21 • 3

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published Apr 15 • 62

liked 2 models about 1 month ago

WarriorMama777/OrangeMixs

Text-to-Image • Updated Jan 7, 2024 • 2.59k • 3.91k

autogluon/chronos-2

Time Series Forecasting • 0.1B • Updated Nov 24, 2025 • 13.1M • 23

upvoted 3 papers about 1 month ago

liked a dataset about 2 months ago

dhruvbansalup/dlgenai-nppe-dataset

Viewer • Updated Apr 9 • 58.2k • 231 • 1

upvoted 2 papers about 2 months ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published Mar 26 • 53

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

liked a dataset about 2 months ago

AquaV/genshin-voices-separated

Updated Jul 6, 2024 • 103k • 18

upvoted a paper about 2 months ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

吴晨

AI & ML interests

Recent Activity

Organizations

dibrimatter14's activity