DavidDeng

ZiHDeng

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a paper 5 days ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

liked a dataset 7 months ago

jylins/videoxum

View all activity

Organizations

None yet

upvoted an article 3 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

24 days ago

•

535

upvoted a paper 5 days ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Paper • 2512.16793 • Published 9 days ago • 71

liked a dataset 7 months ago

jylins/videoxum

Viewer • Updated Apr 22, 2024 • 14k • 212 • 14

upvoted an article 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

263

liked a model 10 months ago

jingyaogong/MiniMind2

0.1B • Updated 15 days ago • 868 • 80

upvoted a paper 10 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123

updated 3 models almost 2 years ago

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v6

Viewer • Updated Feb 7, 2024 • 6.21k • 16

updated a model almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-0204

Updated Feb 4, 2024 • 15

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v5

Viewer • Updated Feb 4, 2024 • 1.66k • 24

updated a model almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-0202

Updated Feb 2, 2024 • 17

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v4

Viewer • Updated Feb 2, 2024 • 1.66k • 27

updated a model almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-MIX-2000

Updated Jan 30, 2024 • 8

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v3

Viewer • Updated Jan 30, 2024 • 8.87k • 40

updated 3 models almost 2 years ago

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-MIX

Updated Jan 30, 2024 • 23

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8-FIM

Updated Jan 29, 2024 • 6

ZiHDeng/peft-lora-starcoder1B-Instruction-ny8

Updated Jan 28, 2024 • 19

updated a dataset almost 2 years ago

ZiHDeng/hf-ny8-v1

Viewer • Updated Jan 26, 2024 • 7.66k • 25

DavidDeng

AI & ML interests

Recent Activity

Organizations

ZiHDeng's activity

We Got Claude to Fine-Tune an Open Source LLM

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge