GRPO/PPO Finetunes for Creative Writing
DV
AI & ML interests
Post training @ https://dphn.ai
Recent Activity
updated a model about 16 hours ago: Delta-Vector/distill-m-6a3lnzvb-code
updated a dataset 1 day ago: NewEden/RL-seed-Decensor
published a dataset 1 day ago: NewEden/RL-seed-Decensor
Austral
Got bored - Did a weird tune on harbinger and now there's 10K of these, Models meant for Adventure/RP, Creative and smartz.
Delta-Vector/Austral-70B-Winton
Text Generation • 71B • Updated • 11 • 6
Delta-Vector/Austral-32B-GLM4-Winton
Text Generation • 33B • Updated • 16 • 8
Delta-Vector/MS3.2-Austral-Winton
Text Generation • 24B • Updated • 18 • 12
Delta-Vector/Austral-24B-Winton
Text Generation • 24B • Updated • 89 • 17
Nanuq-R1
GRPO/PPO Finetunes for Creative Writing
models 113
Delta-Vector/distill-m-6a3lnzvb-code
Updated
Delta-Vector/Qwen-ckpt-100
Text Generation • Updated • 304
Delta-Vector/Austral-AFM-SFT
5B • Updated • 23
Delta-Vector/Rei-24B-KTO
Text Generation • 24B • Updated • 23 • 17
Delta-Vector/Dr-House-Evals
Updated
Delta-Vector/Austral-4.5B-Winton
Text Generation • 5B • Updated • 11 • 11
Delta-Vector/Nanuq-R1-9B
Text Generation • 11B • Updated • 9 • 4
Delta-Vector/Nanuq-R1-14B
Text Generation • 14B • Updated • 13 • 3
Delta-Vector/Elenchus
545k • Updated • 5 • 2
Delta-Vector/Austral-32B-GLM4-Winton
Text Generation • 33B • Updated • 16 • 8
datasets 126
Delta-Vector/Tauri-RL-Plaintext-System
Viewer • Updated • 128 • 75
Delta-Vector/Tauri-RL-Markdown-System
Viewer • Updated • 128 • 76
Delta-Vector/Tauri-RL-Styles
Viewer • Updated • 128 • 215
Delta-Vector/Tauri-RL-Styles-V2
Viewer • Updated • 128 • 42
Delta-Vector/CAI-critic-revision-8k-cleaned-sharegpt
Viewer • Updated • 8.1k • 15
Delta-Vector/Ursa-Armored-Core-6-Lore
Viewer • Updated • 166 • 6
Delta-Vector/wordlist
Viewer • Updated • 253 • 10
Delta-Vector/Hydrus-Olmo-3-sft-dedup-ngram-filter-r1
Viewer • Updated • 1.67M • 14
Delta-Vector/Ursa-Armored-Core-Lore-Kimi
Viewer • Updated • 286 • 11
Delta-Vector/Hydrus-Hardcode-Dphn
Viewer • Updated • 220 • 8