RLVF pipeline using parser oracles to align LMs for Icelandic and Danish. GPT-SW3 and Viking-13B trained with Delta-DPO.
Fakhar
Hodfa71
Recent Activity
Updated a dataset about 4 hours ago: omniagentbench/OmniAgentBench
Updated a model about 6 hours ago: Hodfa71/saga-is-356m-kl-sft
Published a model about 6 hours ago: Hodfa71/saga-is-356m-kl-sft