Renat

u-brixton

upvoted 3 papers 10 months ago

Don't Make Your LLM an Evaluation Benchmark Cheater

Paper • 2311.01964 • Published Nov 3, 2023 • 2

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published Mar 4, 2025 • 33

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published Mar 4, 2025 • 15

upvoted a collection about 1 year ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 26

upvoted 3 collections almost 2 years ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated 18 days ago • 116

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in this space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6, 2025 • 147

upvoted a collection about 2 years ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13, 2024 • 25