Sander Schulhoff's picture

6 3 3

Sander Schulhoff

Trigaten

·

https://trigaten.github.io

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

authored a paper about 2 months ago

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

authored a paper about 2 months ago

GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves

View all activity

Organizations

authored 4 papers about 2 months ago

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

Paper • 2312.02405 • Published Dec 5, 2023 • 1

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Paper • 2303.13512 • Published Mar 23, 2023

GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves

Paper • 2407.19110 • Published Jul 26, 2024 • 1

The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections

Paper • 2510.09023 • Published Oct 10 • 9

authored a paper over 1 year ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 68

authored a paper almost 2 years ago

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition

Paper • 2311.16119 • Published Oct 24, 2023 • 2