BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks Paper • 2312.02405 • Published Dec 5, 2023 • 1
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition Paper • 2303.13512 • Published Mar 23, 2023
GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves Paper • 2407.19110 • Published Jul 26, 2024 • 1
The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections Paper • 2510.09023 • Published Oct 10 • 9
The Prompt Report: A Systematic Survey of Prompting Techniques Paper • 2406.06608 • Published Jun 6, 2024 • 68
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition Paper • 2311.16119 • Published Oct 24, 2023 • 2