Alexander Reinthal

reinthal

https://www.reinthal.me

AI & ML interests

Technical AI safety, Jailbreaking, CyberSecurity Red-teaming with Agents, AI Control

Recent Activity

updated a model 7 days ago

claude-warriors/qwen3-32b-reward-hacking-code-inoculated

published a model 7 days ago

claude-warriors/qwen3-32b-reward-hacking-code-inoculated

new activity 15 days ago

FutureLivingLab/iFlow-ROME: Request for clarification about safety incident, crypto mining, etc.

Organizations

reinthal's datasets

None public yet