Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Lukas Galke Poech's picture
11 23

Lukas Galke Poech

lgalke
EvilScript's profile picture namazifard's profile picture
·
https://lgalke.github.io
  • LukasGalke
  • lgalke
  • lukas-galke-8086b0155
  • lukasgalke.bsky.social

AI & ML interests

LLM interpretability, agentic/multi-agent safety

Recent Activity

authored a paper about 12 hours ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
upvoted a paper 1 day ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
authored a paper 6 days ago
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling
View all activity

Organizations

Danish Foundation Models's profile picture MLX Community's profile picture filter with espresso's profile picture RUNE Lab's profile picture Schneider-Kamp Lab's profile picture Machine Ecology Lab's profile picture Inversion Lab for AI Safety's profile picture AI Safety & Interpretability Lab's profile picture

Papers 20

arxiv:2606.10747
arxiv:2606.09707
arxiv:2606.06286
arxiv:2605.31170
View 20 papers

models 1

lgalke/Qwen3.5-35B-A3B-psysafe

Image-Text-to-Text • 36B • Updated Mar 25 • 28

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs