arxiv:2509.18058
Evgenii Kortukov
kortukov
AI & ML interests
LLM interpretability, AI safety
Recent Activity
published a dataset 5 minutes ago
honeypot-redteam/strategic_lies updated a dataset 2 days ago
honeypot-redteam/strategic_lies published a model 4 months ago
ISTA-MLCV/Qwen2.5-7B_ise