arxiv:2509.18058
Evgenii Kortukov
kortukov
AI & ML interests
LLM interpretability, AI safety
Recent Activity
updated a dataset about 8 hours ago
honeypot-redteam/strategic_lies published a dataset about 13 hours ago
honeypot-redteam/strategic_lies updated a dataset 2 days ago
honeypot-redteam/strategic_lies