Nikita Kezins
entfane
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated a dataset about 1 hour ago
entfane/violent_eval published a dataset about 1 hour ago
entfane/violent_eval updated a model about 9 hours ago
entfane/gpt2_constitutional_classifier_violence