Safety classifiers fine-tuned on a bilingual dataset composed of the English QA pairs from BeaverTails and the Italian QA pairs from BeaverTails-IT.
Giuseppe Magazzù
saiteki-kai
AI & ML interests
My research focuses on the developement of safety mitigation strategies and benchmarks for large language models.
Recent Activity
liked
a dataset 2 days ago
walledai/XSTest liked
a model 2 days ago
swiss-ai/Apertus-8B-Instruct-2509 liked
a dataset 5 days ago
cais/mmlu