Filippo Tonini
filo362
AI & ML interests
LLM safety in multi-agent environments
Recent Activity
upvoted a paper about 10 hours ago
PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models upvoted a paper about 10 hours ago
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and UpcyclingOrganizations
None yet