·
AI & ML interests
None yet
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article Phare LLM benchmark V2: Reasoning models don't guarantee better security
view article Giskard Bot: Identifying robustness, performance and ethical vulnerabilities in the Top 10 Most Popular Hugging Face Models
view article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs
view article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs
upvoted a paper 10 months ago