5

JM

JMJM

AI & ML interests

None yet

upvoted an article 2 months ago

Article

Phare LLM benchmark V2: Reasoning models don't guarantee better security

Dec 16, 2025 • 10

upvoted an article 5 months ago

Article

Giskard Bot: Identifying robustness, performance and ethical vulnerabilities in the Top 10 Most Popular Hugging Face Models

Mar 21, 2024 • 2

upvoted an article 7 months ago

Article

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

Jul 2, 2025 • 16

upvoted an article 10 months ago

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

May 7, 2025 • 42

upvoted a paper 10 months ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14, 2025 • 10