Itay Itzhak

itay1itzhak

https://itay1itzhak.github.io/

AI & ML interests

NLP & Deep learning

Recent Activity

authored 3 papers about 22 hours ago

ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs

Paper • 2510.00857 • Published Oct 1, 2025 • 1

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 22

Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens

Paper • 2108.11193 • Published Jun 8, 2022

upvoted a paper 29 days ago

Alignment Makes Language Models Normative, Not Descriptive

Paper • 2603.17218 • Published about 1 month ago • 46

upvoted 2 papers about 1 month ago

Motivation in Large Language Models

Paper • 2603.14347 • Published Mar 15 • 17

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published Mar 10 • 75

upvoted 2 papers about 2 months ago

Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality

Paper • 2602.14080 • Published Feb 15 • 21

STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts

Paper • 2602.14265 • Published Feb 15 • 21

upvoted a paper 3 months ago

The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents

Paper • 2601.11496 • Published Jan 16 • 47

upvoted a paper 6 months ago

DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models

Paper • 2510.15015 • Published Oct 16, 2025 • 11

liked a dataset 6 months ago

AdiSimhi/ManagerBench

Viewer • Updated Dec 9, 2025 • 1.3k • 36 • 11

upvoted a paper 7 months ago

Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs

Paper • 2509.22582 • Published Sep 26, 2025 • 12

upvoted 2 papers 8 months ago

CRISP: Persistent Concept Unlearning via Sparse Autoencoders

Paper • 2508.13650 • Published Aug 19, 2025 • 16

Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs

Paper • 2507.07186 • Published Jul 9, 2025 • 3

liked a Space 9 months ago

MIB Leaderboard

Leaderboard for the Mechanistic Interpretability Benchmark

New activity in itay1itzhak/OLMo-Tulu-Seed-1 9 months ago

Update license and add project page link

#1 opened 9 months ago by

New activity in itay1itzhak/OLMo-Tulu-Seed-2 9 months ago

Add project page link to model card

#1 opened 9 months ago by

New activity in itay1itzhak/T5-Tulu-Seed-1 9 months ago

Improve model card: Update license & pipeline tag, add project page

#1 opened 9 months ago by

New activity in itay1itzhak/OLMo-Flan-Seed-1 9 months ago

Add link to project page

#1 opened 9 months ago by

New activity in itay1itzhak/T5-Tulu-Seed-0 9 months ago

Update model card: Refine pipeline tag, license, and add project page

#1 opened 9 months ago by