6 20 98

Yohan Lee

l-yohai

AI & ML interests

Large Language Models

Recent Activity

liked a model about 9 hours ago

skt/A.X-K1

liked a model 15 days ago

Qwen/Qwen-Image-Layered

liked a dataset 21 days ago

nvidia/Nemotron-Pretraining-Specialized-v1

View all activity

Organizations

upvoted a paper 2 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 119

upvoted 2 papers 3 months ago

τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Paper • 2506.07982 • Published Jun 9, 2025 • 7

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 141

upvoted an article 3 months ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

Oct 1, 2025

•

133

upvoted a paper 8 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

upvoted 5 papers 11 months ago

Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

Paper • 2502.16457 • Published Feb 23, 2025 • 11

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17, 2025 • 46

upvoted an article 11 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.31k

upvoted an article over 1 year ago

Article

How OpenGPT 4o works

Jul 17, 2024

•

upvoted a paper over 1 year ago

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3, 2024 • 50

upvoted 3 papers about 2 years ago

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 119

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 77

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 105

upvoted 4 papers over 2 years ago

Enable Language Models to Implicitly Learn Self-Improvement From Data

Paper • 2310.00898 • Published Oct 2, 2023 • 23

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 36

Teach LLMs to Personalize -- An Approach inspired by Writing Education

Paper • 2308.07968 • Published Aug 15, 2023 • 26

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 25

Yohan Lee

AI & ML interests

Recent Activity

Organizations

l-yohai's activity

Introducing RTEB: A New Standard for Retrieval Evaluation

Open-source DeepResearch – Freeing our search agents

How OpenGPT 4o works