Atsuki Yamaguchi's picture

Atsuki Yamaguchi

atsuki-yamaguchi

·

https://gucci-j.github.io/about/

AI & ML interests

Natural Language Processing

Recent Activity

published a model about 17 hours ago

ssu-project/OLMo-2-1124-13B-Instruct-ig-fft

updated a model about 17 hours ago

ssu-project/OLMo-2-1124-13B-Instruct-ig-fft

published a model about 17 hours ago

ssu-project/OLMo-2-1124-13B-Instruct-ig-gmt

View all activity

Organizations

upvoted a paper 1 day ago

Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates

Paper • 2512.04844 • Published 2 days ago • 2

upvoted a paper about 2 months ago

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling

Paper • 2510.11602 • Published Oct 13 • 14

upvoted 3 papers 3 months ago

IntrEx: A Dataset for Modeling Engagement in Educational Conversations

Paper • 2509.06652 • Published Sep 8 • 24

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models

Paper • 2507.11882 • Published Jul 16 • 1

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27 • 33

upvoted a paper 4 months ago

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10 • 98

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666