Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Yandex Research

company
https://research.yandex.com/
YandexResearch
https://github.com/yandex-research
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

quickjkee  updated a model about 9 hours ago
yresearch/register-guidance
quickjkee  published a model about 10 hours ago
yresearch/register-guidance
fzmushko  submitted a paper 3 days ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
View all activity

Papers

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining

View all Papers

Dmitry Baranchuk's profile picturenikita's profile pictureDenis Kuznedelev's profile pictureIvan Rubachev's profile pictureGleb Bazhenov's profile pictureValerii's profile pictureSergey Kastryulin's profile pictureIlya Drobyshevskiy's profile picture
yresearch 's papers 1
Submitted by
Zmushko Philip
23

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining

yresearch Yandex Research
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs