Andre3000's Collections

Pre training

updated 6 days ago

  • Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

    Paper • 2401.16380 • Published Jan 29, 2024 • 51

  • OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

    Paper • 2602.05400 • Published 13 days ago • 314

  • The Pile: An 800GB Dataset of Diverse Text for Language Modeling

    Paper • 2101.00027 • Published Dec 31, 2020 • 9