Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Salman Rahman's picture
4 22

Salman Rahman PRO

salmannyu
Tasninmitu's profile picture
·
https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

Recent Activity

upvoted a collection 8 days ago
rlvr-weak-supervision
upvoted a paper 15 days ago
When Can LLMs Learn to Reason with Weak Supervision?
submitted a paper 15 days ago
When Can LLMs Learn to Reason with Weak Supervision?
View all activity

Organizations

Misinformation, AI & Responsible Society (MARS) Lab's profile picture New York University's profile picture ChessLLM-NYU's profile picture Pavel's Lab's profile picture

Papers 4

arxiv:2504.13203
arxiv:2504.07830
arxiv:2402.10965
arxiv:2401.14539

spaces 1

pinned
Sleeping

Argilla Space

✍

Oct 30, 2024

models 23

salmannyu/llama_base_thinking_sft_noisy_reward_0_9

Updated 20 days ago

salmannyu/llama_base_thinking_sft_majority_vote_math_1024_sample_8k

Updated 24 days ago

salmannyu/mid_train_llama_52b_thinking_data_effect_math_8_sample

Updated Mar 30

salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.7_sample

Updated Mar 30

salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.9_sample

Updated Mar 30

salmannyu/mid_train_llama_52b_thinking_majority_vote_math_1024_sample

Updated Mar 30

salmannyu/mid_train_llama_52b_thinking_data_effect_math_2048_sample

Updated Mar 30

salmannyu/data_effect_scp_do_llama_3b_2048_sample

Updated Mar 30

salmannyu/data_effect_scp_do_llama_3b_8_sample

Updated Mar 30

salmannyu/data_effect_math_do_llama_3b_8_sample

Updated Mar 30
View 23 models

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs