AI & ML interests
Safe A(G)I
Organizations
skrishna/smolm-toxicity-classifier
Text Classification
• 0.1B • Updated • 3
skrishna/sft-ref-policy-copy
Text Generation
• 0.1B • Updated
Text Generation
• 0.1B • Updated
skrishna/gpt2-toxicity-classifier
Updated
skrishna/gpt2-fineweb-soap-20250422_112211
Text Generation
• 0.1B • Updated
skrishna/gpt2-fineweb-20250421_194111-64
Text Generation
• 0.1B • Updated • 1
skrishna/ethicsU-llama3-8b-w2s
Updated
skrishna/ethicsU-gptxl-weak2
Updated
skrishna/ethicsU-gptxl-weak
Updated
skrishna/gpt2-hellaswag-weak
Text Classification
• 0.1B • Updated • 1
skrishna/llama3-8b-hellaswag
Text Generation
• 8B • Updated • 4
skrishna/w2s_llama3-boolq
skrishna/finetuned_model_gpt2
Text Generation
• Updated
skrishna/pythia-160m-toxicity-model
Text Classification
• 0.1B • Updated • 4
skrishna/pythia-410m-toxicity-model
Text Classification
• Updated • 1
skrishna/pythia-160m-toxic-model
Updated
skrishna/pythia-410mn-ntoxic
Text Classification
• Updated • 3
skrishna/finetuned_toxicity_410_model
Updated
skrishna/pythia-70m-toxicity-model
Text Classification
• Updated • 1
skrishna/pythia-160m-non-toxic
Text Generation
• Updated • 5
skrishna/pythia-160mn-ntoxic
Updated
skrishna/reward_model_irl
Updated
skrishna/pythia-70m-non-toxic
Text Generation
• Updated • 3
skrishna/pythia-410m-non-toxic
Text Generation
• Updated • 2
Text Generation
• Updated • 2
skrishna/eleuther-pythia6.9b-hh-dpo
Text Generation
• Updated • 2
skrishna/eleuther-pythia6.9b-hh-sft
Text Generation
• Updated • 3
skrishna/roberta-hate-speech-dynabench-r4-target
Text Generation
• Updated • 4