Steven Goldfeather
treehugg3
2 followers · 12 following
AI & ML interests
Messing with LLM weights, LLM alignment techniques
Recent Activity
3 days ago
jukofyork/creative-writing-control-vectors-v3.0:
“The doom lies in yourself, not in your name.”
2 months ago
TIGER-Lab/MMLU-Pro:
Benchmark results feature design issues
Organizations
None yet
treehugg3's activity
New activity in
jukofyork/creative-writing-control-vectors-v3.0
3 days ago
“The doom lies in yourself, not in your name.”
7 reactions
444
#15 opened 7 months ago by
jukofyork
New activity in
TIGER-Lab/MMLU-Pro
2 months ago
Benchmark results feature design issues
3
#42 opened 2 months ago by
treehugg3
New activity in
jukofyork/creative-writing-control-vectors-v3.0
8 months ago
Wur doomed!
566
#14 opened about 1 year ago by
jukofyork
New activity in
nvidia/Nemotron-CC-v2
8 months ago
Access Request
9 reactions
3
#3 opened 8 months ago by
muchanem
New activity in
Arki05/Grok-1-GGUF
8 months ago
``` 🥲 Failed to load model. error loading model: missing tensor 'blk.0.ffn_down_exps.weight' ```
1
#18 opened 10 months ago by
MAGIC000
New activity in
ByteDance-Seed/Seed-OSS-36B-Base-woSyn
8 months ago
What was the underlying training data distribution?
1 reaction
6
#2 opened 8 months ago by
treehugg3
New activity in
mradermacher/model_requests
8 months ago
ByteDance-Seed/Seed-OSS-36B-Instruct
10
#1303 opened 8 months ago by
Poro7
https://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Base-woSyn
1
#1317 opened 8 months ago by
treehugg3
New activity in
ggml-org/gpt-oss-120b-GGUF
8 months ago
What quantization level does this model use?
1
#2 opened 9 months ago by
ernestr
New activity in
nvidia/Nemotron-CC-v2
8 months ago
license may be too restrictive for its purposes
2 reactions
1
#2 opened 8 months ago by
huu-ontocord
Which parts of the dataset are synthetic?
#1 opened 8 months ago by
treehugg3
New activity in
microsoft/Phi-tiny-MoE-instruct
8 months ago
Add library_name and link to code
3
#1 opened 10 months ago by
nielsr
New activity in
AmanPriyanshu/GPT-OSS-20B-MoE-expert-activations
8 months ago
Is there public code available to replicate this dataset?
3
#1 opened 8 months ago by
treehugg3
New activity in
deepseek-ai/DeepSeek-R1-0528
8 months ago
Any plans for 32B/70B distilled models?
8 reactions
5
#83 opened 11 months ago by
NanaBanana22
New activity in
mradermacher/model_requests
8 months ago
https://huggingface.co/Undi95/dbrx-base
13
#1268 opened 8 months ago by
treehugg3
New activity in
Jinx-org/Jinx-gpt-oss-20b
8 months ago
Why did you make this model so good at math, an already uncensored topic?
1
#4 opened 8 months ago by
treehugg3
GGUF quantized model required
9
#1 opened 8 months ago by
treehugg3
New activity in
mradermacher/snowflake-arctic-base-GGUF
8 months ago
Not a base model? Empty-prompting results in instructions
#1 opened 8 months ago by
treehugg3
New activity in
moonshotai/Kimi-K2-Instruct
8 months ago
Please, someone distill this model!
3 reactions
#53 opened 8 months ago by
treehugg3
New activity in
LnL-AI/dbrx-base-tokenizer
8 months ago
config.json embedding size of "vocab_size": 100352 does not match 100277
1
#6 opened 8 months ago by
treehugg3