Question

#2
by Enderchef - opened

I really like your models! I have an idea that might improve them a lot. It's just a theory and I don't want to waste your time with it, so:
I have a GPU; how can I run a post-train on this model? How long did your post-training take on your RTX 5090?
If it turns out well, I'll show you the results πŸ™ƒ

Glad you're interested!
Currently I'm working on a way for anyone to contribute (related to the poll on the website), but it seems like it's going to take a while 😓
This one took around a day to train fully on a 5090.
Pre-training was 18 hours, post-training was 3.
Currently I don't have an open-source platform for fine-tuning these models, but some platforms may support it.
If you want to contribute to CompactAI, we can discuss on Discord (if you're open to that).
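For anyone else reading who wants to try post-training on their own GPU: there's no official recipe in this thread, but the basic shape of a supervised fine-tuning loop looks roughly like the sketch below. The model, vocab size, and data here are toy stand-ins (a randomly initialized one-layer LM, not CompactAI's actual checkpoint or pipeline); a real run would load a pretrained checkpoint and a chat dataset instead.

```python
# Toy sketch of a post-training (supervised fine-tuning) loop in PyTorch.
# Everything here is a stand-in: TinyLM, VOCAB, and the fake dataset are
# hypothetical, chosen only to show the loop structure.
import torch
import torch.nn as nn

torch.manual_seed(0)
VOCAB = 64  # hypothetical vocab size


class TinyLM(nn.Module):
    """One-token-of-context LM: embed the current token, predict the next."""

    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, 32)
        self.head = nn.Linear(32, VOCAB)

    def forward(self, x):
        return self.head(self.embed(x))


model = TinyLM()

# Fake "dataset": pairs of (current token, next token).
inputs = torch.randint(0, VOCAB, (256,))
targets = torch.roll(inputs, -1)

opt = torch.optim.AdamW(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

first_loss = None
for step in range(50):
    logits = model(inputs)          # (256, VOCAB) next-token logits
    loss = loss_fn(logits, targets)
    opt.zero_grad()
    loss.backward()
    opt.step()
    if first_loss is None:
        first_loss = loss.item()

print(f"loss: {first_loss:.3f} -> {loss.item():.3f}")
```

Swapping the toy model for a pretrained checkpoint (and the fake pairs for real instruction data) is what turns this from a sketch into actual post-training; the loop itself stays the same.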

I'm building a training pipeline for my idea right now; I'll tell you if anything interesting comes out of it!
(I'm experimenting with RL and CoT, and looking into Effort like Opus has. I'm running the RL pipeline right now.)

Just wait for the next Haiku; from testing, it sometimes knows what you're talking about.
(Sonnet & Opus are waiting for the contributor hardware-pooling app.)

Can I join the pool for an hour or two?

Sadly, we don't actually have any infra built for it (only a sick frontend :P).
If you know how to code complex apps, we would be happy to let you contribute.
