Jeremy Haschal
JermemyHaschal
1 follower · 5 following
AI & ML interests
None yet
Recent Activity
reacted to OzTianlu's post 7 days ago
O(1) inference is the foundational design of Spartacus-1B-Instruct 🛡️! https://huggingface.co/NoesisLab/Spartacus-1B-Instruct

We have successfully replaced the KV-cache bottleneck inherent in Softmax Attention with Causal Monoid State Compression. By defining the causal history as a monoid recurrence, the entire prefix is lossily compressed into a fixed-size state matrix per head.

The technical core of this architecture relies on the associativity of the monoid operator:
Training: a parallel prefix scan using Triton-accelerated JIT kernels computes all prefix states simultaneously.
Inference: true sequential updates. Memory and time complexity per token are decoupled from sequence length.
Explicit Causality: we discard RoPE and attention masks. Causality is a first-class citizen, explicitly modeled through learned, content-dependent decay gates.

Current zero-shot benchmarks show that Spartacus-1B-Instruct (1.3B) already outperforms established sub-quadratic models such as Mamba-1.4B and RWKV-6-1.6B on ARC-Challenge (0.3063). Recent integration of structured Chain-of-Thought (CoT) data has further pushed reasoning accuracy to 75%.

The "Spartacus" era is about scaling intelligence, not the memory wall ♾️.
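The post does not spell out the recurrence itself, so the following is only a minimal sketch of the general idea, assuming a hypothetical gated update of the form S_t = a_t * S_{t-1} + k_t v_t^T with a scalar decay gate a_t. The names `combine`, `sequential_states`, `scan_states`, and `make_elements` are illustrative and not taken from the model's code. It shows why associativity matters: the same prefix states can be produced either one token at a time (inference style) or by a doubling prefix scan (training style).

```python
import random

# ASSUMPTION: each token contributes a monoid element (decay, update), where
# `update` is the d x d outer product k v^T. This form is hypothetical; the
# post does not give the actual recurrence used by Spartacus-1B-Instruct.

def outer(k, v):
    """Outer product of two vectors as a nested list."""
    return [[ki * vj for vj in v] for ki in k]

def combine(left, right):
    """Associative operator: `left` covers earlier tokens, `right` later ones."""
    a1, s1 = left
    a2, s2 = right
    state = [[a2 * x + y for x, y in zip(row1, row2)] for row1, row2 in zip(s1, s2)]
    return (a1 * a2, state)

def identity(d):
    """Neutral element: no decay, zero state."""
    return (1.0, [[0.0] * d for _ in range(d)])

def sequential_states(elements, d):
    """Inference style: one fixed-size state update per token."""
    acc = identity(d)
    out = []
    for e in elements:
        acc = combine(acc, e)
        out.append(acc)
    return out

def scan_states(elements):
    """Training style: inclusive prefix scan by recursive doubling.

    Each round is written as a list comprehension here; on parallel hardware
    every position in a round could be computed at once.
    """
    xs = list(elements)
    step = 1
    while step < len(xs):
        xs = [xs[i] if i < step else combine(xs[i - step], xs[i])
              for i in range(len(xs))]
        step *= 2
    return xs

def make_elements(n, d, seed=0):
    """Random (decay, k v^T) elements for demonstration only."""
    rng = random.Random(seed)
    return [(rng.uniform(0.5, 1.0),
             outer([rng.gauss(0, 1) for _ in range(d)],
                   [rng.gauss(0, 1) for _ in range(d)]))
            for _ in range(n)]

def max_abs_diff(m1, m2):
    """Largest elementwise difference between two matrices."""
    return max(abs(x - y) for r1, r2 in zip(m1, m2) for x, y in zip(r1, r2))
```

In this sketch the state stays d x d no matter how many tokens have been folded in, which is the sense in which per-token cost is independent of sequence length. A real implementation would use tensor operations and a GPU scan kernel rather than nested lists.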
new activity 13 days ago in TheDrummer/Rocinante-X-12B-v1-GGUF: Comparison with Rivermind-Lux-12B-v1b?
reacted to Reubencf's post with 🔥 about 1 month ago
📢 New release! World_events Dataset now available, featuring global events spanning 2023 through 2025: https://huggingface.co/collections/Reubencf/world-events 2026 dataset dropping soon.
Organizations
None yet
JermemyHaschal's activity
New activity in
TheDrummer/Rocinante-X-12B-v1-GGUF
13 days ago
Comparison with Rivermind-Lux-12B-v1b?
4
#1 opened 14 days ago by
JermemyHaschal
New activity in
fancyfeast/joy-caption-beta-one
4 months ago
Joytag no longer works
4
#14 opened 4 months ago by
bro123123
New activity in
lodestones/Chroma1-Radiance
5 months ago
ERROR: Could not detect model type of: D:\...\Chroma1-Radiance-v0.1.safetensors
7
#2 opened 6 months ago by
Viennar
New activity in
openbmb/MiniCPM-V-4
7 months ago
HF Space?
10
#4 opened 7 months ago by
JermemyHaschal
New activity in
bosonai/higgs-audio-v2-generation-3B-base
7 months ago
Using tags?
3
#5 opened 7 months ago by
JermemyHaschal
New activity in
unsloth/dots.llm1.inst-GGUF
7 months ago
Please use `--jinja` or else gibberish!
2
1
#2 opened 8 months ago by
danielhanchen
New activity in
concedo/llama-joycaption-beta-one-hf-llava-mmproj-gguf
8 months ago
Use with other model?
2
#3 opened 8 months ago by
JermemyHaschal
New activity in
silveroxides/Chroma-GGUF
8 months ago
Difference between normal and 'detail-calibrated'?
3
3
#13 opened 8 months ago by
JermemyHaschal
New activity in
alamios/Mistral-Small-3.1-DRAFT-0.5B-GGUF
11 months ago
Mistral-Nemo-Instruct-2407 compatibility?
3
#1 opened 11 months ago by
JermemyHaschal
New activity in
TheDrummer/Gemmasutra-9B-v1.1-GGUF
about 1 year ago
Request?
1
#1 opened about 1 year ago by
BlueNipples
New activity in
fishaudio/fish-speech-1.5
about 1 year ago
How to run this?
6
#4 opened about 1 year ago by
JermemyHaschal
New activity in
MaziyarPanahi/calme-2.1-qwen2-72b-GGUF
over 1 year ago
Perplexity loss?
2
#11 opened over 1 year ago by
JermemyHaschal