Gabriel Mongaras PRO
gmongaras
Recent Activity
Updated a collection about 5 hours ago: Stuff I'm going to read
Updated a collection about 23 hours ago: Stuff I'm going to read
Updated a collection 1 day ago: Stuff I'm going to read
Stable Diffusion 3 Checkpoints
Collection of checkpoints from the stable diffusion 3 model I am training (https://github.com/gmongaras/Stable-Diffusion-3-From-Scratch)
- gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_13batchsize_stage3
- gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_40batchsize_stage2
- gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_140batchsize_stage1
- gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps_stage2
Stuff I'm going to read
- LTX-2: Efficient Joint Audio-Visual Foundation Model (Paper • 2601.03233 • 141)
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (Paper • 2601.07832 • 51)
- Motion Attribution for Video Generation (Paper • 2601.08828 • 70)
- Post-LayerNorm Is Back: Stable, ExpressivE, and Deep (Paper • 2601.19895 • 15)
Models (26)
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_13batchsize_stage3
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_40batchsize_stage2
gmongaras/t
gmongaras/datav3_attempt5_8GPU_SoftFlash_RoPE2d_2AccSteps_140batchsize_stage1
gmongaras/Llama3.1_8B_Instruct_GRPO_gsm8k
gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps_stage2
gmongaras/datav3_attempt4_8GPU_SoftFlash_RoPE2dV2_2AccSteps
gmongaras/Latent_Diffusion_Model_Imagenet2012_Softmax_250000
gmongaras/Softmax_Attention_BERT • Feature Extraction
gmongaras/Cosine_Attention_BERT • Feature Extraction • 4
Datasets (35)
gmongaras/CC12M_and_Imagenet21K_Recap • Viewer • 22.7M • 1.22k • 7
gmongaras/Imagenet21K_Recaption • Viewer • 13.1M • 249 • 9
gmongaras/ReLaion-10TB
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual_512 • Viewer • 19.8M • 15 • 1
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual_256 • Viewer • 19.8M • 26 • 1
gmongaras/SlimPajama-627B_Reupload • Viewer • 591M • 505 • 1
gmongaras/Amazon-Reviews-2023 • Viewer • 572M • 174
gmongaras/CC12M_and_Imagenet21K_Recap_Highqual • Viewer • 19.8M • 173 • 6
gmongaras/Imagenet21K • Viewer • 13.2M • 32.9k • 6
gmongaras/ImageNet12 • Viewer • 1.28M • 99