Qwen3-0.6B-html-aggressive

🎯 HTML-optimized | πŸ“¦ Aggressive pruning | ⚑ 1% weights pruned

This model is a aggressively pruned version of Qwen/Qwen3-0.6B.

Performance Comparison

Category Original Pruned Change
Python 0.0% 0.0% β†’
Html 0.0% 0.0% ⭐ β†’
Trivia 28.6% 28.6% β†’
Math 33.3% 26.7% ↓ 6.7%
Reasoning 20.0% 20.0% β†’
Medical 66.7% 66.7% β†’
Linux 40.0% 40.0% β†’
Writing 16.7% 16.7% β†’

Average: 25.7% β†’ 24.8% (-0.8%)

Comparison Graph

Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen3-0.6B-html-aggressive")
tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen3-0.6B-html-aggressive")

inputs = tokenizer("Your prompt here", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Technical Details

Property Value
Base Model Qwen/Qwen3-0.6B
Specialization Html
Prune Mode Aggressive
Weight Reduction 1% weights pruned

License

This model inherits the license from the base model.

Downloads last month
10
Safetensors
Model size
0.8B params
Tensor type
F16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for CompactAI/Qwen3-0.6B-html-aggressive

Finetuned
Qwen/Qwen3-0.6B
Finetuned
(632)
this model

Collection including CompactAI/Qwen3-0.6B-html-aggressive