Update README.md
Browse files
README.md
CHANGED
|
@@ -12,9 +12,9 @@ language:
|
|
| 12 |
pipeline_tag: text-generation
|
| 13 |
---
|
| 14 |
|
| 15 |
-
# TermiGen-
|
| 16 |
|
| 17 |
-
**TermiGen-
|
| 18 |
|
| 19 |
📄 **Paper:** TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents
|
| 20 |
💻 **Environments:** https://github.com/ucsb-mlsec/terminal-bench-env
|
|
|
|
| 12 |
pipeline_tag: text-generation
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# TermiGen-32B
|
| 16 |
|
| 17 |
+
**TermiGen-32B** achieves **31.3% pass@1** on [TerminalBench 1.0](https://github.com/laude-institute/terminal-bench), establishing a new open-weight state-of-the-art and surpassing proprietary models like o4-mini with Codex CLI (20.0%).
|
| 18 |
|
| 19 |
📄 **Paper:** TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents
|
| 20 |
💻 **Environments:** https://github.com/ucsb-mlsec/terminal-bench-env
|