zRzRzRzRzRzRzR
commited on
Commit
·
1525284
1
Parent(s):
ef113d3
update
Browse files
README.md
CHANGED
|
@@ -31,17 +31,13 @@ GLM-OCR is a multimodal OCR model for complex document understanding, built on t
|
|
| 31 |
|
| 32 |
**Key Features**
|
| 33 |
|
| 34 |
-
- State-of-the-Art Performance
|
| 35 |
-
Achieves 94.62 on OmniDocBench V1.5, ranking #1, and delivers SOTA results across major document understanding benchmarks, including formula recognition, table recognition, and information extraction.
|
| 36 |
|
| 37 |
-
- Optimized for Real-World Scenarios
|
| 38 |
-
Specifically optimized for practical business use cases, maintaining stable and accurate performance on complex tables, code documents, seals, and other challenging layouts.
|
| 39 |
|
| 40 |
-
- Efficient Inference
|
| 41 |
-
With only 0.9B parameters, GLM-OCR supports deployment via vLLM and SGLang, significantly reducing inference latency and compute cost—well suited for high-concurrency and edge deployments.
|
| 42 |
|
| 43 |
-
- Easy to Use
|
| 44 |
-
Fully open-sourced with a complete [SDK](https://github.com/zai-org/GLM-OCR) and inference toolchain, enabling one-line invocation and seamless integration into existing systems.
|
| 45 |
|
| 46 |
## Usage
|
| 47 |
|
|
|
|
| 31 |
|
| 32 |
**Key Features**
|
| 33 |
|
| 34 |
+
- **State-of-the-Art Performance**: Achieves a score of 94.62 on OmniDocBench V1.5, ranking #1 overall, and delivers state-of-the-art results across major document understanding benchmarks, including formula recognition, table recognition, and information extraction.
|
|
|
|
| 35 |
|
| 36 |
+
- **Optimized for Real-World Scenarios**: Designed and optimized for practical business use cases, maintaining robust performance on complex tables, code-heavy documents, seals, and other challenging real-world layouts.
|
|
|
|
| 37 |
|
| 38 |
+
- **Efficient Inference**: With only 0.9B parameters, GLM-OCR supports deployment via vLLM, SGLang, and Ollama, significantly reducing inference latency and compute cost, making it ideal for high-concurrency services and edge deployments.
|
|
|
|
| 39 |
|
| 40 |
+
- **Easy to Use**: Fully open-sourced and equipped with a comprehensive [SDK](https://github.com/zai-org/GLM-OCR) and inference toolchain, offering simple installation, one-line invocation, and smooth integration into existing production pipelines.
|
|
|
|
| 41 |
|
| 42 |
## Usage
|
| 43 |
|