Qwen
/

Qwen3-Coder-Next

Text Generation

Model card Files Files and versions

littlebird13 commited on 15 days ago

Commit

6f98d9d

·

verified ·

1 Parent(s): 144e4ba

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -114,8 +114,8 @@ See [its documentation](https://docs.sglang.ai/get_started/install.html) for mor
 The following command can be used to create an API endpoint at `http://localhost:30000/v1` with maximum context length 256K tokens using tensor parallel on 4 GPUs.
 ```shell
-python -m sglang.launch_server --model Qwen/Qwen3-Coder-Next --tp-size 2 --tool-call-parser qwen3_coder```
 > [!Note]
 > The default context length is 256K. Consider reducing the context length to a smaller value, e.g., `32768`, if the server fails to start.

 The following command can be used to create an API endpoint at `http://localhost:30000/v1` with maximum context length 256K tokens using tensor parallel on 4 GPUs.
 ```shell
+python -m sglang.launch_server --model Qwen/Qwen3-Coder-Next --port 30000 --tp-size 2 --tool-call-parser qwen3_coder```
+```
 > [!Note]
 > The default context length is 256K. Consider reducing the context length to a smaller value, e.g., `32768`, if the server fails to start.