SGLang deploy commands

by vvekthkr - opened 4 days ago

4 days ago

Could you please share recommended SGLang deploy commands, I currently use a rtx 5090 and a pro 6000. If all goes well, I might jump from a 4B model to 8B model with data-parallel pipeline of 2.

bflhc

Octen-Team org 4 days ago

I’m not sure whether sglang supports deployment for this yet, but we’ve used vLLM and it does work.

You can refer to this example for details: https://huggingface.co/Qwen/Qwen3-Embedding-8B#vllm-usage

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment