enable_thinking does not work as expected when using vLLM

#16
by MRU4913 - opened

Here is my `generation_config.json`:

```json
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95,
  "chat_template_kwargs": {"enable_thinking": false},
  "transformers_version": "4.51.0"
}
```

I'm using vLLM for deployment, but the model still outputs `<think>`.
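
For what it's worth, vLLM does not (as far as I know) read `chat_template_kwargs` from `generation_config.json`; the flag has to be supplied per request. A minimal sketch against vLLM's OpenAI-compatible server, assuming a recent vLLM version that accepts `chat_template_kwargs` in the request body (the base URL, API key, and model name below are placeholders):

```python
from openai import OpenAI

# Placeholder endpoint/key for a locally served vLLM OpenAI-compatible API.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="your-served-model",  # placeholder: use the name vLLM serves
    messages=[{"role": "user", "content": "Hello"}],
    temperature=0.6,
    top_p=0.95,
    # chat_template_kwargs is not a standard OpenAI field, so it goes in
    # extra_body; vLLM forwards it to the chat template when rendering.
    extra_body={"chat_template_kwargs": {"enable_thinking": False}},
)
print(response.choices[0].message.content)
```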
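For offline inference, one workaround is to render the prompt yourself before handing it to vLLM. This sketch assumes the model ships a Qwen3-style chat template that understands an `enable_thinking` flag (extra keyword arguments to `apply_chat_template` are forwarded to the template); the model path is a placeholder:

```python
from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

MODEL = "your-model-path"  # placeholder

tokenizer = AutoTokenizer.from_pretrained(MODEL)
# Render the prompt with thinking disabled; enable_thinking is passed
# through to the Jinja chat template.
prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Hello"}],
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,
)

llm = LLM(model=MODEL)
params = SamplingParams(temperature=0.6, top_p=0.95, top_k=20, max_tokens=256)
outputs = llm.generate([prompt], params)
print(outputs[0].outputs[0].text)
```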

MRU4913 changed discussion status to closed
