The inference example codes seem to output text only...
Yes, it output text only now, you can use a tts model to generate the audio. We will support think and output audio in step audio 2.5/3.
Β· Sign up or log in to comment