Specifies the output audio format. Must be one of wav
, mp3
, flac
, opus
,
or pcm16
.
The voice the model uses to respond. Supported voices are ash
, ballad
,
coral
, sage
, and verse
(also supported but not recommended are alloy
,
echo
, and shimmer
; these voices are less expressive).
Parameters for audio output. Required when audio output is requested with
modalities: ["audio"]
. Learn more.