prefix_
Optional
Amount of audio to include before the VAD-detected speech (in milliseconds).
silence_
Optional
Duration of silence (in milliseconds) required to detect the end of speech. With shorter values the model will respond more quickly, but may jump in on short pauses from the user.
threshold
Optional
Sensitivity threshold (0.0 to 1.0) for voice activity detection. A higher threshold requires louder audio to activate the model, and thus might perform better in noisy environments.
Must be set to server_vad to enable manual chunking using server-side VAD.
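To make the interaction between these three values concrete, here is a minimal Python sketch of a toy level-based segmenter. It is not the service's actual VAD algorithm, and every name in it is hypothetical: `VadSettings`, `segment`, `frame_ms`, and the field names `silence_ms` and `prefix_ms` are placeholders chosen for illustration, since the parameter names above are truncated. The only facts taken from the descriptions are the 0.0 to 1.0 threshold range, the millisecond units, and the roles of the three settings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VadSettings:
    """Hypothetical container; field names are placeholders, not API keys."""
    threshold: float  # sensitivity, 0.0 to 1.0
    silence_ms: int   # silence needed to close a segment
    prefix_ms: int    # audio kept before detected speech

    def __post_init__(self):
        if not 0.0 <= self.threshold <= 1.0:
            raise ValueError("threshold must be between 0.0 and 1.0")
        if self.silence_ms < 0 or self.prefix_ms < 0:
            raise ValueError("durations must be non-negative")


def segment(levels, cfg, frame_ms=10):
    """Split per-frame levels into (start, end) frame ranges, end exclusive.

    Toy model: a frame counts as speech when its level >= cfg.threshold.
    """
    silence_frames = max(1, cfg.silence_ms // frame_ms)
    prefix_frames = cfg.prefix_ms // frame_ms
    segments = []
    start = None   # start frame of the open segment, if any
    quiet = 0      # consecutive quiet frames inside the open segment
    last_end = 0   # keeps the prefix from overlapping the previous segment
    for i, level in enumerate(levels):
        if level >= cfg.threshold:
            if start is None:
                start = max(last_end, i - prefix_frames)
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= silence_frames:
                last_end = i - quiet + 1  # one past the last loud frame
                segments.append((start, last_end))
                start, quiet = None, 0
    if start is not None:
        # Stream ended while a segment was open; drop trailing quiet frames.
        segments.append((start, len(levels) - quiet))
    return segments


if __name__ == "__main__":
    levels = [0.1, 0.1, 0.8, 0.9, 0.1, 0.8, 0.1, 0.1, 0.1, 0.9]
    patient = VadSettings(threshold=0.5, silence_ms=20, prefix_ms=10)
    eager = VadSettings(threshold=0.5, silence_ms=10, prefix_ms=10)
    print(segment(levels, patient))  # [(1, 6), (8, 10)]
    print(segment(levels, eager))    # [(1, 4), (4, 6), (8, 10)]
```

In this toy run, the shorter silence window closes a segment at the one-frame pause, which mirrors the trade-off described above: a quicker response at the cost of splitting on short pauses.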