Ephemeral key returned by the API. Only present when the session is created on the server via REST API.
Optionalinput_The format of input audio. Options are pcm16, g711_ulaw, or g711_alaw.
Optionalinput_Configuration of the transcription model.
OptionalmodalitiesThe set of modalities the model can respond with. To disable audio, set this to ["text"].
Optionalturn_Configuration for turn detection. Can be set to null to turn off. Server VAD
means that the model will detect the start and end of speech based on audio
volume and respond at the end of user speech.
A new Realtime transcription session configuration.
When a session is created on the server via REST API, the session object also contains an ephemeral key. Default TTL for keys is 10 minutes. This property is not present when a session is updated via the WebSocket API.