Interface InputAudioTranscription

Configuration for input audio transcription. The client can optionally set the language and prompt for transcription, these offer additional guidance to the transcription service.

interface InputAudioTranscription {
    language?: string;
    model?: "whisper-1" | "gpt-4o-transcribe" | "gpt-4o-mini-transcribe";
    prompt?: string;
}

Index

Properties

language? model? prompt?

Properties

`Optional`language

language?: string

The language of the input audio. Supplying the input language in ISO-639-1 (e.g. en) format will improve accuracy and latency.

`Optional`model

model?: "whisper-1" | "gpt-4o-transcribe" | "gpt-4o-mini-transcribe"

The model to use for transcription, current options are gpt-4o-transcribe, gpt-4o-mini-transcribe, and whisper-1.

`Optional`prompt

prompt?: string

An optional text to guide the model's style or continue a previous audio segment. For whisper-1, the prompt is a list of keywords. For gpt-4o-transcribe models, the prompt is a free text string, for example "expect words related to technology".

Interface InputAudioTranscription

Index

Properties

Properties

`Optional`language

`Optional`model

`Optional`prompt

Settings

On This Page

Interface InputAudioTranscription

Index

Properties

Properties

Optionallanguage

Optionalmodel

Optionalprompt

Settings

On This Page

`Optional`language

`Optional`model

`Optional`prompt