Configuration of the transcription model.

interface InputAudioTranscription {
    language?: string;
    model?: "whisper-1" | "gpt-4o-transcribe" | "gpt-4o-mini-transcribe";
    prompt?: string;
}

Properties

language?: string

The language of the input audio. Supplying the input language in ISO-639-1 (e.g. en) format will improve accuracy and latency.

model?: "whisper-1" | "gpt-4o-transcribe" | "gpt-4o-mini-transcribe"

The model to use for transcription. Can be gpt-4o-transcribe, gpt-4o-mini-transcribe, or whisper-1.

prompt?: string

An optional text to guide the model's style or continue a previous audio segment. The prompt should match the audio language.