Interface TranscriptionVerbose

Represents a verbose JSON transcription response returned by the model, based on the provided input.

interface TranscriptionVerbose {
    duration: number;
    language: string;
    segments?: TranscriptionSegment[];
    text: string;
    words?: TranscriptionWord[];
}
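
As a minimal sketch of how a value of this shape might be consumed, the snippet below treats the two optional arrays as empty when they are absent. The interface is re-declared locally so the example is self-contained; `TranscriptionSegment` and `TranscriptionWord` are opaque placeholders here, and `summarize` is a hypothetical helper, not part of the library.

```typescript
// Placeholders only: the real element types are defined elsewhere in the library.
type TranscriptionSegment = Record<string, unknown>;
type TranscriptionWord = Record<string, unknown>;

// Local copy of the documented shape, so the sketch compiles on its own.
interface TranscriptionVerbose {
  duration: number;
  language: string;
  segments?: TranscriptionSegment[];
  text: string;
  words?: TranscriptionWord[];
}

// Hypothetical helper: one-line summary that tolerates missing optional arrays.
function summarize(t: TranscriptionVerbose): string {
  const segmentCount = t.segments?.length ?? 0;
  const wordCount = t.words?.length ?? 0;
  return `${t.language}, ${t.duration}: ${segmentCount} segment(s), ${wordCount} word(s)`;
}

// Made-up sample value with neither optional field present.
const sample: TranscriptionVerbose = {
  duration: 2.5,
  language: "english",
  text: "Hello world.",
};

console.log(summarize(sample));
```

Because `segments` and `words` are optional, optional chaining with a `??` fallback avoids a runtime error when the response carries only the required fields.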

Properties

duration

duration: number

The duration of the input audio.

language

language: string

The language of the input audio.

`Optional` segments

segments?: TranscriptionSegment[]

Segments of the transcribed text and their corresponding details.
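
A rough sketch of rendering segments as timestamped lines is shown below. It assumes each segment exposes numeric `start` and `end` fields and a `text` field; this page does not document those fields, so the local `TranscriptionSegment` type is a stand-in, and `formatSegments` is a hypothetical helper.

```typescript
// Stand-in type: the start, end, and text fields are assumptions, not documented on this page.
interface TranscriptionSegment {
  start: number;
  end: number;
  text: string;
}

// Hypothetical helper: formats each segment as "[start - end] text".
// A missing segments array yields an empty list.
function formatSegments(segments?: TranscriptionSegment[]): string[] {
  return (segments ?? []).map(
    (s) => `[${s.start.toFixed(2)} - ${s.end.toFixed(2)}] ${s.text.trim()}`
  );
}

// Made-up sample data for illustration.
const lines = formatSegments([
  { start: 0, end: 1.5, text: " Hello world." },
  { start: 1.5, end: 3, text: " Goodbye." },
]);

console.log(lines.join("\n"));
```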

text

text: string

The transcribed text.

`Optional` words

words?: TranscriptionWord[]

Extracted words and their corresponding timestamps.
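
One possible use of word-level timestamps is looking up the word being spoken at a given moment. The sketch below assumes each entry has a `word` field plus numeric `start` and `end` timestamps in the same unit as the lookup time; those field names are assumptions, since this page does not list them, and `wordAt` is a hypothetical helper.

```typescript
// Stand-in type: the word, start, and end fields are assumptions, not documented on this page.
interface TranscriptionWord {
  word: string;
  start: number;
  end: number;
}

// Hypothetical helper: returns the word whose [start, end) interval contains `time`,
// or undefined when no word matches or the words array is absent.
function wordAt(
  words: TranscriptionWord[] | undefined,
  time: number
): string | undefined {
  return words?.find((w) => time >= w.start && time < w.end)?.word;
}

// Made-up sample data for illustration.
const sampleWords: TranscriptionWord[] = [
  { word: "Hello", start: 0.0, end: 0.4 },
  { word: "world", start: 0.5, end: 0.9 },
];

console.log(wordAt(sampleWords, 0.6));
```

A linear `find` is enough for short clips; for long transcripts a binary search over the sorted start times would be a reasonable alternative.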