AudioResponseFormat:
    | "json"
    | "text"
    | "srt"
    | "verbose_json"
    | "vtt"

The format of the output, in one of these options: json, text, srt, verbose_json, or vtt.