Optional
apiOptional
audioParameters for audio output. Required when audio output is requested with
modalities: ["audio"]
.
Learn more.
Optional
cacheOptional
callbackOptional
callbacksOptional
configurationOptional
disableWhether to disable streaming.
If streaming is bypassed, then stream()
will defer to
invoke()
.
Optional
frequencyPenalizes repeated tokens according to frequency
Optional
logitDictionary used to adjust the probability of specific tokens being generated
Optional
logprobsWhether to return log probabilities of the output tokens or not. If true, returns the log probabilities of each output token returned in the content of message.
Optional
maxMaximum number of tokens to generate in the completion. -1 returns as many
tokens as possible given the prompt and the model's maximum context size.
Alias for maxTokens
for reasoning models.
Optional
maxThe maximum number of concurrent calls that can be made.
Defaults to Infinity
, which means no limit.
Optional
maxThe maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional
maxThe maximum number of tokens that the model can process in a single response. This limits ensures computational efficiency and resource management.
Optional
metadataOptional
modalitiesOutput types that you would like the model to generate for this request. Most models are capable of generating text, which is the default:
["text"]
The gpt-4o-audio-preview
model can also be used to
generate audio. To request that
this model generate both text and audio responses, you can use:
["text", "audio"]
Optional
modelThe name of the model to use.
Optional
modelHolds any additional parameters that are valid to pass to openai.createCompletion
that are not explicitly specified on this class.
Optional
modelModel name to use
Alias for model
Optional
nNumber of completions to generate for each prompt
Optional
onCustom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
Optional
openAIApiAPI key to use when making requests to OpenAI. Defaults to the value of
OPENAI_API_KEY
environment variable.
Alias for apiKey
Optional
prefixChatGPT messages to pass as a prefix to the prompt
Optional
presencePenalizes repeated tokens
Optional
reasoningConstrains effort on reasoning for reasoning models. Currently supported values are low, medium, and high. Reducing reasoning effort can result in faster responses and fewer tokens used on reasoning in a response.
Optional
stopUp to 4 sequences where the API will stop generating further tokens. The
returned text will not contain the stop sequence.
Alias for stopSequences
Optional
stopUp to 4 sequences where the API will stop generating further tokens. The returned text will not contain the stop sequence.
Optional
streamWhether or not to include token usage data in streamed chunks.
Optional
streamingWhether or not to stream responses.
Optional
supportsWhether the model supports the strict
argument when passing in tools.
If undefined
the strict
argument will not be passed to OpenAI.
Optional
tagsOptional
temperatureThe temperature to use for sampling.
Optional
timeoutTimeout to use when making requests to OpenAI.
Optional
topAn integer between 0 and 5 specifying the number of most likely tokens to return at each token position, each with an associated log probability. logprobs must be set to true if this parameter is used.
Optional
topPTotal probability mass of tokens to consider at each step
Optional
userUnique string identifier representing your end-user, which can help OpenAI to monitor and detect abuse.
Optional
verbose
The Deepseek API key to use for requests.