Optional
apiOptional
cacheOptional
callbackOptional
callbacksOptional
concurrencyOptional
folderIDYandex Cloud Folder ID
Optional
iamYandex Cloud IAM token for service or user account
with the ai.languageModels.user
role.
Optional
maxThe maximum number of concurrent calls that can be made.
Defaults to Infinity
, which means no limit.
Optional
maxThe maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional
maxMaximum limit on the total number of tokens used for both the input prompt and the generated response.
Optional
metadataOptional
modelModel name to use.
Optional
modelURIModel URI to use.
Optional
modelModel version to use.
Optional
onCustom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
Optional
tagsOptional
temperatureWhat sampling temperature to use. Should be a double number between 0 (inclusive) and 1 (inclusive).
Optional
verbose
Yandex Cloud Api Key for service account with the
ai.languageModels.user
role.