Optional
cacheOptional
callbackOptional
callbacksOptional
chunksExact chunks to emit (can include tool-call deltas)
Optional
disableWhether to disable streaming.
If streaming is bypassed, then stream()
will defer to
invoke()
.
Optional
maxThe maximum number of concurrent calls that can be made.
Defaults to Infinity
, which means no limit.
Optional
maxThe maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional
metadataOptional
onCustom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
Optional
responsesFull AI messages to fall back to when no chunks
supplied
Optional
sleepMilliseconds to pause between fallback char-by-char chunks
Optional
tagsOptional
thrownThrow this error instead of streaming (useful in tests)
Optional
toolHow tool specs are formatted in bindTools
Optional
verbose
Interface for the Constructor-field specific to the Fake Streaming Chat model (all optional because we fill in defaults).