Interface GoogleAIBaseLanguageModelCallOptions

The params which can be passed to the API at request time.

interface GoogleAIBaseLanguageModelCallOptions {
    allowed_function_names?: string[];
    cachedContent?: string;
    callbacks?: Callbacks;
    configurable?: Record<string, any>;
    convertSystemMessageToHumanContent?: boolean;
    frequencyPenalty?: number;
    labels?: Record<string, string>;
    logprobs?: boolean;
    ls_structured_output_format?: {
        kwargs: {
            method: string;
        };
        schema?: JsonSchema7Type;
    };
    maxConcurrency?: number;
    maxOutputTokens?: number;
    maxReasoningTokens?: number;
    metadata?: Record<string, unknown>;
    model?: string;
    modelName?: string;
    presencePenalty?: number;
    reasoningEffort?: "low" | "medium" | "high";
    recursionLimit?: number;
    responseMimeType?: GoogleAIResponseMimeType;
    responseModalities?: string[];
    runId?: string;
    runName?: string;
    safetyHandler?: GoogleAISafetyHandler;
    safetySettings?: GoogleAISafetySetting[];
    seed?: number;
    signal?: AbortSignal;
    speechConfig?: GoogleSpeechConfig | GoogleSpeechConfigSimplified;
    stop?: string[];
    stopSequences?: string[];
    streamUsage?: boolean;
    streaming?: boolean;
    tags?: string[];
    temperature?: number;
    thinkingBudget?: number;
    timeout?: number;
    tool_choice?: ToolChoice;
    tools?: GoogleAIToolType[];
    topK?: number;
    topLogprobs?: number;
    topP?: number;
}

Hierarchy (view full)

BaseChatModelCallOptions
GoogleAIModelRequestParams
GoogleAISafetyParams
- GoogleAIBaseLanguageModelCallOptions

Properties

`Optional`allowed_function_names

allowed_function_names?: string[]

Allowed functions to call when the mode is "any". If empty, any one of the provided functions are called.

`Optional`cachedContent

cachedContent?: string

Used to specify a previously created context cache to use with generation. For Vertex, this should be of the form: "projects/PROJECT_NUMBER/locations/LOCATION/cachedContents/CACHE_ID",

See these guides for more information on how to use context caching: https://cloud.google.com/vertex-ai/generative-ai/docs/context-cache/context-cache-create https://cloud.google.com/vertex-ai/generative-ai/docs/context-cache/context-cache-use

`Optional`callbacks

callbacks?: Callbacks

Callbacks for this call and any sub-calls (eg. a Chain calling an LLM). Tags are passed to all callbacks, metadata is passed to handle*Start callbacks.

`Optional`configurable

configurable?: Record<string, any>

Runtime values for attributes previously made configurable on this Runnable, or sub-Runnables.

`Optional`convertSystemMessageToHumanContent

convertSystemMessageToHumanContent?: boolean

`Optional`frequencyPenalty

frequencyPenalty?: number

Frequency penalty applied to the next token's logprobs, multiplied by the number of times each token has been seen in the respponse so far. A positive penalty will discourage the use of tokens that have already been used, proportional to the number of times the token has been used: The more a token is used, the more dificult it is for the model to use that token again increasing the vocabulary of responses. Caution: A negative penalty will encourage the model to reuse tokens proportional to the number of times the token has been used. Small negative values will reduce the vocabulary of a response. Larger negative values will cause the model to start repeating a common token until it hits the maxOutputTokens limit.

`Optional`labels

labels?: Record<string, string>

Custom metadata labels to associate with the request. Only supported on Vertex AI (Google Cloud Platform). Labels are key-value pairs where both keys and values must be strings.

Example:

{
  labels: {
    "team": "research",
    "component": "frontend",
    "environment": "production"
  }
}

`Optional`logprobs

logprobs?: boolean

Whether to return log probabilities of the output tokens or not. If true, returns the log probabilities of each output token returned in the content of message.

`Optional`ls_structured_output_format

ls_structured_output_format?: {
    kwargs: {
        method: string;
    };
    schema?: JsonSchema7Type;
}

Describes the format of structured outputs. This should be provided if an output is considered to be structured

Type declaration

kwargs: {
method: string;
}
An object containing the method used for structured output (e.g., "jsonMode").
- method: string
Optionalschema?: JsonSchema7Type
The JSON schema describing the expected output structure.

`Optional`maxConcurrency

maxConcurrency?: number

Maximum number of parallel calls to make.

`Optional`maxOutputTokens

maxOutputTokens?: number

Maximum number of tokens to generate in the completion. This may include reasoning tokens (for backwards compatibility).

`Optional`maxReasoningTokens

maxReasoningTokens?: number

The maximum number of the output tokens that will be used for the "thinking" or "reasoning" stages.

`Optional`metadata

metadata?: Record<string, unknown>

Metadata for this call and any sub-calls (eg. a Chain calling an LLM). Keys should be strings, values should be JSON-serializable.

`Optional`model

model?: string

Model to use

`Optional`modelName

modelName?: string

Model to use Alias for model

`Optional`presencePenalty

presencePenalty?: number

Presence penalty applied to the next token's logprobs if the token has already been seen in the response. This penalty is binary on/off and not dependant on the number of times the token is used (after the first). Use frequencyPenalty for a penalty that increases with each use. A positive penalty will discourage the use of tokens that have already been used in the response, increasing the vocabulary. A negative penalty will encourage the use of tokens that have already been used in the response, decreasing the vocabulary.

`Optional`reasoningEffort

reasoningEffort?: "low" | "medium" | "high"

An OpenAI compatible parameter that will map to "maxReasoningTokens"

`Optional`recursionLimit

recursionLimit?: number

Maximum number of times a call can recurse. If not provided, defaults to 25.

`Optional`responseMimeType

responseMimeType?: GoogleAIResponseMimeType

Available for gemini-1.5-pro. The output format of the generated candidate text. Supported MIME types:

text/plain: Text output.
application/json: JSON response in the candidates.

Default

"text/plain"

`Optional`responseModalities

responseModalities?: string[]

The modalities of the response.

`Optional`runId

runId?: string

Unique identifier for the tracer run for this call. If not provided, a new UUID will be generated.

`Optional`runName

runName?: string

Name for the tracer run for this call. Defaults to the name of the class.

`Optional`safetyHandler

safetyHandler?: GoogleAISafetyHandler

`Optional`safetySettings

safetySettings?: GoogleAISafetySetting[]

`Optional`seed

seed?: number

Seed used in decoding. If not set, the request uses a randomly generated seed.

`Optional`signal

signal?: AbortSignal

Abort signal for this call. If provided, the call will be aborted when the signal is aborted.

See

https://developer.mozilla.org/en-US/docs/Web/API/AbortSignal

`Optional`speechConfig

speechConfig?: GoogleSpeechConfig | GoogleSpeechConfigSimplified

Speech generation configuration. You can use either Google's definition of the speech configuration, or a simplified version we've defined (which can be as simple as the name of a pre-defined voice).

`Optional`stop

stop?: string[]

Stop tokens to use for this call. If not provided, the default stop tokens for the model will be used.

`Optional`stopSequences

stopSequences?: string[]

`Optional`streamUsage

streamUsage?: boolean

Whether or not to include usage data, like token counts in the streamed response chunks.

Default

true

`Optional`streaming

streaming?: boolean

Whether or not to stream.

Default

false

`Optional`tags

tags?: string[]

Tags for this call and any sub-calls (eg. a Chain calling an LLM). You can use these to filter calls.

`Optional`temperature

temperature?: number

Sampling temperature to use

`Optional`thinkingBudget

thinkingBudget?: number

An alias for "maxReasoningTokens"

`Optional`timeout

timeout?: number

Timeout for this call in milliseconds.

`Optional`tool_choice

tool_choice?: ToolChoice

Specifies how the chat model should use tools.

Default

undefined

Possible values:
- "auto": The model may choose to use any of the provided tools, or none.
- "any": The model must use one of the provided tools.
- "none": The model must not use any tools.
- A string (not "auto", "any", or "none"): The name of a specific tool the model must use.
- An object: A custom schema specifying tool choice parameters. Specific to the provider.

Note: Not all providers support tool_choice. An error will be thrown
if used with an unsupported model.

`Optional`tools

tools?: GoogleAIToolType[]

`Optional`topK

topK?: number

Top-k changes how the model selects tokens for output.

A top-k of 1 means the selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-k of 3 means that the next token is selected from among the 3 most probable tokens (using temperature).

`Optional`topLogprobs

topLogprobs?: number

An integer between 0 and 5 specifying the number of most likely tokens to return at each token position, each with an associated log probability. logprobs must be set to true if this parameter is used.

`Optional`topP

topP?: number

Top-p changes how the model selects tokens for output.

Tokens are selected from most probable to least until the sum of their probabilities equals the top-p value.

For example, if tokens A, B, and C have a probability of .3, .2, and .1 and the top-p value is .5, then the model will select either A or B as the next token (using temperature).

Interface GoogleAIBaseLanguageModelCallOptions

Hierarchy (view full)

Index

Properties

Properties

Optionalallowed_function_names

OptionalcachedContent

Optionalcallbacks

Optionalconfigurable

OptionalconvertSystemMessageToHumanContent

OptionalfrequencyPenalty

Optionallabels

Optionallogprobs

Optionalls_structured_output_format

Type declaration

kwargs: { method: string; }

method: string

Optionalschema?: JsonSchema7Type

OptionalmaxConcurrency

OptionalmaxOutputTokens

OptionalmaxReasoningTokens

Optionalmetadata

Optionalmodel

OptionalmodelName

OptionalpresencePenalty

OptionalreasoningEffort

OptionalrecursionLimit

OptionalresponseMimeType

Default

OptionalresponseModalities

OptionalrunId

OptionalrunName

OptionalsafetyHandler

OptionalsafetySettings

Optionalseed

Optionalsignal

See

OptionalspeechConfig

Optionalstop

OptionalstopSequences

OptionalstreamUsage

Default

Optionalstreaming

Default

Optionaltags

Optionaltemperature

OptionalthinkingBudget

Optionaltimeout

Optionaltool_choice

Default

Optionaltools

OptionaltopK

OptionaltopLogprobs

OptionaltopP

Settings

On This Page

`Optional`allowed_function_names

`Optional`cachedContent

`Optional`callbacks

`Optional`configurable

`Optional`convertSystemMessageToHumanContent

`Optional`frequencyPenalty

`Optional`labels

`Optional`logprobs

`Optional`ls_structured_output_format

kwargs: {
method: string;
}

`Optional`schema?: JsonSchema7Type

`Optional`maxConcurrency

`Optional`maxOutputTokens

`Optional`maxReasoningTokens

`Optional`metadata

`Optional`model

`Optional`modelName

`Optional`presencePenalty

`Optional`reasoningEffort

`Optional`recursionLimit

`Optional`responseMimeType

`Optional`responseModalities

`Optional`runId

`Optional`runName

`Optional`safetyHandler

`Optional`safetySettings

`Optional`seed

`Optional`signal

`Optional`speechConfig

`Optional`stop

`Optional`stopSequences

`Optional`streamUsage

`Optional`streaming

`Optional`tags

`Optional`temperature

`Optional`thinkingBudget

`Optional`timeout

`Optional`tool_choice

`Optional`tools

`Optional`topK

`Optional`topLogprobs

`Optional`topP