A schema representing an evaluation run.

interface RunCreateResponse {
    created_at: number;
    data_source: CreateEvalCompletionsRunDataSource | CreateEvalJSONLRunDataSource;
    error: EvalAPIError;
    eval_id: string;
    id: string;
    metadata: null | Metadata;
    model: string;
    name: string;
    object: "eval.run";
    per_model_usage: OpenAIClient.Evals.Runs.RunCreateResponse.PerModelUsage[];
    per_testing_criteria_results: OpenAIClient.Evals.Runs.RunCreateResponse.PerTestingCriteriaResult[];
    report_url: string;
    result_counts: OpenAIClient.Evals.Runs.RunCreateResponse.ResultCounts;
    status: string;
}

Properties

created_at: number

Unix timestamp (in seconds) when the evaluation run was created.

Information about the run's data source.

An object representing an error response from the Eval API.

eval_id: string

The identifier of the associated evaluation.

id: string

Unique identifier for the evaluation run.

metadata: null | Metadata

Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard.

Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.

model: string

The model that is evaluated, if applicable.

name: string

The name of the evaluation run.

object: "eval.run"

The type of the object. Always "eval.run".

Usage statistics for each model during the evaluation run.

Results per testing criteria applied during the evaluation run.

report_url: string

The URL to the rendered evaluation run report on the UI dashboard.

Counters summarizing the outcomes of the evaluation run.

status: string

The status of the evaluation run.