Configuration for the DPO fine-tuning method.

interface DpoMethod {
    hyperparameters?: DpoHyperparameters;
}

Properties

Properties

hyperparameters?: DpoHyperparameters

The hyperparameters used for the DPO fine-tuning job.