EvSys

BenchmarkTaskResult

One scored task.

Attributes

attributetask_idstr
attributeinstructionstr
attributemodel_outputstr
attributeexpectedAny
attributerewardfloat
attributemetadatadict
= field(default_factory=dict)

Functions

func__init__(self, task_id, instruction, model_output, expected, reward, metadata=dict()) -> None
paramself
paramtask_idstr
paraminstructionstr
parammodel_outputstr
paramexpectedAny
paramrewardfloat
parammetadatadict
= dict()

Returns

None

On this page