BenchmarkTaskResult
One scored task.
Attributes
attributetask_idstrattributeinstructionstrattributemodel_outputstrattributeexpectedAnyattributerewardfloatattributemetadatadict= field(default_factory=dict)Functions
func__init__(self, task_id, instruction, model_output, expected, reward, metadata=dict()) -> Noneparamselfparamtask_idstrparaminstructionstrparammodel_outputstrparamexpectedAnyparamrewardfloatparammetadatadict= dict()Returns
None