EvalArtifacts
Final outputs of an eval run.
Attributes
attributesummaryEvalSummaryattributesummary_strict_primaryEvalSummary | NoneOptional strict variant (e.g. primary-only). None for plain model evals.
attributeper_row_resultslist[dict[str, Any]]Raw per-row, per-query results so downstream tooling can re-score.
Functions
func__init__(self, summary, summary_strict_primary, per_row_results) -> NoneparamselfparamsummaryEvalSummaryparamsummary_strict_primaryEvalSummary | Noneparamper_row_resultslist[dict[str, Any]]Returns
None