EvSys

RL

Attributes

attributenamestr
= 'rl'
attributeConfigtype
= RLConfig

Functions

func_check_inputs(self, ctx) -> None
paramself
paramctxRunContext

Returns

None
funcsetup(self, ctx, backend) -> None
paramself
paramctxRunContext
parambackendTinkerBackend

Returns

None
funcbuild_batch(self, step_idx) -> TrainingBatch
paramself
paramstep_idxint

Returns

evsys_sdk.training.loop.TrainingBatch
func_hyperparams_extra(self) -> dict[str, Any]
paramself

Returns

dict[str, typing.Any]
func_prep_task(self, t) -> HarborTask

Template the instruction + fill the verifier fn_name fallback, so the materialized harbor task is self-contained.

paramself
paramtHarborTask

Returns

evsys_sdk.data_types.HarborTask
func_slice(self, step_idx) -> list[HarborTask]
paramself
paramstep_idxint

Returns

list[evsys_sdk.data_types.HarborTask]

On this page