sdft
SDFT - self-distillation fine-tuning on the SDK training loop.
All the
composer plumbing lives in
:class:~evsys_sdk.algorithms.base.BaseAlgorithm; SDFT supplies the two
per-algorithm pieces:
- :meth:
setup- parse rows → :class:PromptExampledataset, build the frozen teacher sampling client, and stash a per-step student sampler provider (snapshots the current weights each step so rollouts stay on-policy - cookbook parity without monkey-patchingsdft.train_step). - :meth:
build_batch- on-policy student rollout → teacher topK score → CE distillation Datums. :meth:step_metricsaddstrain/mean_lossfrom the forward-backward output.
attribute__all__= ['SDFT', 'SDFTConfig']