Frames model, prompt, and system evaluation as a reproducible experiment with baselines, datasets, and explicit metrics.
Usage and install
- Slash command
/evaluation- Install
aegis skills install evaluation- Reference
- evaluation
- Runtime posture
- Available when invoked