Evaluation
MLOps
Evaluation
Frames model, prompt, and system evaluation as a reproducible experiment with baselines, datasets, and explicit metrics.
On demandAvailable when invokedBuilt In
Install from Aegis
aegis skills install evaluationBundled with the packaged Aegis CLI as a built-in procedural skill.
Already ships inside the packaged Aegis bundle. Use `aegis skills install evaluation` only when you want an explicit local materialization record.