Commit graph

1 commit

Author: Corey Haines
SHA1: 11e9ea811f
Message: feat: add evals for all 29 remaining skills (197 total evals across 32 skills)
Each skill now has 5-8 evals covering:
- Core framework usage with realistic prompts
- Casual trigger phrase variants
- Sub-type and section-specific coverage
- Boundary tests (skill deferral to related skills)
- Structured assertions for grading

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Date: 2026-03-04 13:37:01 -08:00