Evaluation

A/B Eval Kit

A lightweight way to compare conventional AI product output against work grounded in a Domain Context Brief.

Loading the canonical Markdown source...