langchain/tests/unit_tests/evaluation
Zander Chase cc60fed3be
Add a Pairwise Comparison Chain (#6703)
Notebook shows preference scoring between two chains and reports wilson
score interval + p value

I think I'll add the option to insert ground truth labels but doesn't
have to be in this PR
2023-06-26 20:47:41 -07:00
..
comparison Add a Pairwise Comparison Chain (#6703) 2023-06-26 20:47:41 -07:00
criteria Update String Evaluator (#6615) 2023-06-26 14:16:14 -07:00
qa Update String Evaluator (#6615) 2023-06-26 14:16:14 -07:00
run_evaluators Update String Evaluator (#6615) 2023-06-26 14:16:14 -07:00
__init__.py