You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/docs/extras/guides/evaluation
William FH 8c73037dff
Simplify eval arg names (#6944)
It'll be easier to switch between these if the names of predictions are
consistent
1 year ago
..
agent_benchmarking.ipynb nit (#6305) 1 year ago
agent_vectordb_sota_pg.ipynb Docs nit (#6350) 1 year ago
benchmarking_template.ipynb Doc refactor (#6300) 1 year ago
comparisons.ipynb Simplify eval arg names (#6944) 1 year ago
criteria_eval_chain.ipynb Permit Constitutional Principles (#6807) 1 year ago
data_augmented_question_answering.ipynb Doc refactor (#6300) 1 year ago
generic_agent_evaluation.ipynb Clean up agent trajectory interface (#6799) 1 year ago
huggingface_datasets.ipynb Doc refactor (#6300) 1 year ago
index.mdx fix eval guide links (#6319) 1 year ago
llm_math.ipynb Doc refactor (#6300) 1 year ago
openapi_eval.ipynb docs/fix links (#6498) 1 year ago
qa_benchmarking_pg.ipynb Doc refactor (#6300) 1 year ago
qa_benchmarking_sota.ipynb Doc refactor (#6300) 1 year ago
qa_generation.ipynb Doc refactor (#6300) 1 year ago
question_answering.ipynb Doc refactor (#6300) 1 year ago
sql_qa_benchmarking_chinook.ipynb Doc refactor (#6300) 1 year ago