langchain/docs/extras/guides/evaluation
William FH a673a51efa
[Breaking] Update Evaluation Functionality (#7388)
- Migrate from deprecated langchainplus_sdk to `langsmith` package
- Update the `run_on_dataset()` API to use an eval config
- Update a number of evaluators, as well as the loading logic
- Update docstrings / reference docs
- Update tracer to share single HTTP session
2023-07-13 02:13:06 -07:00
..
agent_benchmarking.ipynb nit (#6305) 2023-06-16 16:21:27 -07:00
agent_vectordb_sota_pg.ipynb Docs nit (#6350) 2023-06-18 20:58:12 -07:00
benchmarking_template.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
comparisons.ipynb Fix make docs_build and related scripts (#7276) 2023-07-11 22:05:14 -04:00
criteria_eval_chain.ipynb [Breaking] Update Evaluation Functionality (#7388) 2023-07-13 02:13:06 -07:00
data_augmented_question_answering.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
generic_agent_evaluation.ipynb Fix make docs_build and related scripts (#7276) 2023-07-11 22:05:14 -04:00
huggingface_datasets.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
index.mdx fix eval guide links (#6319) 2023-06-16 17:53:46 -07:00
langsmith.ipynb [Breaking] Update Evaluation Functionality (#7388) 2023-07-13 02:13:06 -07:00
llm_math.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
openapi_eval.ipynb Polish reference docs (#7045) 2023-07-02 08:08:51 -06:00
qa_benchmarking_pg.ipynb [Document fix] Fix an expired link qa_benchmarking_pg.ipynb (#7110) 2023-07-03 19:03:16 -06:00
qa_benchmarking_sota.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
qa_generation.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
question_answering.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
sql_qa_benchmarking_chinook.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00