langchain/docs/use_cases/evaluation
Harrison Chase d5b4393bb2
Harrison/llm math (#1808)
Co-authored-by: Vadym Barda <vadim.barda@gmail.com>
2023-03-20 07:53:26 -07:00
agent_benchmarking.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
agent_vectordb_sota_pg.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
benchmarking_template.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
data_augmented_question_answering.ipynb Added other evaluation metrics for data-augmented QA (#1521) 2023-03-08 20:41:03 -08:00
huggingface_datasets.ipynb Update huggingface_datasets.ipynb (#1417) 2023-03-04 00:22:31 -08:00
llm_math.ipynb Harrison/llm math (#1808) 2023-03-20 07:53:26 -07:00
qa_benchmarking_pg.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
qa_benchmarking_sota.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
qa_generation.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
question_answering.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00
sql_qa_benchmarking_chinook.ipynb Harrison/agent eval (#1620) 2023-03-14 12:37:48 -07:00