You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/docs/use_cases/evaluation
CG80499 cfd34e268e
Add ReAct eval chain (#3161)
- Adds GPT-4 eval chain for arbitrary agents using any set of tools
- Adds notebook

---------

Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
1 year ago
..
agent_benchmarking.ipynb Add ReAct eval chain (#3161) 1 year ago
agent_vectordb_sota_pg.ipynb Fix notebook example (#3142) 1 year ago
benchmarking_template.ipynb Harrison/agent eval (#1620) 2 years ago
data_augmented_question_answering.ipynb Typo docs - Update data_augmented_question_answering.ipynb propriterary-> proprietary (#2626) 2 years ago
generic_agent_evaluation.ipynb Add ReAct eval chain (#3161) 1 year ago
huggingface_datasets.ipynb Update huggingface_datasets.ipynb (#1417) 2 years ago
llm_math.ipynb Harrison/llm math (#1808) 2 years ago
openapi_eval.ipynb Harrison/move eval (#2533) 2 years ago
qa_benchmarking_pg.ipynb WIP: Harrison/base retriever (#1765) 2 years ago
qa_benchmarking_sota.ipynb WIP: Harrison/base retriever (#1765) 2 years ago
qa_generation.ipynb Harrison/agent eval (#1620) 2 years ago
question_answering.ipynb fix typo (#2532) 2 years ago
sql_qa_benchmarking_chinook.ipynb Harrison/agent eval (#1620) 2 years ago