langchain/tests/unit_tests
Vashisht Madhavan aa439ac2ff
Adding an in-context QA evaluation chain + chain of thought reasoning chain for improved accuracy (#2444)
Right now, eval chains require an answer for every question. It's
cumbersome to collect this ground truth so getting around this issue
with 2 things:

* Adding a context param in `ContextQAEvalChain` and simply evaluating
if the question is answered accurately from context
* Adding chain of though explanation prompting to improve the accuracy
of this w/o GT.

This also gets to feature parity with openai/evals which has the same
contextual eval w/o GT.

TODO in follow-up:
* Better prompt inheritance. No need for seperate prompt for CoT
reasoning. How can we merge them together

---------

Co-authored-by: Vashisht Madhavan <vashishtmadhavan@Vashs-MacBook-Pro.local>
2023-04-06 22:32:41 -07:00
..
agents Feature: AgentExecutor execution time limit (#2399) 2023-04-06 12:54:32 -07:00
callbacks improve AsyncCallbackManager (#2410) 2023-04-05 09:31:42 +02:00
chains Removed duplicate BaseModel dependencies (#2471) 2023-04-06 12:45:16 -07:00
data Prompt from file proof of concept using plain text (#127) 2022-11-13 13:15:30 -08:00
docstore Harrison/document cleanup (#2062) 2023-03-27 16:32:55 -07:00
document_loader rm pandas dependency (#2102) 2023-03-28 08:38:19 -07:00
evaluation Adding an in-context QA evaluation chain + chain of thought reasoning chain for improved accuracy (#2444) 2023-04-06 22:32:41 -07:00
llms Removed duplicate BaseModel dependencies (#2471) 2023-04-06 12:45:16 -07:00
output_parsers PydanticOutputParser unit test (#2047) 2023-03-27 14:32:56 -07:00
prompts big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
tools Add Request body (#2507) 2023-04-06 13:02:42 -07:00
utilities Centralize logic for loading from LangChainHub, add ability to pin dependencies (#805) 2023-01-30 14:52:17 -08:00
__init__.py initial commit 2022-10-24 14:51:15 -07:00
test_bash.py skip unit tests that fail in Windows (#2238) 2023-04-01 12:52:21 -07:00
test_formatting.py initial commit 2022-10-24 14:51:15 -07:00
test_python.py Harrison/multiline commands (#2280) 2023-04-01 12:54:06 -07:00
test_sql_database_schema.py sql: do not hard code the LIMIT clause in the table_info section (#1563) 2023-03-13 23:08:27 -07:00
test_sql_database.py sql: do not hard code the LIMIT clause in the table_info section (#1563) 2023-03-13 23:08:27 -07:00
test_text_splitter.py Harrison/shallow metadata (#1599) 2023-03-11 09:18:25 -08:00