langchain/tests/unit_tests
Vashisht Madhavan aa439ac2ff
Adding an in-context QA evaluation chain + chain of thought reasoning chain for improved accuracy (#2444)
Right now, eval chains require a ground-truth answer for every question.
Collecting this ground truth is cumbersome, so this PR works around the
issue in two ways:

* Adding a context param in `ContextQAEvalChain` and simply evaluating
if the question is answered accurately from context
* Adding chain-of-thought explanation prompting to improve the accuracy
of this evaluation without ground truth (GT).

This also gets to feature parity with openai/evals which has the same
contextual eval w/o GT.
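
The context-based grading described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code from this PR: the prompt templates, the helper names (`parse_grade`, `evaluate_with_context`, `fake_llm`), and the dictionary keys `query`, `context`, and `result` are all assumptions made for the example, and the model is replaced by a stub so the sketch runs without any API access.

```python
from typing import Callable, Dict, List

# Hypothetical prompt templates; the actual prompts in the PR are not shown here.
CONTEXT_TEMPLATE = (
    "You are grading an answer using only the context provided.\n"
    "QUESTION: {query}\n"
    "CONTEXT: {context}\n"
    "STUDENT ANSWER: {result}\n"
    "GRADE (CORRECT or INCORRECT):"
)

# Chain-of-thought variant: ask for an explanation first, grade on the last line.
COT_TEMPLATE = (
    "You are grading an answer using only the context provided.\n"
    "Explain your reasoning step by step, then give the grade on the last line.\n"
    "QUESTION: {query}\n"
    "CONTEXT: {context}\n"
    "STUDENT ANSWER: {result}\n"
    "EXPLANATION:"
)


def parse_grade(text: str) -> str:
    """Return the grade found on the last non-empty line of the model output."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    last = lines[-1].upper() if lines else ""
    # Check INCORRECT first because it contains the substring CORRECT.
    if "INCORRECT" in last:
        return "INCORRECT"
    if "CORRECT" in last:
        return "CORRECT"
    return "UNKNOWN"


def evaluate_with_context(
    llm: Callable[[str], str],
    examples: List[Dict[str, str]],
    predictions: List[Dict[str, str]],
    use_cot: bool = False,
) -> List[str]:
    """Grade each prediction against its context; no ground-truth answer is needed."""
    template = COT_TEMPLATE if use_cot else CONTEXT_TEMPLATE
    grades = []
    for example, prediction in zip(examples, predictions):
        prompt = template.format(
            query=example["query"],
            context=example["context"],
            result=prediction["result"],
        )
        grades.append(parse_grade(llm(prompt)))
    return grades


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model: marks the answer correct if it appears in the context."""
    context = prompt.split("CONTEXT: ")[1].split("\n")[0]
    answer = prompt.split("STUDENT ANSWER: ")[1].split("\n")[0]
    verdict = "CORRECT" if answer.lower() in context.lower() else "INCORRECT"
    return f"The answer was compared against the context.\nGRADE: {verdict}"


examples = [
    {"query": "What is the capital of France?", "context": "Paris is the capital of France."},
    {"query": "What is the capital of France?", "context": "Paris is the capital of France."},
]
predictions = [{"result": "Paris"}, {"result": "Lyon"}]
grades = evaluate_with_context(fake_llm, examples, predictions, use_cot=True)
```

Reading the grade from the last line keeps the same parser usable for both the plain and the chain-of-thought prompt, since the explanation, when present, comes before the verdict.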

TODO in follow-up:
* Better prompt inheritance. No need for a separate prompt for CoT
reasoning. How can we merge them together?

---------

Co-authored-by: Vashisht Madhavan <vashishtmadhavan@Vashs-MacBook-Pro.local>
1 year ago
agents Feature: AgentExecutor execution time limit (#2399) 1 year ago
callbacks improve `AsyncCallbackManager` (#2410) 1 year ago
chains Removed duplicate BaseModel dependencies (#2471) 1 year ago
data Prompt from file proof of concept using plain text (#127) 2 years ago
docstore Harrison/document cleanup (#2062) 1 year ago
document_loader rm pandas dependency (#2102) 1 year ago
evaluation Adding an in-context QA evaluation chain + chain of thought reasoning chain for improved accuracy (#2444) 1 year ago
llms Removed duplicate BaseModel dependencies (#2471) 1 year ago
output_parsers PydanticOutputParser unit test (#2047) 1 year ago
prompts big docs refactor (#1978) 1 year ago
tools Add Request body (#2507) 1 year ago
utilities Centralize logic for loading from LangChainHub, add ability to pin dependencies (#805) 1 year ago
__init__.py initial commit 2 years ago
test_bash.py skip unit tests that fail in Windows (#2238) 1 year ago
test_formatting.py initial commit 2 years ago
test_python.py Harrison/multiline commands (#2280) 1 year ago
test_sql_database.py sql: do not hard code the LIMIT clause in the table_info section (#1563) 1 year ago
test_sql_database_schema.py sql: do not hard code the LIMIT clause in the table_info section (#1563) 1 year ago
test_text_splitter.py Harrison/shallow metadata (#1599) 1 year ago