langchain/tests/unit_tests
Jason Fan 8effd90be0
Add new types of document transformers (#7379)
- Description: Add two new document transformers that translates
documents into different languages and converts documents into q&a
format to improve vector search results. Uses OpenAI function calling
via the [doctran](https://github.com/psychic-api/doctran/tree/main)
library.
  - Issue: N/A
  - Dependencies: `doctran = "^0.0.5"`
  - Tag maintainer: @rlancemartin @eyurtsev @hwchase17 
  - Twitter handle: @psychicapi or @jfan001

Notes
- Adheres to the `DocumentTransformer` abstraction set by @dev2049 in
#3182
- refactored `EmbeddingsRedundantFilter` to put it in a file under a new
`document_transformers` module
- Added basic docs for `DocumentInterrogator`, `DocumentTransformer` as
well as the existing `EmbeddingsRedundantFilter`

---------

Co-authored-by: Lance Martin <lance@langchain.dev>
Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-07-12 23:53:30 -04:00
..
agents codespell: workflow, config + some (quite a few) typos fixed (#6785) 2023-07-12 16:20:08 -04:00
callbacks only add handlers if they are new (#7504) 2023-07-12 03:48:29 -04:00
chains Adds a chain around sympy for symbolic math (#6834) 2023-07-12 15:17:32 -04:00
chat_models Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
client Base language model docstrings (#7104) 2023-07-07 16:09:10 -04:00
data
docstore Enable InMemoryDocstore to be constructed without providing a dict (#6976) 2023-07-05 16:56:31 -04:00
document_loaders Document loader for Cube Semantic Layer (#6882) 2023-07-05 15:18:12 -07:00
evaluation codespell: workflow, config + some (quite a few) typos fixed (#6785) 2023-07-12 16:20:08 -04:00
examples codespell: workflow, config + some (quite a few) typos fixed (#6785) 2023-07-12 16:20:08 -04:00
llms Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
load Include placeholder value for all secrets, not just kwargs (#6421) 2023-06-19 15:41:45 +01:00
memory Add ZepMemory; improve ZepChatMessageHistory handling of metadata; Fix bugs (#7444) 2023-07-10 01:53:49 -04:00
output_parsers Re-use Trajectory Evaluator (#7248) 2023-07-06 07:00:24 -07:00
prompts Jinja2 validation changed to issue warnings rather than issuing exceptions. (#7161) 2023-07-05 14:04:29 -04:00
retrievers Add serialized object to retriever start callback (#7074) 2023-07-05 18:04:43 +01:00
tools codespell: workflow, config + some (quite a few) typos fixed (#6785) 2023-07-12 16:20:08 -04:00
utilities Fix graphql tool (#4984) 2023-05-19 15:27:50 -07:00
vectorstores Add maximal relevance search to SKLearnVectorStore (#5430) 2023-05-30 16:13:33 -07:00
__init__.py
conftest.py
test_bash.py Add Mastodon toots loader (#5036) 2023-05-22 16:43:07 -07:00
test_cache.py Unset Cache in Tests (#7362) 2023-07-07 11:05:09 -07:00
test_dependencies.py Docs combine document chain (#6994) 2023-07-04 12:51:04 -06:00
test_document_transformers.py Add new types of document transformers (#7379) 2023-07-12 23:53:30 -04:00
test_formatting.py
test_math_utils.py add get_top_k_cosine_similarity method to get max top k score and index (#5059) 2023-05-22 11:55:48 -07:00
test_pytest_config.py Block sockets for unit-tests (#4803) 2023-05-16 14:41:24 -04:00
test_python.py
test_schema.py Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
test_sql_database_schema.py
test_sql_database.py Fix SQLAlchemy truncating text when it is too big (#5206) 2023-06-01 21:33:31 -04:00
test_sqlalchemy.py unit test sqlalachemy (#7582) 2023-07-12 03:03:16 -04:00
test_text_splitter.py Fix inconsistent behavior of CharacterTextSplitter when changing keep_separator (#7263) 2023-07-06 09:30:03 -04:00
test_utils.py Refac package version check (#7312) 2023-07-07 01:21:53 -04:00