langchain/tests/integration_tests
Martin Holzhauer 872605a5c5
Add an option to extract more metadata from crawled websites (#4347)
This pr makes it possible to extract more metadata from websites for
later use.

my usecase:
parsing ld+json or microdata from sites and store it as structured data
in the metadata field
2023-05-09 10:18:33 -07:00
..
agent [test] Add integration_test for PandasAgent (#4056) 2023-05-05 14:49:02 -07:00
cache Harrison/redis cache (#3766) 2023-04-28 20:47:18 -07:00
callbacks Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
chains Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
chat_models Check OpenAI model kwargs (#4366) 2023-05-08 16:37:34 -07:00
document_loaders Add an option to extract more metadata from crawled websites (#4347) 2023-05-09 10:18:33 -07:00
embeddings Dev2049/hf emb encode kwargs (#3925) 2023-05-01 20:27:41 -07:00
examples JSON loader (#4067) 2023-05-05 14:48:13 -07:00
llms Check OpenAI model kwargs (#4366) 2023-05-08 16:37:34 -07:00
memory mongodb support for chat history (#4266) 2023-05-08 08:34:05 -07:00
prompts Cleanup integration test dir (#3308) 2023-04-21 09:44:09 -07:00
retrievers added Wikipedia retriever (#4302) 2023-05-09 10:08:39 -07:00
utilities added Wikipedia document loader (#4141) 2023-05-06 09:32:45 -07:00
vectorstores OpenSearch: Add Similarity Search with Score (#4089) 2023-05-08 16:35:21 -07:00
__init__.py
.env.example Change in method name for creating an issue on JIRA (#3307) 2023-04-21 13:01:33 -07:00
conftest.py feat: improve pinecone tests (#2806) 2023-04-13 21:49:31 -07:00
test_document_transformers.py Contextual compression retriever (#2915) 2023-04-20 17:01:14 -07:00
test_nlp_text_splitters.py
test_pdf_pagesplitter.py cleanup: unify 3 different pdf loaders, rename PagedPDFSplitter (#1615) 2023-03-13 23:06:50 -07:00
test_schema.py Callbacks Refactor [base] (#3256) 2023-04-30 11:14:09 -07:00
test_text_splitter.py Fix TextSplitter.from_tiktoken(#4361) 2023-05-08 16:36:38 -07:00