mirror of https://github.com/hwchase17/langchain
synced 2024-11-10 01:10:59 +00:00
4035a1d234
community: Add source metadata to bedrock retriever response. **Description:** The Bedrock retrieve API returns extra metadata in its response, which the retriever response currently does not include. **Issue:** This change adds the metadata from the Bedrock retrieve API response to the Bedrock retriever in a backward-compatible way. The field was renamed from metadata to sourceMetadata because the term metadata is already used in the Document. This is in sync with the approach in llama-index. **Dependencies:** None. **Tests and docs:** 1. Added unit tests. 2. A notebook already exists and needs no change. 3. Response from end-to-end testing, to confirm backward compatibility: `[Document(page_content='Exoplanets.', metadata={'location': {'s3Location': {'uri': 's3://bucket/file_name.txt'}, 'type': 'S3'}, 'score': 0.46886647, 'source_metadata': {'x-amz-bedrock-kb-source-uri': 's3://bucket/file_name.txt', 'tag': 'space', 'team': 'Nasa', 'year': 1946.0}})]` **Lint and test:** checked (`make format`, `make lint`, `make test`). Co-authored-by: Piyush Jain <piyushjain@duck.com> | ||
---|---|---|
__init__.py | ||
arcee.py | ||
arxiv.py | ||
azure_ai_search.py | ||
bedrock.py | ||
bm25.py | ||
breebs.py | ||
chaindesk.py | ||
chatgpt_plugin_retriever.py | ||
cohere_rag_retriever.py | ||
databerry.py | ||
docarray.py | ||
dria_index.py | ||
elastic_search_bm25.py | ||
embedchain.py | ||
google_cloud_documentai_warehouse.py | ||
google_vertex_ai_search.py | ||
kay.py | ||
kendra.py | ||
knn.py | ||
llama_index.py | ||
metal.py | ||
milvus.py | ||
outline.py | ||
pinecone_hybrid_search.py | ||
pubmed.py | ||
pupmed.py | ||
qdrant_sparse_vector_retriever.py | ||
rememberizer.py | ||
remote_retriever.py | ||
svm.py | ||
tavily_search_api.py | ||
tfidf.py | ||
thirdai_neuraldb.py | ||
vespa_retriever.py | ||
weaviate_hybrid_search.py | ||
web_research.py | ||
wikipedia.py | ||
you.py | ||
zep.py | ||
zilliz.py |
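
The last-commit message above describes passing the Bedrock retrieve API's per-result metadata through to the retriever output under a separate key. The following is a minimal sketch of that mapping, not the code in `bedrock.py`: the `Document` dataclass is a local stand-in for LangChain's class, `results_to_documents` is a hypothetical helper name, and the input shape (`content.text`, `location`, `score`, `metadata`) is assumed from the end-to-end response quoted in the commit message.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Document:
    """Local stand-in for LangChain's Document so the sketch has no dependencies."""

    page_content: str
    metadata: Dict[str, Any] = field(default_factory=dict)


def results_to_documents(results: List[Dict[str, Any]]) -> List[Document]:
    """Hypothetical helper: map retrieve-style results to documents.

    The existing keys (location, score) are kept unchanged, and the
    source's own metadata is added under a separate "source_metadata"
    key, so callers that read only the older keys keep working.
    """
    documents = []
    for result in results:
        metadata: Dict[str, Any] = {
            "location": result.get("location"),
            "score": result.get("score"),
        }
        # Only add the new key when the result actually carries metadata.
        if "metadata" in result:
            metadata["source_metadata"] = result["metadata"]
        documents.append(
            Document(page_content=result["content"]["text"], metadata=metadata)
        )
    return documents


# Sample input assembled from the values quoted in the commit message.
sample_results = [
    {
        "content": {"text": "Exoplanets."},
        "location": {
            "s3Location": {"uri": "s3://bucket/file_name.txt"},
            "type": "S3",
        },
        "score": 0.46886647,
        "metadata": {
            "x-amz-bedrock-kb-source-uri": "s3://bucket/file_name.txt",
            "tag": "space",
            "team": "Nasa",
            "year": 1946.0,
        },
    }
]

docs = results_to_documents(sample_results)
print(docs[0].metadata["source_metadata"]["tag"])
```

Nesting the source's fields under their own key, instead of merging them into the top-level metadata, avoids collisions with `location` and `score` and leaves the older keys where existing callers expect them.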