langchain/libs/community/langchain_community
Stefano Lottini 328d0c99f2
community[minor]: Add support for metadata indexing policy in Cassandra vector store (#22548)
This PR adds a constructor `metadata_indexing` parameter to the
Cassandra vector store to allow optional fine-tuning of which fields of
the metadata are to be indexed.

This is a feature supported by the underlying CassIO library. Indexing
mode of "all", "none" or deny- and allow-list based choices are
available.

The rationale is, in some cases it's advisable to programmatically
exclude some portions of the metadata from the index if one knows in
advance they won't ever be used at search-time. this keeps the index
more lightweight and performant and avoids limitations on the length of
_indexed_ strings.

I added a integration test of the feature. I also added the possibility
of running the integration test with Cassandra on an arbitrary IP
address (e.g. Dockerized), via
`CASSANDRA_CONTACT_POINTS=10.1.1.5,10.1.1.6 poetry run pytest [...]` or
similar.

While I was at it, I added a line to the `.gitignore` since the mypy
_test_ cache was not ignored yet.

My X (Twitter) handle: @rsprrs.
2024-06-05 11:23:26 -04:00
..
adapters
agent_toolkits community[patch]: Fix remaining __inits__ in community (#22037) 2024-05-22 17:42:17 +00:00
agents docs : Update docstrings for OpenAI base.py (#22221) 2024-06-04 17:24:17 +00:00
callbacks community, docs: update token usage tracking callback + how-to guides (#22145) 2024-05-29 17:00:47 -04:00
chains community[minor]: Enable retrieval api calls in PebbloRetrievalQA (#21958) 2024-06-04 10:18:50 -07:00
chat_loaders infra: rm unused # noqa violations (#22049) 2024-05-22 15:21:08 -07:00
chat_message_histories community[minor]: Add native async support to SQLChatMessageHistory (#22065) 2024-06-05 15:10:38 +00:00
chat_models community[minor]: Add tools calls to ChatEdenAI (#22320) 2024-06-04 10:29:28 -07:00
cross_encoders
docstore community[patch]: Fix remaining __inits__ in community (#22037) 2024-05-22 17:42:17 +00:00
document_compressors community[patch]: Update OpenVINO embedding and reranker to support static input shape (#22171) 2024-06-03 13:27:17 -07:00
document_loaders community[minor]: add user agent for web scraping loaders (#22480) 2024-06-05 15:20:34 +00:00
document_transformers infra: rm unused # noqa violations (#22049) 2024-05-22 15:21:08 -07:00
embeddings community[patch]: add detailed paragraph and example for BaichuanTextEmbeddings (#22031) 2024-06-05 10:18:11 -04:00
example_selectors
graphs infra: rm unused # noqa violations (#22049) 2024-05-22 15:21:08 -07:00
indexes
llms community: fix huggingface deprecations (#22522) 2024-06-05 04:13:13 +00:00
memory community[minor]: Add Zep Cloud components + docs + examples (#21671) 2024-05-27 12:50:13 -07:00
output_parsers infra: rm unused # noqa violations (#22049) 2024-05-22 15:21:08 -07:00
query_constructors
retrievers community[minor]: Add Zep Cloud components + docs + examples (#21671) 2024-05-27 12:50:13 -07:00
storage doc: Add doc for CassandraByteStore (#22126) 2024-05-24 10:57:55 -04:00
tools community[patch]: Adding HEADER to the list of supported locations (#21946) 2024-05-22 22:47:56 +00:00
utilities community: fix missing apify_api_token field in ApifyWrapper (#22421) 2024-06-03 14:32:57 +00:00
utils community[minor]: add user agent for web scraping loaders (#22480) 2024-06-05 15:20:34 +00:00
vectorstores community[minor]: Add support for metadata indexing policy in Cassandra vector store (#22548) 2024-06-05 11:23:26 -04:00
__init__.py
cache.py
py.typed