langchain/docs/extras/modules/data_connection/retrievers/integrations
Jeff Huber 2139d0197e
upgrade chroma to 0.4.0 (#7749)
** This should land Monday the 17th ** 

Chroma is upgrading from `0.3.29` to `0.4.0`. `0.4.0` is easier to
build, more durable, faster, smaller, and more extensible. This comes
with a few changes:

1. A simplified and improved client setup. Instead of having to remember
weird settings, users can just do `EphemeralClient`, `PersistentClient`
or `HttpClient` (the underlying direct `Client` implementation is also
still accessible)

2. We migrated data stores away from `duckdb` and `clickhouse`. This
changes the api for the `PersistentClient` that used to reference
`chroma_db_impl="duckdb+parquet"`. Now we simply set
`is_persistent=true`. `is_persistent` is set for you to `true` if you
use `PersistentClient`.

3. Because we migrated away from `duckdb` and `clickhouse` - this also
means that users need to migrate their data into the new layout and
schema. Chroma is committed to providing extension notification and
tooling around any schema and data migrations (for example - this PR!).

After upgrading to `0.4.0` - if users try to access their data that was
stored in the previous regime, the system will throw an `Exception` and
instruct them how to use the migration assistant to migrate their data.
The migration assitant is a pip installable CLI: `pip install
chroma_migrate`. And is runnable by calling `chroma_migrate`

-- TODO ADD here is a short video demonstrating how it works. 

Please reference the readme at
[chroma-core/chroma-migrate](https://github.com/chroma-core/chroma-migrate)
to see a full write-up of our philosophy on migrations as well as more
details about this particular migration.

Please direct any users facing issues upgrading to our Discord channel
called
[#get-help](https://discord.com/channels/1073293645303795742/1129200523111841883).
We have also created a [email
listserv](https://airtable.com/shrHaErIs1j9F97BE) to notify developers
directly in the future about breaking changes.

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-07-18 17:20:54 -07:00
..
amazon_kendra_retriever.ipynb Fix make docs_build and related scripts (#7276) 2023-07-11 22:05:14 -04:00
arxiv.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
azure_cognitive_search.ipynb [Small upgrade] Allow document limit in AzureCognitiveSearchRetriever (#7690) 2023-07-13 23:04:40 -04:00
bm25.ipynb add bm25 module (#7779) 2023-07-17 07:30:17 -07:00
chaindesk.ipynb Rename Databerry to Chaindesk (#7022) 2023-07-07 17:28:04 -04:00
chatgpt-plugin.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
cohere-reranker.ipynb docs/fix links (#6498) 2023-06-20 14:06:50 -07:00
docarray_retriever.ipynb Fix make docs_build and related scripts (#7276) 2023-07-11 22:05:14 -04:00
elastic_search_bm25.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
knn.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
merger_retriever.ipynb upgrade chroma to 0.4.0 (#7749) 2023-07-18 17:20:54 -07:00
metal.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
pinecone_hybrid_search.ipynb Fixed a typo in pinecone_hybrid_search.ipynb (#7627) 2023-07-12 23:46:41 -04:00
pubmed.ipynb docs retrievers fixes (#6299) 2023-06-19 22:04:35 -07:00
svm.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
tf_idf.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
vespa.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
weaviate-hybrid.ipynb WeaviateHybridSearchRetriever option to enable scores. (#7861) 2023-07-18 07:57:17 -07:00
wikipedia.ipynb Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
zep_memorystore.ipynb Fix make docs_build and related scripts (#7276) 2023-07-11 22:05:14 -04:00