mirror of
https://github.com/hwchase17/langchain
synced 2024-11-13 19:10:52 +00:00
02f0a29293
- **Description:** Adding notebook to demonstrate visual RAG which uses both video scene description generated by open source vision models (ex. video-llama, video-llava etc.) as text embeddings and frames as image embeddings to perform vector similarity search using VDMS. - **Issue:** N/A - **Dependencies:** N/A
18 MiB
18 MiB
The file is too large to be shown.
View Raw