langchain

mirror of https://github.com/hwchase17/langchain synced 2024-10-29 17:07:25 +00:00

Author	SHA1	Message	Date
Lance Martin	16a27ab244	Add prompt hub for various use-cases (#9879 ) Use prompt hub in our use-case docs and guides.	2023-09-03 15:32:22 -07:00
Leonid Ganeline	a52fe9528e	docs: fixed title in `Bittensor` example (#9893 ) Fixed title in the `Bittensor` example. The old title brakes the sorted order of items in the navbar. Added some formatting.	2023-09-03 15:10:42 -07:00
seamusp	abd8681341	docs: chains & memory fixes (#9895 ) Various improvements to the Chains & Memory sections of the documentation including formatting, spelling, and grammar fixes to improve readability.	2023-09-03 15:06:20 -07:00
Josh White	bc8cceebf7	Extend DynamoDBChatMessageHistory to support composite keys (#9896 ) - Description: Adds two optional parameters to the DynamoDBChatMessageHistory class to enable users to pass in a name for their PrimaryKey, or a Key object itself to enable the use of composite keys, a common DynamoDB paradigm. [AWS DynamoDB Key docs](https://aws.amazon.com/blogs/database/choosing-the-right-dynamodb-partition-key/) - Issue: N/A - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: Josh White <josh@ctrlstack.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-09-03 15:05:16 -07:00
Blake (Yung Cher Ho)	f4bed8a04c	Takeoff baseurl support (#10091 ) ## Description This PR introduces a minor change to the TitanTakeoff integration. Instead of specifying a port on localhost, this PR will allow users to specify a baseURL instead. This will allow users to use the integration if they have TitanTakeoff deployed externally (not on localhost). This removes the hardcoded reference to localhost "http://localhost:{port}". ### Info about Titan Takeoff Titan Takeoff is an inference server created by [TitanML](https://www.titanml.co/) that allows you to deploy large language models locally on your hardware in a single command. Most generative model architectures are included, such as Falcon, Llama 2, GPT2, T5 and many more. Read more about Titan Takeoff here: - [Blog](https://medium.com/@TitanML/introducing-titan-takeoff-6c30e55a8e1e) - [Docs](https://docs.titanml.co/docs/titan-takeoff/getting-started) ### Dependencies No new dependencies are introduced. However, users will need to install the titan-iris package in their local environment and start the Titan Takeoff inferencing server in order to use the Titan Takeoff integration. Thanks for your help and please let me know if you have any questions. cc: @hwchase17 @baskaryan --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2023-09-03 14:45:59 -07:00
Pu Cao	05664a6f20	docs(text_splitter): update document of character splitter with tiktoken (#10001 ) The current document has not mentioned that splits larger than chunk size would happen. I update the related document and explain why it happens and how to solve it. related issue #1349 #3838 #2140	2023-09-03 14:45:45 -07:00
Leonid Ganeline	2221194450	`Yahoo Finance News` tool (#10014 ) Added: - the `Yahoo Finance News` tool - Ut-s - An example	2023-09-03 14:43:57 -07:00
Ismail Pelaseyed	5c3e9c9083	Add example of running Q&A over structured data using the `Airbyte` loaders and `pandas` (#10069 ) - Description: Added example of running Q&A over structured data using the `Airbyte` loaders and `pandas` - Dependencies: any dependencies required for this change, - Tag maintainer: @hwchase17 - Twitter handle: @pelaseyed	2023-09-03 14:32:33 -07:00
Lars von Wedel	6d82503eb1	Add parser and loader for Azure document intelligence service. (#10136 ) Hi, this PR contains loader / parser for Azure Document intelligence which is a ML-based service to ingest arbitrary PDFs / images, even if scanned. The loader generates Documents by pages of the original document. This is my first contribution to LangChain. Unfortunately I could not find the correct place for test cases. Happy to add one if you can point me to the location, but as this is a cloud-based service, a test would require network access and credentials - so might be of limited help. Dependencies: The needed dependency was already part of pyproject.toml, no change. Twitter: feel free to mention @LarsAC on the announcement	2023-09-03 14:25:39 -07:00
Harrison Chase	4abe85be57	Harrison/string inplace (#10153 ) Co-authored-by: Wrick Talukdar <wrick.talukdar@gmail.com> Co-authored-by: Anjan Biswas <anjanavb@amazon.com> Co-authored-by: Jha <nikjha@amazon.com> Co-authored-by: Lucky-Lance <77819606+Lucky-Lance@users.noreply.github.com> Co-authored-by: 陆徐东 <luxudong@MacBook-Pro.local>	2023-09-03 14:25:29 -07:00
Nino Risteski	0c0a7d19eb	Update openai_multi_functions_agent.ipynb (#10144 ) typo fix	2023-09-03 13:00:48 -07:00
Nino Risteski	f968b86652	Update apis.ipynb (#10145 ) few typo fixes	2023-09-03 13:00:22 -07:00
Guy Korland	765ef3b486	Add FalkorDB to imports (#10151 )	2023-09-03 12:52:28 -07:00
Nino Risteski	746c6ff9c3	Update index.mdx (#10142 ) fixed typos	2023-09-02 22:36:26 -07:00
Nino Risteski	fdebd3e02f	Update chat_vector_db.mdx (#10141 ) typo fix	2023-09-02 22:36:09 -07:00
Leonid Kuligin	30239b3025	added support for inference from Model Garden (#9367 ) #8850 --------- Co-authored-by: Leonid Kuligin <kuligin@google.com>	2023-09-01 15:58:21 -07:00
Leonid Ganeline	54a8df87b9	📖 docs: fixed `integration/llms` navbar (#9277 ) Fixed navbar: - renamed several files, so ToC is sorted correctly - made ToC items consistent: formatted several Titles - added several links - reformatted several docs to a consistent format - renamed several files (removed `_example` suffix) - added renamed files to the `docs/docs_skeleton/vercel.json`	2023-09-01 15:30:37 -07:00
Bagatur	b485c3048b	rm base64 images from docs (#10110 ) Causing problems indexing docs and notebook images don't render after markdown conversion anyways	2023-09-01 15:15:12 -07:00
William FH	f2fc4173c3	Update redirects meta tags (#10109 )	2023-09-01 15:14:34 -07:00
Leonid Ganeline	37e435bd00	docs: `youtube_search` tool example update (#9958 ) Added a link to source package; updated title, description.	2023-09-01 13:32:27 -07:00
Leonid Ganeline	3b8ee74e38	docs: `google-drive-tool` example fix (#10000 ) This notebook was mistakenly placed in the `toolkits` folder and appears within `Agents & Toolkits` menu. But it should be in `Tools`. Moved example into `tools/`; updated title to consistent format.	2023-09-01 13:31:26 -07:00
seamusp	afd96b2460	docs: agents & callbacks fixes (#10066 ) Various improvements to the Agents & Callbacks sections of the documentation including formatting, spelling, and grammar fixes to improve readability.	2023-09-01 13:28:55 -07:00
Benjamin Matson	58d7d86e51	feat: add bedrock chat model (#8017 ) Replace this comment with: - Description: Add Bedrock implementation of Anthropic Claude for Chat - Tag maintainer: @hwchase17, @baskaryan - Twitter handle: @bwmatson --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-09-01 13:16:57 -07:00
KyrianC	491089754d	EdenAI LLM update. Add models name option (#8963 ) This PR follows the Eden AI (LLM + embeddings) integration. #8633 We added an optional parameter to choose different AI models for providers (like 'text-bison' for provider 'google', 'text-davinci-003' for provider 'openai', etc.). Usage: ```python llm = EdenAI( feature="text", provider="google", params={ "model": "text-bison", # new "temperature": 0.2, "max_tokens": 250, }, ) ``` You can also change the provider + model after initialization ```python llm = EdenAI( feature="text", provider="google", params={ "temperature": 0.2, "max_tokens": 250, }, ) prompt = """ hi """ llm(prompt, providers='openai', model='text-davinci-003') # change provider & model ``` The jupyter notebook as been updated with an example well. Ping: @hwchase17, @baskaryan --------- Co-authored-by: RedhaWassim <rwasssim@gmail.com> Co-authored-by: sam <melaine.samy@gmail.com>	2023-09-01 12:11:33 -07:00
Bagatur	71c418725f	index rename delete_mode -> cleanup (#10103 )	2023-09-01 11:12:10 -07:00
Bagatur	b927277809	Bagatur/eden type 2 (#10102 )	2023-09-01 10:27:27 -07:00
Bagatur	d4380339c1	eden tool nb nit (#10101 )	2023-09-01 10:16:39 -07:00
KyrianC	c7a5504789	Add EdenAI Tools (#9764 ) This PR follows the Eden AI (LLM + embeddings) integration. #8633 We added different Tools to empower agents with new capabilities : - text: explicit content detection - image: explicit content detection - image: object detection - OCR: invoice parsing - OCR: ID parsing - audio: speech to text - audio: text to speech We plan to add more in the future (like translation, language detection, + others). Usage: ```python llm=EdenAI(feature="text",provider="openai", params={"temperature" : 0.2,"max_tokens" : 250}) tools = [ EdenAiTextModerationTool(providers=["openai"],language="en"), EdenAiObjectDetectionTool(providers=["google","api4ai"]), EdenAiTextToSpeechTool(providers=["amazon"],language="en",voice="MALE"), EdenAiExplicitImageTool(providers=["amazon","google"]), EdenAiSpeechToTextTool(providers=["amazon"]), EdenAiParsingIDTool(providers=["amazon","klippa"],language="en"), EdenAiParsingInvoiceTool(providers=["amazon","google"],language="en"), ] agent_chain = initialize_agent( tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True, return_intermediate_steps=True, ) result = agent_chain(""" i have this text : 'i want to slap you' first : i want to know if this text contains explicit content or not . second : if it does contain explicit content i want to know what is the explicit content in this text, third : i want to make the text into speech . if there is URL in the observations , you will always put it in the output (final answer) . """) ``` output: > Entering new AgentExecutor chain... > I need to extract the information from the ID and then convert it to text and then to speech > Action: edenai_identity_parsing > Action Input: "https://www.citizencard.com/images/citizencard-uk-id-card-2023.jpg" > Observation: last_name : > value : ANGELA > given_names : > value : GREENE > birth_place : > birth_date : > value : 2000-11-09 > issuance_date : > expire_date : > document_id : > issuing_state : > address : > age : > country : > document_type : > value : DRIVER LICENSE FRONT > gender : > image_id : > image_signature : > mrz : > nationality : > Thought: I now need to convert the information to text and then to speech > Action: edenai_text_to_speech > Action Input: "Welcome Angela Greene!" > Observation: https://d14uq1pz7dzsdq.cloudfront.net/0c494819-0bbc-4433-bfa4-6e99bd9747ea_.mp3?Expires=1693316851&Signature=YcMoVQgPuIMEOuSpFuvhkFM8JoBMSoGMcZb7MVWdqw7JEf5~67q9dEI90o5todE5mYXB5zSYoib6rGrmfBl4Rn5~yqDwZ~Tmc24K75zpQZIEyt5~ZSnHuXy4IFWGmlIVuGYVGMGKxTGNeCRNUXDhT6TXGZlr4mwa79Ei1YT7KcNyc1dsTrYB96LphnsqOERx4X9J9XriSwxn70X8oUPFfQmLcitr-syDhiwd9Wdpg6J5yHAJjf657u7Z1lFTBMoXGBuw1VYmyno-3TAiPeUcVlQXPueJ-ymZXmwaITmGOfH7HipZngZBziofRAFdhMYbIjYhegu5jS7TxHwRuox32A__&Key-Pair-Id=K1F55BTI9AHGIK > Thought: I now know the final answer > Final Answer: https://d14uq1pz7dzsdq.cloudfront.net/0c494819-0bbc-4433-bfa4-6e99bd9747ea_.mp3?Expires=1693316851&Signature=YcMoVQgPuIMEOuSpFuvhkFM8JoBMSoGMcZb7MVWdqw7JEf5~67q9dEI90o5todE5mYXB5zSYoib6rGrmfBl4Rn5~yqDwZ~Tmc24K75zpQZIEyt5~ZSnHuXy4IFWGmlIVuGYVGMGKxTGNeCRNUXDhT6TXGZlr4mwa79Ei1YT7KcNyc1dsTrYB96LphnsqOERx4X9J9XriSwxn70X8oUPFfQmLcitr-syDhiwd9Wdpg6J5y > > Finished chain. Other examples are available in the jupyter notebook. This PR is made in parallel with EdenAI LLM update #8963 I apologize for the messy PR. While working in implementing Tools we realized there was a few problems we needed to fix on LLM as well. Ping: @hwchase17, @baskaryan --------- Co-authored-by: RedhaWassim <rwasssim@gmail.com>	2023-09-01 08:26:56 -07:00
Bagatur	5f1c67b47c	Mv LCEL docs up a level (#10073 )	2023-09-01 08:20:55 -07:00
Harrison Chase	ad9e242a7a	add snippet for max concurrency (#9892 )	2023-08-31 16:52:28 -07:00
Stefano Lottini	c710c7303f	fix wrong import line in cassandra doc page for vector store (#10041 ) This fixes the exampe import line in the general "cassandra" doc page mdx file. (it was erroneously a copy of the chat message history import statement found below).	2023-08-31 16:05:46 -07:00
Jon Bennion	cc6a20d3e6	updated prompt name in documentation for sequential chain (#10048 ) Description: updated the prompt name in a sequential chain example so that it is not overwritten by the same prompt name in the next chain (this is a sequential chain example) Issue: n/a Dependencies: none Tag maintainer: not known Twitter handle: not on twitter, feel free to use my git username for anything	2023-08-31 16:05:18 -07:00
Zizhong Zhang	641b71e2cd	refactor: rename to OpaquePrompts (#10013 ) Renamed to OpaquePrompts cc @baskaryan Thanks in advance!	2023-08-31 12:21:24 -07:00
Bagatur	8d66b00c73	Data anonymizer notebook nit (#10062 )	2023-08-31 10:58:13 -07:00
Bagatur	3efab8d3df	implement vectorstores by tencent vectordb (#9989 ) Hi there！ I'm excited to open this PR to add support for using 'Tencent Cloud VectorDB' as a vector store. Tencent Cloud VectorDB is a fully-managed, self-developed, enterprise-level distributed database service designed for storing, retrieving, and analyzing multi-dimensional vector data. The database supports multiple index types and similarity calculation methods, with a single index supporting vector scales up to 1 billion and capable of handling millions of QPS with millisecond-level query latency. Tencent Cloud VectorDB not only provides external knowledge bases for large models to improve their accuracy, but also has wide applications in AI fields such as recommendation systems, NLP services, computer vision, and intelligent customer service. The PR includes: Implementation of Vectorstore. I have read your [contributing guidelines](`72b7d76d79/.github/CONTRIBUTING.md`). And I have passed the tests below make format make lint make coverage make test	2023-08-31 00:48:25 -07:00
Bagatur	b1644bc9ad	cr	2023-08-31 00:43:34 -07:00
Cameron Vetter	e37d51cab6	fix scoring profile example (#10016 ) - Description: A change in the documentation example for Azure Cognitive Vector Search with Scoring Profile so the example works as written - Issue: #10015 - Dependencies: None - Tag maintainer: @baskaryan @ruoccofabrizio - Twitter handle: @poshporcupine	2023-08-31 00:35:06 -07:00
Hyeokjun seo	e2e05ad89e	Fix Typo : `openai_api_key` -> `serpapi_api_key` (#10020 ) Fixed typo in the comments Notebook. (which says `openai_api_key` for SerpAPI)	2023-08-31 00:33:13 -07:00
Tomaz Bratanic	f2e8399cc8	Fix link in Neo4j provider page (#10023 )	2023-08-31 00:32:42 -07:00
Bagatur	7fa82900cb	guides docs nits (#10005 )	2023-08-30 11:07:42 -07:00
Bagatur	2f03e71e67	rename local llm guide (#10004 )	2023-08-30 10:52:46 -07:00
Bagatur	781f274d19	make privacy guide section (#10003 )	2023-08-30 10:49:20 -07:00
maks-operlejn-ds	a8f804a618	Add data anonymizer (#9863 ) ### Description The feature for anonymizing data has been implemented. In order to protect private data, such as when querying external APIs (OpenAI), it is worth pseudonymizing sensitive data to maintain full privacy. Anonynization consists of two steps: 1. Identification: Identify all data fields that contain personally identifiable information (PII). 2. Replacement: Replace all PIIs with pseudo values or codes that do not reveal any personal information about the individual but can be used for reference. We're not using regular encryption, because the language model won't be able to understand the meaning or context of the encrypted data. We use Microsoft Presidio together with Faker framework for anonymization purposes because of the wide range of functionalities they provide. The full implementation is available in `PresidioAnonymizer`. ### Future works - deanonymization - add the ability to reverse anonymization. For example, the workflow could look like this: `anonymize -> LLMChain -> deanonymize`. By doing this, we will retain anonymity in requests to, for example, OpenAI, and then be able restore the original data. - instance anonymization - at this point, each occurrence of PII is treated as a separate entity and separately anonymized. Therefore, two occurrences of the name John Doe in the text will be changed to two different names. It is therefore worth introducing support for full instance detection, so that repeated occurrences are treated as a single object. ### Twitter handle @deepsense_ai / @MaksOpp --------- Co-authored-by: MaksOpp <maks.operlejn@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-08-30 10:39:44 -07:00
Bagatur	98cce7dcd3	update moderation docs (#10002 )	2023-08-30 10:34:25 -07:00
Christophe Bornet	9870bfb9cd	Add bucket and object key to metadata in S3 loader (#9317 ) - Description: this PR adds `s3_object_key` and `s3_bucket` to the doc metadata when loading an S3 file. This is particularly useful when using `S3DirectoryLoader` to remove the files from the dir once they have been processed (getting the object keys from the metadata `source` field seems brittle) - Dependencies: N/A - Tag maintainer: ? - Twitter handle: _cbornet --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-08-30 11:03:24 -04:00
Guy Korland	24c0b01c38	Extend the FalkorDB QA demo (#9992 ) - Description: Extend the FalkorDB QA demo - Tag maintainer: @baskaryan	2023-08-30 10:13:18 -04:00
wlleiiwang	8c4e29240c	implement vectorstores by tencent vectordb	2023-08-30 16:40:58 +08:00
Leonid Ganeline	d03d6f6fd9	Merge branch 'master' into docs-tools-menu	2023-08-29 15:57:25 -07:00
Bagatur	8fb0a9594c	Add LLMonitor Callback Handler Integration - open-source observability & analytics (#9870 ) Adds support for [llmonitor](https://llmonitor.com) callbacks. It enables: - Requests tracking / logging / analytics - Error debugging - Cost analytics - User tracking Let me know if anythings neds to be changed for merge. Thank you!	2023-08-29 15:49:01 -07:00
leo-gan	8c1678a8c7	Updated titles, descriptions.	2023-08-29 15:42:28 -07:00

1 2 3 4 5 ...

1920 Commits