mirror of
https://github.com/hwchase17/langchain
synced 2024-10-29 17:07:25 +00:00
d7f807b71f
# Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API: achieve some multimodal capabilities This PR adds a toolkit named AzureCognitiveServicesToolkit which bundles the following tools: - AzureCogsImageAnalysisTool: calls Azure Cognitive Services image analysis API to extract caption, objects, tags, and text from images. - AzureCogsFormRecognizerTool: calls Azure Cognitive Services form recognizer API to extract text, tables, and key-value pairs from documents. - AzureCogsSpeech2TextTool: calls Azure Cognitive Services speech to text API to transcribe speech to text. - AzureCogsText2SpeechTool: calls Azure Cognitive Services text to speech API to synthesize text to speech. This toolkit can be used to process image, document, and audio inputs. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com> |
||
---|---|---|
.. | ||
file_management | ||
openapi | ||
powerbi | ||
python | ||
requests | ||
shell | ||
__init__.py | ||
test_base.py | ||
test_exported.py | ||
test_json.py | ||
test_public_api.py | ||
test_signatures.py | ||
test_zapier.py |