forked from Archives/langchain
d7f807b71f
# Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API: achieve some multimodal capabilities This PR adds a toolkit named AzureCognitiveServicesToolkit which bundles the following tools: - AzureCogsImageAnalysisTool: calls Azure Cognitive Services image analysis API to extract caption, objects, tags, and text from images. - AzureCogsFormRecognizerTool: calls Azure Cognitive Services form recognizer API to extract text, tables, and key-value pairs from documents. - AzureCogsSpeech2TextTool: calls Azure Cognitive Services speech to text API to transcribe speech to text. - AzureCogsText2SpeechTool: calls Azure Cognitive Services text to speech API to synthesize text to speech. This toolkit can be used to process image, document, and audio inputs. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com> |
||
---|---|---|
.. | ||
agent_executors/examples | ||
agents | ||
toolkits/examples | ||
tools | ||
agent_executors.rst | ||
agents.rst | ||
getting_started.ipynb | ||
how_to_guides.rst | ||
plan_and_execute.ipynb | ||
streaming_stdout_final_only.ipynb | ||
toolkits.rst | ||
tools.rst |