mirror of
https://github.com/hwchase17/langchain
synced 2024-10-29 17:07:25 +00:00
d7f807b71f
# Add AzureCognitiveServicesToolkit to call Azure Cognitive Services API: achieve some multimodal capabilities This PR adds a toolkit named AzureCognitiveServicesToolkit which bundles the following tools: - AzureCogsImageAnalysisTool: calls Azure Cognitive Services image analysis API to extract caption, objects, tags, and text from images. - AzureCogsFormRecognizerTool: calls Azure Cognitive Services form recognizer API to extract text, tables, and key-value pairs from documents. - AzureCogsSpeech2TextTool: calls Azure Cognitive Services speech to text API to transcribe speech to text. - AzureCogsText2SpeechTool: calls Azure Cognitive Services text to speech API to synthesize text to speech. This toolkit can be used to process image, document, and audio inputs. --------- Co-authored-by: Dev 2049 <dev.dev2049@gmail.com> |
||
---|---|---|
.. | ||
azure_cognitive_services.ipynb | ||
csv.ipynb | ||
gmail.ipynb | ||
jira.ipynb | ||
json.ipynb | ||
openai_openapi.yml | ||
openapi_nla.ipynb | ||
openapi.ipynb | ||
pandas.ipynb | ||
playwright.ipynb | ||
powerbi.ipynb | ||
python.ipynb | ||
spark_sql.ipynb | ||
spark.ipynb | ||
sql_database.ipynb | ||
titanic.csv | ||
vectorstore.ipynb |