# Apify

This page covers how to use [Apify](https://apify.com) within LangChain.

## Overview

Apify is a cloud platform for web scraping and data extraction.
It provides an [ecosystem](https://apify.com/store) of more than a thousand
ready-made apps called *Actors* for various scraping, crawling, and extraction use cases.

[![Apify Actors](/img/ApifyActors.png)](https://apify.com/store)

This integration enables you to run Actors on the Apify platform and load their results into LangChain,
so you can feed your vector indexes with documents and data from the web, for example to generate answers
from documentation websites, blogs, or knowledge bases.

## Installation and Setup

- Install the Apify API client for Python with `pip install apify-client`.
- Get your [Apify API token](https://console.apify.com/account/integrations) and either set it as
  an environment variable (`APIFY_API_TOKEN`) or pass it to the `ApifyWrapper` as `apify_api_token` in the constructor.

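The two ways of supplying the token can be sketched roughly as follows. `build_wrapper_kwargs` and `make_wrapper` are hypothetical helpers written for illustration, not part of LangChain or the Apify client. The sketch assumes that an explicitly passed token should win over the environment variable, and it defers the LangChain import so the token logic can be read and run on its own.

```python
import os


def build_wrapper_kwargs(explicit_token=None):
    """Hypothetical helper: choose constructor kwargs for ApifyWrapper.

    If a token is passed explicitly, forward it as `apify_api_token`.
    Otherwise return no kwargs and rely on the APIFY_API_TOKEN
    environment variable, failing early if it is not set.
    """
    if explicit_token:
        return {"apify_api_token": explicit_token}
    if not os.environ.get("APIFY_API_TOKEN"):
        raise ValueError("Set APIFY_API_TOKEN or pass a token explicitly.")
    return {}


def make_wrapper(explicit_token=None):
    # Imported lazily: this line needs `langchain` and `apify-client` installed.
    from langchain.utilities import ApifyWrapper

    return ApifyWrapper(**build_wrapper_kwargs(explicit_token))
```

Calling `make_wrapper()` with no arguments relies on the environment variable, while `make_wrapper("my-token")` passes the token through the constructor.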
## Wrappers

### Utility

You can use the `ApifyWrapper` to run Actors on the Apify platform.

```python
from langchain.utilities import ApifyWrapper
```

For a more detailed walkthrough of this wrapper, see [this notebook](/docs/modules/agents/tools/integrations/apify.html).

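As a minimal sketch of running an Actor through the wrapper, the example below crawls a list of start URLs and returns the results as documents. It assumes the `apify/website-content-crawler` Actor, whose input is taken to use a `startUrls` list and whose result items are taken to carry `text` and `url` fields; other Actors define their own input and output shapes. `build_run_input` and `crawl_to_documents` are hypothetical helper names.

```python
def build_run_input(start_urls):
    """Build the Actor input: one {"url": ...} entry per start URL.

    The "startUrls" key is an assumption about the
    website-content-crawler Actor's input schema.
    """
    return {"startUrls": [{"url": url} for url in start_urls]}


def crawl_to_documents(start_urls):
    # Imported lazily: needs `langchain` and `apify-client` installed,
    # plus an Apify API token (for example via APIFY_API_TOKEN).
    from langchain.docstore.document import Document
    from langchain.utilities import ApifyWrapper

    apify = ApifyWrapper()
    # call_actor runs the Actor and returns a loader over the run's dataset.
    loader = apify.call_actor(
        actor_id="apify/website-content-crawler",
        run_input=build_run_input(start_urls),
        # Assumes each result item has "text" and "url" fields.
        dataset_mapping_function=lambda item: Document(
            page_content=item["text"] or "", metadata={"source": item["url"]}
        ),
    )
    return loader.load()
```

The mapping function is where Actor-specific output is translated into LangChain documents, so it is the part most likely to need adjusting when switching to a different Actor.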
### Loader

You can also use the `ApifyDatasetLoader` to get data from an Apify dataset.

```python
from langchain.document_loaders import ApifyDatasetLoader
```

For a more detailed walkthrough of this loader, see [this notebook](/docs/modules/data_connection/document_loaders/integrations/apify_dataset.html).
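Loading an existing dataset might look like the sketch below. The dataset ID is a placeholder you would supply yourself, and the item keys `text` and `url` are assumptions about how the dataset was produced. `item_to_fields` and `load_dataset` are hypothetical helpers; only `ApifyDatasetLoader` and `Document` come from LangChain.

```python
def item_to_fields(item):
    """Hypothetical helper: map one dataset item to Document fields.

    Assumes items carry "text" and "url" keys; missing or empty
    values fall back to empty strings.
    """
    return {
        "page_content": item.get("text") or "",
        "metadata": {"source": item.get("url", "")},
    }


def load_dataset(dataset_id):
    # Imported lazily: needs `langchain` and `apify-client` installed.
    from langchain.docstore.document import Document
    from langchain.document_loaders import ApifyDatasetLoader

    loader = ApifyDatasetLoader(
        dataset_id=dataset_id,
        # Each dataset item (a dict) is converted into one Document.
        dataset_mapping_function=lambda item: Document(**item_to_fields(item)),
    )
    return loader.load()
```

Keeping the field mapping in a plain function makes it easy to check against a sample item before running it over a whole dataset.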