forked from Archives/langchain
1f11f80641
# docs cleaning
Changed docs to consistent format (probably, we need an official doc integration template):
- ClearML - added product descriptions; changed title/headers
- Rebuff - added product descriptions; changed title/headers
- WhyLabs - added product descriptions; changed title/headers
- Docugami - changed title/headers/structure
- Airbyte - fixed title
- Wolfram Alpha - added descriptions, fixed title
- OpenWeatherMap - added product descriptions; changed title/headers
- Unstructured - changed description
## Who can review?
Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: @hwchase17 @dev2049
Airbyte
Airbyte is a data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. It has the largest catalog of ELT connectors to data warehouses and databases.
Installation and Setup
These instructions show how to load any source from Airbyte into a local JSON file that can be read in as a document.
Prerequisites: Have Docker Desktop installed.
Steps:
- Clone Airbyte from GitHub: `git clone https://github.com/airbytehq/airbyte.git`
- Switch into the Airbyte directory: `cd airbyte`
- Start Airbyte: `docker compose up`
- In your browser, visit http://localhost:8000. You will be asked for a username and password. By default, the username is `airbyte` and the password is `password`.
- Set up any source you wish.
- Set the destination to Local JSON with a specified destination path, for example `/json_data`. Set up a manual sync.
- Run the connection.
- To see what files were created, navigate to `file:///tmp/airbyte_local/`
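To check the result of the last step from a script instead of a browser, the output directory can be listed with the standard library. This is a minimal sketch, not part of Airbyte or LangChain: the helper name `list_airbyte_outputs` is hypothetical, and it assumes that a destination path of `/json_data` ends up under `/tmp/airbyte_local/json_data` on the host.

```python
from pathlib import Path


def list_airbyte_outputs(dest_path="json_data", base_dir="/tmp/airbyte_local"):
    """Return the files written under the Local JSON destination, sorted by path.

    Hypothetical helper. Assumes the destination path configured in Airbyte
    (e.g. "/json_data") is created inside base_dir on the host machine.
    """
    # Strip the leading slash so the destination path is joined under base_dir
    # rather than being treated as an absolute path.
    root = Path(base_dir) / dest_path.lstrip("/")
    if not root.is_dir():
        # Nothing synced yet, or the destination path is different.
        return []
    return sorted(p for p in root.rglob("*") if p.is_file())
```

Calling `list_airbyte_outputs("/json_data")` after a sync should return one path per file that the connection wrote; an empty list suggests the sync has not run or the destination path differs.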
Document Loader
See a usage example.
from langchain.document_loaders import AirbyteJSONLoader
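A short sketch of how the loader might be used once a sync has produced a file. The wrapper name `load_airbyte_json` is hypothetical; it assumes `AirbyteJSONLoader` takes the path of one output file and that `load()` returns a list of documents. The wrapper checks that the file exists before importing LangChain, so a wrong path fails with a clear error.

```python
from pathlib import Path


def load_airbyte_json(file_path):
    """Load one Airbyte Local JSON output file as LangChain documents.

    Hypothetical convenience wrapper around AirbyteJSONLoader.
    """
    path = Path(file_path)
    if not path.is_file():
        raise FileNotFoundError(f"No Airbyte output file at {path}")

    # Imported here so the path check above works even if langchain
    # is not installed in the current environment.
    from langchain.document_loaders import AirbyteJSONLoader

    loader = AirbyteJSONLoader(str(path))
    return loader.load()


# Example call; the file name below is a placeholder and depends on the
# stream you synced:
# docs = load_airbyte_json("/tmp/airbyte_local/json_data/<your_output_file>")
```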