langchain/docs/integrations/airbyte.md
Leonid Ganeline 1f11f80641
docs: cleaning (#5413)
# docs cleaning

Changed docs to consistent format (probably, we need an official doc
integration template):
- ClearML - added product descriptions; changed title/headers
- Rebuff  - added product descriptions; changed title/headers
- WhyLabs  - added product descriptions; changed title/headers
- Docugami - changed title/headers/structure
- Airbyte - fixed title
- Wolfram Alpha - added descriptions, fixed title
- OpenWeatherMap -  - added product descriptions; changed title/headers
- Unstructured - changed description

## Who can review?

Community members can review the PR once tests pass. Tag
maintainers/contributors who might be interested:

@hwchase17
@dev2049
2023-05-30 13:58:16 -07:00

1.2 KiB

Airbyte

Airbyte is a data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. It has the largest catalog of ELT connectors to data warehouses and databases.

Installation and Setup

This instruction shows how to load any source from Airbyte into a local JSON file that can be read in as a document.

Prerequisites: Have docker desktop installed.

Steps:

  1. Clone Airbyte from GitHub - git clone https://github.com/airbytehq/airbyte.git.
  2. Switch into Airbyte directory - cd airbyte.
  3. Start Airbyte - docker compose up.
  4. In your browser, just visit http://localhost:8000. You will be asked for a username and password. By default, that's username airbyte and password password.
  5. Setup any source you wish.
  6. Set destination as Local JSON, with specified destination path - lets say /json_data. Set up a manual sync.
  7. Run the connection.
  8. To see what files are created, navigate to: file:///tmp/airbyte_local/.

Document Loader

See a usage example.

from langchain.document_loaders import AirbyteJSONLoader